This website requires JavaScript.
Explore
Help
Sign In
tangger
/
Search-R1
Watch
2
Star
0
Fork
0
You've already forked Search-R1
Code
Issues
Pull Requests
Actions
Packages
Projects
Releases
Wiki
Activity
Files
5eccb5fa140a1431dbdcd1e3ef7d1ef4a345d21a
Search-R1
/
verl
/
trainer
History
PeterGriffinJin
0b26e614f7
fix proto bug
2025-04-04 02:54:21 +00:00
..
config
Initial commit
2025-02-28 15:16:19 +00:00
ppo
fix proto bug
2025-04-04 02:54:21 +00:00
__init__.py
Initial commit
2025-02-28 15:16:19 +00:00
fsdp_sft_trainer.py
Initial commit
2025-02-28 15:16:19 +00:00
main_eval.py
Initial commit
2025-02-28 15:16:19 +00:00
main_generation.py
Initial commit
2025-02-28 15:16:19 +00:00
main_ppo.py
fix reward bug
2025-03-13 19:18:56 +00:00
runtime_env.yaml
Initial commit
2025-02-28 15:16:19 +00:00