Files
Search-R1/scripts/nq_hotpotqa/README.md
PeterGriffinJin 584ce9deb5 add paper scripts
2025-03-13 13:57:47 +00:00

425 B

Reproduce the paper results

Download the dataset

huggingface-cli download --repo-type dataset PeterJinGo/nq_hotpotqa_train --local-dir $WORK_DIR/data/hotpot_qa

Run PPO training

bash train_ppo.sh

Run GRPO training

bash train_ppo.sh

Run evaluation

bash evaluate.sh

You can change $BASE_MODEL to the path of the model you would loike to evaluate.