Files
Search-R1/scripts/nq_hotpotqa
2025-04-09 19:38:29 +00:00
..
2025-04-09 19:38:29 +00:00
2025-04-09 19:38:29 +00:00
2025-03-31 12:58:04 +00:00
2025-03-13 13:57:47 +00:00
2025-03-13 14:42:21 +00:00

Reproduce the paper results

Download the dataset

huggingface-cli download --repo-type dataset PeterJinGo/nq_hotpotqa_train --local-dir $WORK_DIR/data/nq_hotpotqa_train

Run PPO training

bash train_ppo.sh

Run GRPO training

bash train_ppo.sh

Run evaluation

bash evaluate.sh

You can change $BASE_MODEL to the path of the model you would like to evaluate.