Search-R1/scripts/nq_hotpotqa/README.md
PeterGriffinJin ff85cb7f1e fix file name bug
2025-03-13 14:42:21 +00:00

Reproduce the paper results

Download the dataset

huggingface-cli download --repo-type dataset PeterJinGo/nq_hotpotqa_train --local-dir $WORK_DIR/data/nq_hotpotqa_train
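
The download command above relies on $WORK_DIR, which this README does not define. A minimal sketch of setting it first, assuming it is not already exported elsewhere in your environment (the fallback to the current directory and the DATA_DIR helper variable are illustrative choices, not part of the original scripts):

```shell
# Assumption: WORK_DIR is not set by any other script; fall back to the current directory.
export WORK_DIR="${WORK_DIR:-$PWD}"

# Hypothetical helper variable holding the target path used by the download command.
DATA_DIR="$WORK_DIR/data/nq_hotpotqa_train"

# Create the target directory up front so a bad path fails here rather than mid-download.
mkdir -p "$DATA_DIR"
echo "Dataset will be stored in: $DATA_DIR"
```

Run this in the same shell session before the download command so that $WORK_DIR expands to a real path.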

Run PPO training

bash train_ppo.sh

Run GRPO training

bash train_grpo.sh

Run evaluation

bash evaluate.sh

You can change $BASE_MODEL to the path of the model you would like to evaluate.