Search-R1/README.md at fb9940972ce9fa23ae54ec7fd6438b7e8295a327

PeterGriffinJin fb9940972c fix typo

2025-03-13 13:58:23 +00:00

Reproduce the paper results

Download the dataset:

huggingface-cli download --repo-type dataset PeterJinGo/nq_hotpotqa_train --local-dir $WORK_DIR/data/hotpot_qa
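The download command above expands `$WORK_DIR`, so that variable needs a value before the command is run. A minimal sketch of one way to prepare it, assuming you are free to pick any writable directory (the fallback to the current directory and the `DATA_DIR` helper variable are illustrative choices, not something the repository prescribes):

```shell
# Assumption: WORK_DIR is chosen by you; fall back to the current directory if unset.
export WORK_DIR="${WORK_DIR:-$PWD}"

# DATA_DIR is a hypothetical helper variable mirroring the --local-dir path used above.
DATA_DIR="$WORK_DIR/data/hotpot_qa"

# Create the target directory so the download has somewhere to write.
mkdir -p "$DATA_DIR"
echo "Dataset will be downloaded to: $DATA_DIR"
```

With `WORK_DIR` exported, the `huggingface-cli download` command can be run unchanged from the same shell session.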

Run PPO training:

bash train_ppo.sh

Run the evaluation:

bash evaluate.sh

You can change $BASE_MODEL to the path of the model you would like to evaluate.