add paper scripts

2025-03-13 13:57:47 +00:00
parent 0ecaf6da76
commit 584ce9deb5
5 changed files with 270 additions and 5 deletions
--- a/scripts/nq_hotpotqa/README.md
+++ b/scripts/nq_hotpotqa/README.md
@@ -0,0 +1,26 @@
+
+## Reproduce the paper results
+
+### Download the dataset
+
+```bash
+huggingface-cli download --repo-type dataset PeterJinGo/nq_hotpotqa_train --local-dir $WORK_DIR/data/hotpot_qa
+```
+
+### Run PPO training
+```bash
+bash train_ppo.sh
+```
+
+
+### Run GRPO training
+```bash
+bash train_ppo.sh
+```
+
+### Run evaluation
+```bash
+bash evaluate.sh
+```
+
+You can change ```$BASE_MODEL``` to the path of the model you would loike to evaluate.