add paper scripts

This commit is contained in:
PeterGriffinJin
2025-03-13 13:57:47 +00:00
parent 0ecaf6da76
commit 584ce9deb5
5 changed files with 270 additions and 5 deletions

View File

@@ -0,0 +1,26 @@
## Reproduce the paper results
### Download the dataset
```bash
huggingface-cli download --repo-type dataset PeterJinGo/nq_hotpotqa_train --local-dir $WORK_DIR/data/hotpot_qa
```
### Run PPO training
```bash
bash train_ppo.sh
```
### Run GRPO training
```bash
bash train_ppo.sh
```
### Run evaluation
```bash
bash evaluate.sh
```
You can change ```$BASE_MODEL``` to the path of the model you would loike to evaluate.