update readme

This commit is contained in:
PeterGriffinJin
2025-02-28 20:53:31 +00:00
parent 91a452c21a
commit 5880c6e03c

View File

@@ -82,7 +82,7 @@ conda activate retriever
bash retrieval_launch.sh
```
(4) Run training with Qwen2.5-3b-Instruct.
(4) Run RL training (PPO) with Llama-3.2-3b-base.
```bash
conda activate searchr1
bash train_ppo.sh