Commit Graph

5 Commits

Author SHA1 Message Date
PeterGriffinJin
95d16f4548 fix potential float bug 2025-03-27 16:21:04 +00:00
PeterGriffinJin
6272082a64 fix action status 2025-03-22 14:49:23 +00:00
PeterGriffinJin
83d10313be add action status 2025-03-19 22:19:27 +00:00
xiaobo-yang
32719b5119 Fix bugs related to loss mask, meta info, and response length
1. Construct the loss mask immediately after obtaining the observation to prevent encoding misalignment when converting back to tokens after text transformation.
2. Follow up on meta info to ensure that the test batch can apply do sample.
3. Remove the recording of info information for response length.
2025-03-14 14:25:40 +08:00
PeterGriffinJin
068516be64 Initial commit 2025-02-28 15:16:19 +00:00