选项平衡后的第一次试跑,约70%正确率
This commit is contained in:
3942
results/20250602_1706/qwen-max-2025-01-25.json
Normal file
3942
results/20250602_1706/qwen-max-2025-01-25.json
Normal file
File diff suppressed because it is too large
Load Diff
Reference in New Issue
Block a user