This website requires JavaScript.
Explore
Help
Sign In
lzy
/
sci-gui-agent-benchmark
Watch
1
Star
0
Fork
0
You've already forked sci-gui-agent-benchmark
Code
Issues
Pull Requests
Actions
Packages
Projects
Releases
Wiki
Activity
1,382
Commits
3
Branches
0
Tags
3890ee5fc32aa537b3aa7d57037e0a3bea8aec4f
Commit Graph
3 Commits
Author
SHA1
Message
Date
cui0711
3890ee5fc3
fix(vllm_eval): add image compression to prevent 413 error with large max_steps
2026-02-09 14:24:59 +08:00
cui0711
9bc54c0a66
feat(vllm_eval): add structured JSON response format with step analysis
2026-02-09 13:58:14 +08:00
cui0711
dd58a1de03
feat(evaluator): add vision-language model evaluator
2026-02-05 16:52:35 +08:00