lzy/sci-gui-agent-benchmark
ba037841963d1159c2831379f0d1e40e8be3afa7
sci-gui-agent-benchmark/desktop_env
cui0711
ba03784196
fix(env): handle None result_getter for vllm_eval evaluator
2026-02-09 17:46:05 +08:00
controllers
Add Claude Sonnet 4.5 support and improve action handling (#362)
2025-11-14 13:54:32 +08:00
evaluators
fix(vllm_eval): add image compression to prevent 413 error with large max_steps
2026-02-09 14:24:59 +08:00
providers
Add Claude Sonnet 4.5 support and improve action handling (#362)
2025-11-14 13:54:32 +08:00
server
feat(server): add cross-platform support and improve screenshot handling
2026-01-30 16:27:49 +08:00
__init__.py
Refactor examples; Start to load examples into benchmark; vlc initialization
2023-12-25 00:24:13 +08:00
actions.py
Refactoring VMware Integration and Implementing AWS Support (#44)
2024-06-15 20:52:29 +08:00
desktop_env_os_symphony.py
fix(os_symphony_evaluation) (#410)
2026-01-04 15:56:51 +08:00
desktop_env.py
fix(env): handle None result_getter for vllm_eval evaluator
2026-02-09 17:46:05 +08:00