This website requires JavaScript.
Explore
Help
Sign In
lzy
/
sci-gui-agent-benchmark
Watch
1
Star
0
Fork
0
You've already forked sci-gui-agent-benchmark
Code
Issues
Pull Requests
Actions
Packages
Projects
Releases
Wiki
Activity
Files
be24e77d93ed5e5a8fa6f27c7239a804e154ab79
sci-gui-agent-benchmark
/
desktop_env
History
cui0711
be24e77d93
feat(env): add eval_model parameter and result_dir support for vllm evaluation
2026-02-05 16:53:12 +08:00
..
controllers
Add Claude Sonnet 4.5 support and improve action handling (
#362
)
2025-11-14 13:54:32 +08:00
evaluators
feat(evaluator): add vision-language model evaluator
2026-02-05 16:52:35 +08:00
providers
Add Claude Sonnet 4.5 support and improve action handling (
#362
)
2025-11-14 13:54:32 +08:00
server
feat(server): add cross-platform support and improve screenshot handling
2026-01-30 16:27:49 +08:00
__init__.py
Refactor examples; Start to load examples into benchmark; vlc initialization
2023-12-25 00:24:13 +08:00
actions.py
Refactoring VMware Integration and Implementing AWS Support (
#44
)
2024-06-15 20:52:29 +08:00
desktop_env_os_symphony.py
fix(os_symphony_evaluation) (
#410
)
2026-01-04 15:56:51 +08:00
desktop_env.py
feat(env): add eval_model parameter and result_dir support for vllm evaluation
2026-02-05 16:53:12 +08:00