sci-gui-agent-benchmark/desktop_env at be24e77d93ed5e5a8fa6f27c7239a804e154ab79 - sci-gui-agent-benchmark - Git of MAIC

lzy/sci-gui-agent-benchmark

cui0711 be24e77d93 feat(env): add eval_model parameter and result_dir support for vllm evaluation

2026-02-05 16:53:12 +08:00

Add Claude Sonnet 4.5 support and improve action handling (#362)

2025-11-14 13:54:32 +08:00

feat(evaluator): add vision-language model evaluator

2026-02-05 16:52:35 +08:00

Add Claude Sonnet 4.5 support and improve action handling (#362)

2025-11-14 13:54:32 +08:00

feat(server): add cross-platform support and improve screenshot handling

2026-01-30 16:27:49 +08:00

__init__.py

Refactor examples; Start to load examples into benchmark; vlc initialization

2023-12-25 00:24:13 +08:00

actions.py

Refactoring VMware Integration and Implementing AWS Support (#44)

2024-06-15 20:52:29 +08:00

desktop_env_os_symphony.py

fix(os_symphony_evaluation) (#410)

2026-01-04 15:56:51 +08:00

desktop_env.py

feat(env): add eval_model parameter and result_dir support for vllm evaluation

2026-02-05 16:53:12 +08:00