This website requires JavaScript.
Explore
Help
Sign In
lzy
/
sci-gui-agent-benchmark
Watch
1
Star
0
Fork
0
You've already forked sci-gui-agent-benchmark
Code
Issues
Pull Requests
Actions
Packages
Projects
Releases
Wiki
Activity
Files
afd5952e4429f1f0ac8e639c130605ef6b1010f0
sci-gui-agent-benchmark
/
evaluation_examples
/
examples
History
Danyang Zhang
afd5952e44
ver Oct3rd (
#349
)
...
updated a series of instructions to ask the agent not to do any unnecessary actions.
2025-10-04 00:13:29 +08:00
..
chrome
Refactor evaluator functions in JSON examples to use URL pattern matching. Update expected URL formats to regex patterns for better validation in chrome evaluation examples.
2025-10-01 19:20:06 +00:00
gimp
Update GIMP evaluation examples to replace local file paths with cloud file URLs for consistency and accessibility.
2025-10-01 09:54:52 +00:00
libreoffice_calc
ver Oct3rd (
#349
)
2025-10-04 00:13:29 +08:00
libreoffice_impress
Update instruction wording in LibreOffice Impress example to clarify text color change requirements. Address
https://github.com/xlang-ai/OSWorld/issues/324
2025-09-01 23:29:47 +08:00
libreoffice_writer
feat: standardize configuration fields across all evaluation examples
2025-07-16 13:45:34 +00:00
multi_apps
Add AutoGLM-OS agent (
#309
)
2025-08-17 12:08:40 +08:00
os
refactor: update command in JSON example to use placeholder for client password
2025-07-31 05:20:04 +00:00
thunderbird
Update 10a730d5-d414-4b40-b479-684bed1ae522.json
2025-07-24 15:44:52 +08:00
vlc
feat: standardize configuration fields across all evaluation examples
2025-07-16 13:45:34 +00:00
vs_code
feat: standardize configuration fields across all evaluation examples
2025-07-16 13:45:34 +00:00