Logo
Explore Help
Sign In
lzy/sci-gui-agent-benchmark
1
0
Fork 0
You've already forked sci-gui-agent-benchmark
Code Issues Pull Requests Actions Packages Projects Releases Wiki Activity
Files
afd5952e4429f1f0ac8e639c130605ef6b1010f0
sci-gui-agent-benchmark/evaluation_examples/examples
History
Danyang Zhang afd5952e44 ver Oct3rd (#349)
updated a series of instructions to ask the agent not to do any
unnecessary actions.
2025-10-04 00:13:29 +08:00
..
chrome
Refactor evaluator functions in JSON examples to use URL pattern matching. Update expected URL formats to regex patterns for better validation in chrome evaluation examples.
2025-10-01 19:20:06 +00:00
gimp
Update GIMP evaluation examples to replace local file paths with cloud file URLs for consistency and accessibility.
2025-10-01 09:54:52 +00:00
libreoffice_calc
ver Oct3rd (#349)
2025-10-04 00:13:29 +08:00
libreoffice_impress
Update instruction wording in LibreOffice Impress example to clarify text color change requirements. Address https://github.com/xlang-ai/OSWorld/issues/324
2025-09-01 23:29:47 +08:00
libreoffice_writer
feat: standardize configuration fields across all evaluation examples
2025-07-16 13:45:34 +00:00
multi_apps
Add AutoGLM-OS agent (#309)
2025-08-17 12:08:40 +08:00
os
refactor: update command in JSON example to use placeholder for client password
2025-07-31 05:20:04 +00:00
thunderbird
Update 10a730d5-d414-4b40-b479-684bed1ae522.json
2025-07-24 15:44:52 +08:00
vlc
feat: standardize configuration fields across all evaluation examples
2025-07-16 13:45:34 +00:00
vs_code
feat: standardize configuration fields across all evaluation examples
2025-07-16 13:45:34 +00:00
Powered by Gitea Version: 24.6.0 Page: 124ms Template: 9ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API