sci-gui-agent-benchmark/examples at afd5952e4429f1f0ac8e639c130605ef6b1010f0 - sci-gui-agent-benchmark - Git of MAIC

lzy/sci-gui-agent-benchmark

Files

History

Danyang Zhang afd5952e44 ver Oct3rd (#349 )

updated a series of instructions to ask the agent not to do any
unnecessary actions.

2025-10-04 00:13:29 +08:00

..

Refactor evaluator functions in JSON examples to use URL pattern matching. Update expected URL formats to regex patterns for better validation in chrome evaluation examples.

2025-10-01 19:20:06 +00:00

Update GIMP evaluation examples to replace local file paths with cloud file URLs for consistency and accessibility.

2025-10-01 09:54:52 +00:00

libreoffice_calc

ver Oct3rd (#349 )

2025-10-04 00:13:29 +08:00

libreoffice_impress

Update instruction wording in LibreOffice Impress example to clarify text color change requirements. Address https://github.com/xlang-ai/OSWorld/issues/324

2025-09-01 23:29:47 +08:00

libreoffice_writer

feat: standardize configuration fields across all evaluation examples

2025-07-16 13:45:34 +00:00

Add AutoGLM-OS agent (#309 )

2025-08-17 12:08:40 +08:00

refactor: update command in JSON example to use placeholder for client password

2025-07-31 05:20:04 +00:00

Update 10a730d5-d414-4b40-b479-684bed1ae522.json

2025-07-24 15:44:52 +08:00

feat: standardize configuration fields across all evaluation examples

2025-07-16 13:45:34 +00:00

feat: standardize configuration fields across all evaluation examples

2025-07-16 13:45:34 +00:00