Commit Graph

32 Commits

Author SHA1 Message Date
Yuan Mengqi
27319ce1e3 fix password&resolution (#251)
* fix chrome

* fix: fix proxy setup

* feat&fix: add proxy support in setup and remove hardcoded proxy from example

* fix tasks

* fix chrome finished

* fix

* clean chrome_fix code

* clean chrome_fix code

* fix chrome 2888b4e6-5b47-4b57-8bf5-c73827890774

* fix multiapps

* fix chrome 2888b4e6-5b47-4b57-8bf5-c73827890774

* fix some multi_apps tasks

* fix some multi_apps tasks

* fix password&resolution

* fix password&resolution

---------

Co-authored-by: adlsdztony <zzl0712@connect.hku.hk>
2025-07-13 00:25:37 +08:00
yuanmengqi
a651b04e49 Update AWS AMI ID, enhance directory creation logic in file upload, modify osworld service configuration, and refine JSON evaluation examples for improved clarity and functionality. 2025-07-03 16:57:41 +00:00
ChenYXxxx
bdaf37e0e5 fix_os&gimp (#220)
* Update ec4e3f68-9ea4-4c18-a5c9-69f89d1178b3.json

* Update c288e301-e626-4b98-a1ab-159dcb162af5.json

* Update 3ce045a0-877b-42aa-8d2c-b4a863336ab8.json

* Update b3d4a89c-53f2-4d6b-8b6a-541fb5d205fa.json

* Update 2e6f678f-472d-4c55-99cc-8e7c5c402a71.json

Please batch process all images on the desktop by increasing their brightness to 50, instead of adjusting them individually.

* Update 5ca86c6f-f317-49d8-b6a7-b527541caae8.json

* Update a746add2-cab0-4740-ac36-c3769d9bfb46.json

* Update a746add2-cab0-4740-ac36-c3769d9bfb46.json

* Update 62f7fd55-0687-4a43-b6e1-3eda16fc6252.json

* Update d52d6308-ec58-42b7-a2c9-de80e4837b2b.json

* Update d16c99dc-2a1e-46f2-b350-d97c86c85c15.json

* Update d16c99dc-2a1e-46f2-b350-d97c86c85c15.json

* Update 58d3eeeb-e9d0-499f-962e-fd0db2a744d8.json
2025-07-03 16:59:05 +08:00
Tianbao Xie
0cc93543a8 Environment is_used flag; OS domain fix (#219)
* Refactor evaluator structure in LibreOffice Writer example JSON to support multiple expected and result files, enhancing evaluation flexibility.

* Update instance type to t3.large and add VNC access URL logging for allocated VMs, enhancing remote access capabilities.

* Update instance type to t3.large and add VNC access URL logging for allocated VMs, enhancing remote access capabilities.

* Update time format in get_vm_file function to include hours, minutes, and seconds for more precise file naming with time suffix.

* More delay for 936321ce-5236-426a-9a20-e0e3c5dc536f; support one more potential solutions.

* Enhance SetupController with configurable retry limit and improved error handling for file opening requests. Introduce new function to compare unique training records, and update logging for better debugging. Adjust JSON examples for evaluation to support multiple expected and result files.

* Clean debug code

* Enhance DesktopEnv to track environment usage for optimized snapshot management. Introduce is_environment_used flag to determine if a snapshot revert is necessary based on provider type. Update setup and step methods to mark environment usage appropriately. Add new execute_with_verification method in SetupController for command execution with result verification, improving reliability. Change AWS instance type to m5.large for better performance and update AMI ID for compatibility. Update file opening logic in main.py to handle both file paths and application commands more effectively.

---------

Co-authored-by: yuanmengqi <yuanmengqi@mail.ustc.edu.cn>
2025-06-28 00:45:53 +08:00
Timothyxxx
fb7bafb885 feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without 2025-06-05 18:46:53 +08:00
Timothyxxx
34748567a5 feat: Migrate OSWorld files to HuggingFace cache with comprehensive documentation
- Add detailed README for file cache repository
- Implement migration script with retry logic and browser simulation
- Support automatic file type detection and deduplication
- Ensure reliable hosting for OSWorld evaluation files
2025-05-28 04:29:37 +08:00
Timothyxxx
2f0f3f31aa Fix Duplicate ids; Remove unused JSON files across multiple applications 2025-02-10 15:49:54 +08:00
MillanK
983283a86a patch: minor bug fixes for evaluator and task configurations, documentation update (#121)
* fix: /cursor_position api return format fix

* chore: update README.md to remove deprecated command

* fix: add base score for evaluators and minor bug fixes

* fix: add base score for setup configurations

---------

Co-authored-by: Jiaqi Deng <jiaqideng@Jiaqis-MacBook-Pro.local>
2025-01-18 22:25:18 +08:00
Timothyxxx
25e808cc91 Fix known errors found from feedback (DBUS problems, pulseaudio start, one vlc example with error. typos) 2024-05-18 04:49:29 +08:00
Timothyxxx
792d8844c7 Fix examples, clean files, clean README 2024-02-25 00:39:38 +08:00
Timothyxxx
91bc795de1 Examine and load new batch of OS examples from NL2Bash 2024-02-22 00:04:02 +08:00
Timothyxxx
e1cf8da4e0 Fix the infeasible examples support 2024-02-21 21:22:12 +08:00
Jason Lee
e31e1dacde Merge branch 'xiaochuanli/addChromeExtensions' of github.com:xlang-ai/DesktopEnv into xiaochuanli/addChromeExtensions 2024-02-18 22:16:48 +08:00
Jason Lee
17cd897780 add new examples for chrome 2024-02-18 22:11:16 +08:00
Timothyxxx
543fa840f8 Update OS examples 2024-02-16 15:05:34 +08:00
Timothyxxx
1596770410 Add new os examples 2024-02-15 19:24:22 +08:00
Timothyxxx
3f59ff46dc Add infeasible support 2024-02-14 11:59:50 +08:00
Timothyxxx
66304b3bab Fix OS example 2024-02-07 00:20:44 +08:00
Timothyxxx
f162acbbe3 Fix 2 OS examples 2024-02-06 23:56:53 +08:00
BlankCheng
fe21c47533 Fix samples of Impress and OS 2024-02-01 16:27:24 +08:00
tsuky_chen
e9651b0a53 Update 7688b85f-87a4-4e4a-b2f8-f3d6c3f29b82.json 2024-01-31 19:34:20 +08:00
Tianbao Xie
0d135ced29 Merge pull request #3 from xlang-ai/os
Fix some examples in OS
2024-01-30 15:08:00 +08:00
Liu Yitao
8edd3b9aad Fix examples 2024-01-29 18:22:13 -05:00
Timothyxxx
909aa868f3 Improve on agent codes; add auto-running experiments code; Fix some examples 2024-01-27 19:47:47 +08:00
Timothyxxx
6952b45de4 Improve on agent and tasks configs 2024-01-26 23:30:04 +08:00
Timothyxxx
c346b4379d Make up one OS example 2024-01-26 20:33:24 +08:00
Liu Yitao
0fcdbf63d1 Update tasks 2024-01-26 04:53:41 -05:00
Timothyxxx
64594701ae Load OS Ubuntu examples batch 2 2024-01-26 00:15:04 +08:00
Timothyxxx
b9ae4174b1 Fix OS examples annotated by Yitao 2024-01-25 19:57:32 +08:00
Timothyxxx
0c34fccc15 Initialize Ubuntu OS examples 2024-01-25 16:18:10 +08:00
Liu Yitao
344e7db55c Update OS evals 2024-01-25 10:55:49 +08:00
Liu Yitao
93b4ff7d95 Update OS evals 2024-01-25 10:45:51 +08:00