Commit Graph

107 Commits

Author SHA1 Message Date
Yuan Mengqi
b2fb8b4222 fix chrome tasks (#230)
* fix chrome

* fix: fix proxy setup

* feat&fix: add proxy support in setup and remove hardcoded proxy from example

* fix tasks

* fix chrome finished

* fix

* clean chrome_fix code

* clean chrome_fix code

---------

Co-authored-by: adlsdztony <zzl0712@connect.hku.hk>
2025-07-03 21:32:41 +08:00
Tianbao Xie
4e11eafd1d Robust Evaluation, Blocking File Open, Grader Sensitivity, and LibreOffice Writer Fixes (#217)
* Refactor evaluator structure in LibreOffice Writer example JSON to support multiple expected and result files, enhancing evaluation flexibility.

* Update instance type to t3.large and add VNC access URL logging for allocated VMs, enhancing remote access capabilities.

* Update instance type to t3.large and add VNC access URL logging for allocated VMs, enhancing remote access capabilities.

* Update time format in get_vm_file function to include hours, minutes, and seconds for more precise file naming with time suffix.

* More delay for 936321ce-5236-426a-9a20-e0e3c5dc536f; support one more potential solutions.

* Enhance SetupController with configurable retry limit and improved error handling for file opening requests. Introduce new function to compare unique training records, and update logging for better debugging. Adjust JSON examples for evaluation to support multiple expected and result files.

* Clean debug code

---------

Co-authored-by: yuanmengqi <yuanmengqi@mail.ustc.edu.cn>
2025-06-16 21:37:19 +08:00
Xubin Ren
1d10514125 Fix Search Engine Detection Discrepancy in Chrome Evaluation (#172)
* Update bb5e4c0d-f964-439c-97b6-bdb9747de3f4.json

* Update __init__.py

* Update general.py
2025-04-10 17:24:50 +08:00
Timothyxxx
9c75df5dce Clean code; Refactor environment to pass screenshot content instead of path 2024-04-13 23:34:01 +08:00
Timothyxxx
d1e2b12b41 Fix GIMP bug; Speedup the environment, when there is not a11y tree needed, we can do no controller.get 2024-03-20 22:22:59 +08:00
Timothyxxx
0aae756538 Code clean 2024-03-14 12:54:10 +08:00
Jason Lee
775cef744f xiaochuan correct his bugs in multiapp examples, you can try it again now 2024-03-10 14:48:56 +08:00
Timothyxxx
62b3b2390d Fix bugs from merging 2024-03-08 23:09:11 +08:00
Tianbao Xie
f01153cadd Merge branch 'main' into xiaochuanli/addChromeExtensions 2024-03-08 20:45:49 +08:00
Tianbao Xie
4b841c199a Merge pull request #12 from xlang-ai/zhoujun/multi-app
Update multi-app examples
2024-03-08 20:41:14 +08:00
Jason Lee
62fd8feebb xiaochuan's multiapp examples 2024-03-08 19:24:15 +08:00
Timothyxxx
1af9d8911d Update multi-apps examples 2024-03-07 22:15:23 +08:00
tsuky_chen
e295430bcf Merge branch 'main' of https://github.com/xlang-ai/DesktopEnv 2024-03-07 01:25:37 +08:00
tsuky_chen
5b5475094e update multi apps 2024-03-07 01:24:36 +08:00
David Chang
a08f842666 Merge branch 'main' of github.com:ztjhz/DesktopEnv 2024-03-06 23:30:37 +08:00
David Chang
054e016aff ver Mar6thv3
new multi_app tasks and metrics
2024-03-06 23:29:01 +08:00
BlankCheng
7e41955eb7 Merge conflicts 2024-03-06 21:34:58 +08:00
BlankCheng
5ebd080237 Update multi-app examples 2024-03-06 21:33:38 +08:00
rhythmcao
da0dafc32c add multi-apps 5 examples by ruisheng 2024-03-06 2024-03-06 21:20:26 +08:00
tsuky_chen
69ef653a7c update multi apps 2024-03-05 22:46:56 +08:00
Timothyxxx
f4869e17af Update GUI game debugging multiple-apps examples and eval 2024-03-03 15:02:53 +08:00
David Chang
33ace6937b ver Feb28th
a new multi app task --- init a web extension project with web tool
2024-02-28 22:35:04 +08:00
Tianbao Xie
0a6b5b3f57 Merge branch 'main' into xiaochuanli/addChromeExtensions 2024-02-25 00:45:17 +08:00
Jason Lee
3244098664 finish the rest part of chrome examples and verify them on mac arm64 2024-02-24 21:57:01 +08:00
Timothyxxx
f812436ad3 Update loaded Chrome examples 2024-02-23 14:15:16 +08:00
Timothyxxx
e1cf8da4e0 Fix the infeasible examples support 2024-02-21 21:22:12 +08:00
Jason Lee
e31e1dacde Merge branch 'xiaochuanli/addChromeExtensions' of github.com:xlang-ai/DesktopEnv into xiaochuanli/addChromeExtensions 2024-02-18 22:16:48 +08:00
Jason Lee
17cd897780 add new examples for chrome 2024-02-18 22:11:16 +08:00
Timothyxxx
8dd62178be Fix Impress examples 2024-02-07 21:52:21 +08:00
rhythmcao
538b9928fe fix some problems in libreoffice writer 2024-02-02 02:23:25 +08:00
rhythmcao
fc15a33b70 finish multi-app examples 2024-02-01 00:53:31 +08:00
rhythmcao
4726ea7edb update multi-app examples 2024-01-30 19:24:29 +08:00
Timothyxxx
cb7643713e Add impress examples, format the import 2024-01-30 03:25:27 +08:00
Timothyxxx
20f48759fd Fix path in GIMP examples 2024-01-30 02:14:52 +08:00
BlankCheng
7d2d8c855e Merge main 2024-01-29 21:51:26 +08:00
BlankCheng
af61d776c4 Update GIMP getters and metrics 2024-01-29 21:47:12 +08:00
Timothyxxx
343813a29b Add impress examples; remove the auto-saving pyautogui commands change to libreoffice pre-setting 2024-01-29 21:34:58 +08:00
Timothyxxx
f9d9895541 Fix some errors found in impress and thunderbird examples 2024-01-29 20:14:47 +08:00
Timothyxxx
461651e127 Merge remote-tracking branch 'origin/main' 2024-01-29 13:23:21 +08:00
Timothyxxx
37e09a994e Fix some errors found in impress and thunderbird examples 2024-01-29 13:23:06 +08:00
rhythmcao
532826835d chrome (google drive related) + X multi-app examples finished (leaving two emails and thunderbird-profile.tar.gz to be crafted) 2024-01-29 11:16:27 +08:00
Timothyxxx
cc21c3a6b1 Fix some errors found in calc examples 2024-01-28 21:19:18 +08:00
BlankCheng
460507b5c3 Merge main 2024-01-26 16:43:18 +08:00
BlankCheng
50f7669e59 Update gimp metrics and getters 2024-01-26 16:39:13 +08:00
tsuky_chen
f0b073989a feat:fix 2024-01-26 02:21:04 +08:00
tsuky_chen
932b73c67d load libreoffice writer eval -batch 2 2024-01-26 02:15:42 +08:00
tsuky_chen
3e7cfa8699 load libreoffice writer eval -batch 2 2024-01-26 02:07:26 +08:00
Timothyxxx
64594701ae Load OS Ubuntu examples batch 2 2024-01-26 00:15:04 +08:00
Timothyxxx
b9ae4174b1 Fix OS examples annotated by Yitao 2024-01-25 19:57:32 +08:00
rhythmcao
f194fb8d75 add multi_apps; update chrome utilities 2024-01-25 13:53:19 +08:00