MillanK
48ac57697a
VSCode fix ( #222 )
2025-06-24 17:08:09 +08:00
Tianbao Xie
4e11eafd1d
Robust Evaluation, Blocking File Open, Grader Sensitivity, and LibreOffice Writer Fixes ( #217 )
...
* Refactor evaluator structure in LibreOffice Writer example JSON to support multiple expected and result files, enhancing evaluation flexibility.
* Update instance type to t3.large and add VNC access URL logging for allocated VMs, enhancing remote access capabilities.
* Update instance type to t3.large and add VNC access URL logging for allocated VMs, enhancing remote access capabilities.
* Update time format in get_vm_file function to include hours, minutes, and seconds for more precise file naming with time suffix.
* More delay for 936321ce-5236-426a-9a20-e0e3c5dc536f; support one more potential solutions.
* Enhance SetupController with configurable retry limit and improved error handling for file opening requests. Introduce new function to compare unique training records, and update logging for better debugging. Adjust JSON examples for evaluation to support multiple expected and result files.
* Clean debug code
---------
Co-authored-by: yuanmengqi <yuanmengqi@mail.ustc.edu.cn >
2025-06-16 21:37:19 +08:00
yuanmengqi
7315aec6e6
clean code
2025-06-10 04:06:54 +00:00
chenjix
5959c0846e
Fix libreoffice impress evaluation
2025-06-07 00:13:38 +08:00
Xubin Ren
1d10514125
Fix Search Engine Detection Discrepancy in Chrome Evaluation ( #172 )
...
* Update bb5e4c0d-f964-439c-97b6-bdb9747de3f4.json
* Update __init__.py
* Update general.py
2025-04-10 17:24:50 +08:00
Timothyxxx
d373817edb
Modify VLC launch command and fullscreen detection
...
- Add VLC_VERBOSE=-1 to suppress verbose logging in VLC launch commands across multiple example files
- Update is_vlc_fullscreen function to handle cases where screen size or window size is None
- Improve robustness of VLC-related metrics and example configurations
2025-03-06 22:11:42 +08:00
Tianbao Xie
f4750701d4
Address https://github.com/xlang-ai/OSWorld/issues/130
2025-02-10 12:55:44 +08:00
Eric Patey
bf3f054564
Fix crash caused by referencing an unbound local variable. ( #128 )
...
Co-authored-by: Eric Patey <>
2025-02-07 23:31:53 +08:00
Eric Patey
3ee6c34a36
Fix referenced before assignment regression introduced with #121 . ( #125 )
...
Co-authored-by: Eric Patey <>
2025-02-05 10:51:59 +08:00
MillanK
983283a86a
patch: minor bug fixes for evaluator and task configurations, documentation update ( #121 )
...
* fix: /cursor_position api return format fix
* chore: update README.md to remove deprecated command
* fix: add base score for evaluators and minor bug fixes
* fix: add base score for setup configurations
---------
Co-authored-by: Jiaqi Deng <jiaqideng@Jiaqis-MacBook-Pro.local >
2025-01-18 22:25:18 +08:00
Tianbao Xie
7d84a21962
Fix minor problems when aggragating the results ( #106 )
2024-11-22 17:37:34 +08:00
Tianbao Xie
20442244fa
[Feature] Initialize and Implement Aguvis Evaluation on OSWorld ( #98 )
...
* Initialize Aguvis eval on OSWorld
* Debug
* Debug
* v1, internal version
* Add experiments script
* Fix minor bugs
* Update new endpoint
* Update ip
* Update
* Update
* Update
* Update
* Update
* Update
* Update
* Update
* Fix model name
* Fix docker close issues; update prompting
* Fix missed
* Fix the default port to avoid crashing on examples like '_update_browse_history_setup'
* Fix server and chromium ports in setup
* Revert and add missed dependency
* Add VLC port for docker
* Update
* Clean
---------
Co-authored-by: Tianbao Xie <tianbaoxie@U-492FC39R-0217.local >
Co-authored-by: FredWuCZ <fredwucz@outlook.com >
2024-11-11 12:36:16 +08:00
Pierre Carrier
924e0fcd17
metrics: fix time regex ( #81 )
2024-10-24 22:45:42 +08:00
HappySix
6419d707bc
Support Docker VM manager and provider ( #75 )
...
* Add docker provider framework
* Update VM download link
* Add stop container
* Update docker manager & provider
* Update
* Update
* Update provider
2024-09-28 21:10:40 +08:00
Jason Lee
0c5fbb8be4
fix local state file's location on macos M-chip computer
2024-06-23 08:43:53 -05:00
Jason Lee
7ab4ae360e
fix bugs of functions in getters (macos M-chip version)
2024-06-23 08:30:34 -05:00
Jason Lee
0058add84d
fix function: get_cookie_data (macos M-chip version)
2024-06-23 08:24:51 -05:00
Jason Lee
1ec95f7d61
fix function: "get_bookmarks"
2024-06-22 04:08:13 -05:00
Jason Lee
1c50770817
fix chrome evaluation bugs for macbook ( #43 )
2024-06-11 12:15:27 +08:00
Timothyxxx
25e808cc91
Fix known errors found from feedback (DBUS problems, pulseaudio start, one vlc example with error. typos)
2024-05-18 04:49:29 +08:00
Timothyxxx
9c75df5dce
Clean code; Refactor environment to pass screenshot content instead of path
2024-04-13 23:34:01 +08:00
Timothyxxx
07d9c08bd5
Clean code; Add todos in desktop_env README
2024-04-02 22:34:29 +08:00
Timothyxxx
2d8eeaad58
Fix one bug in Chrome getter; fix one erro for corner case in doc
2024-04-02 14:50:29 +08:00
Timothyxxx
fad621093f
Fix one bug in Chrome getter
2024-04-01 15:05:48 +08:00
tsuky_chen
ca03baacf5
fix conflict
2024-03-21 16:01:31 +08:00
tsuky_chen
169a0a15ad
add libreoffice examples for windows
2024-03-21 15:49:54 +08:00
Timothyxxx
d1e2b12b41
Fix GIMP bug; Speedup the environment, when there is not a11y tree needed, we can do no controller.get
2024-03-20 22:22:59 +08:00
BlankCheng
f5da5e940b
Merge main
2024-03-18 22:21:01 +08:00
BlankCheng
4671455b56
Fix eval func
2024-03-18 22:16:04 +08:00
Timothyxxx
eeae1442cd
Add execute timeout to server; Fix error examples
2024-03-18 20:42:57 +08:00
Timothyxxx
0aae756538
Code clean
2024-03-14 12:54:10 +08:00
BlankCheng
4b15595146
Update fix
2024-03-12 00:17:46 +08:00
Timothyxxx
b4cb64d861
Fix bugs in multiple examples
2024-03-11 00:26:59 +08:00
Timothyxxx
b3d27f6387
Fix bugs in multiple examples
2024-03-10 23:52:29 +08:00
Timothyxxx
e51d0e8cc9
Fix bugs in multiple apps example 0e53
2024-03-10 15:18:14 +08:00
Jason Lee
812be97a41
Merge branch 'main' of github.com:xlang-ai/DesktopEnv
2024-03-10 14:50:17 +08:00
Jason Lee
775cef744f
xiaochuan correct his bugs in multiapp examples, you can try it again now
2024-03-10 14:48:56 +08:00
Timothyxxx
e481afcf5c
Fix multiple examples
2024-03-09 23:01:22 +08:00
Timothyxxx
447c886b0a
Fix multiple apps 5990457f-2adb-467b-a4af-5c857c92d762
2024-03-09 20:54:52 +08:00
Timothyxxx
b0607c4f79
Fix bugs imported by Xiaochuan xs
2024-03-09 19:32:05 +08:00
tsuky_chen
aae848196b
merge
2024-03-09 18:53:27 +08:00
tsuky_chen
5b07ec17bf
fix multi apps
2024-03-09 18:50:16 +08:00
tsuky_chen
f4ec36bdfb
fix multi apps
2024-03-09 18:48:17 +08:00
Jason Lee
2291af394f
update google drive file link in json
2024-03-09 18:06:48 +08:00
Jason Lee
d68e7369b7
Merge branch 'main' of github.com:xlang-ai/DesktopEnv
2024-03-09 14:03:36 +08:00
Jason Lee
6ea3dd856f
fix multiapps bug : "26660ad1-6ebb-4f59-8cba-a8432dfe8d38"
2024-03-09 14:03:26 +08:00
Timothyxxx
1e0a78a453
Add none file handling for general
2024-03-09 00:30:28 +08:00
Timothyxxx
4de0eff703
Add none file handling for doc
2024-03-09 00:16:50 +08:00
Timothyxxx
62b3b2390d
Fix bugs from merging
2024-03-08 23:09:11 +08:00
Tianbao Xie
f01153cadd
Merge branch 'main' into xiaochuanli/addChromeExtensions
2024-03-08 20:45:49 +08:00