Commit Graph

1013 Commits

Author SHA1 Message Date
adlsdztony
e363da2fd7 docs: update README with important execution note & fix: fix auto-refresh logic 2025-06-02 21:11:38 +08:00
Zilong Zhou
1dcb3e069b Merge pull request #204 from yuanmengqi/main
edit operator
2025-06-02 20:25:00 +08:00
yuanmengqi
98a810d31e edit operator 2025-06-02 12:11:25 +00:00
adlsdztony
2b36860a03 refactor&fix: remove unused no-transition styles and simplify refresh logic 2025-06-01 09:36:46 +00:00
adlsdztony
37505f4c3b feat&fix: implement auto-refresh functionality and disable animation when refresh 2025-06-01 08:45:58 +00:00
adlsdztony
9c0cbebf9a refactor: simplify AWS VM management by removing unused methods and improving logging 2025-06-01 08:31:47 +00:00
adlsdztony
e48bd6b059 feat: add .env configuration file and update README with configuration details 2025-06-01 07:07:47 +00:00
adlsdztony
41e9e86379 fix: update task rendering to correctly display error count 2025-06-01 06:58:05 +00:00
adlsdztony
b5efb82172 feat&fix: add task recording endpoint, enhance video player support, and improve mobile responsiveness 2025-06-01 06:50:02 +00:00
adlsdztony
cb62b3c877 feat&fix: update paths in configuration, enhance error handling, and improve UI elements 2025-06-01 04:48:50 +00:00
adlsdztony
d1a001b2b7 fix&refactor: correct port mapping in docker-compose and set fixed port in main.py 2025-06-01 10:57:14 +08:00
adlsdztony
60a2b495b9 feat: add README for OSWorld Monitor with configuration and usage instructions 2025-06-01 10:48:14 +08:00
adlsdztony
9476b5f765 Merge branch 'feat/aws-provider-support' of https://github.com/xlang-ai/OSWorld into feat/aws-provider-support 2025-06-01 10:33:43 +08:00
adlsdztony
53c4106c5b feat: Implement task monitoring web application 2025-06-01 10:31:27 +08:00
Zilong Zhou
157f2b0ca2 Merge pull request #203 from yuanmengqi/main
add openai cua agent code
2025-05-31 20:48:01 +08:00
yuanmengqi
228849ab03 add openai cua agent 2025-05-31 11:22:38 +00:00
yuanmengqi
64ebbceedb fixr: update step method signature and return value in DesktopEnv class 2025-05-31 08:43:53 +00:00
adlsdztony
7faa9554d7 fixr: update step method signature and return value in DesktopEnv class 2025-05-29 17:27:33 +08:00
adlsdztony
13573f6caf feat: add fake env 2025-05-29 17:23:17 +08:00
adlsdztony
8a0ea52a31 Merge branch 'main' into feat/aws-provider-support 2025-05-29 14:05:12 +08:00
adlsdztony
8b4600cb63 feat&refactor: update AWS configuration guidelines and improve environment variable handling 2025-05-28 13:28:29 +08:00
adlsdztony
d8ae209162 fix&refactor: improve connection retry logic and remove unnecessary wait time for AWS instance readiness 2025-05-28 13:05:32 +08:00
Timothyxxx
34748567a5 feat: Migrate OSWorld files to HuggingFace cache with comprehensive documentation
- Add detailed README for file cache repository
- Implement migration script with retry logic and browser simulation
- Support automatic file type detection and deduplication
- Ensure reliable hosting for OSWorld evaluation files
2025-05-28 04:29:37 +08:00
Zilong Zhou
f3d3cd9ed0 Merge pull request #198 from yuanmengqi/main
aws_communication_success_log
2025-05-27 16:58:31 +08:00
Zilong Zhou
c9fbea988c Update desktop_env/providers/aws/provider.py
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
2025-05-27 16:57:33 +08:00
adlsdztony
d073d8775d fix: remove unused region parameter from DesktopEnv initialization 2025-05-27 16:51:12 +08:00
Zilong Zhou
e0e2a33718 Merge branch 'feat/aws-provider-support' into main 2025-05-27 16:36:16 +08:00
adlsdztony
fd3ab09be8 chore&delete: remove awscliv2.zip file 2025-05-27 13:55:13 +08:00
adlsdztony
8837a0130a chore&delete: remove AWS lock and VM files, and delete launch configuration, add them into gitignore 2025-05-27 13:50:48 +08:00
yuanmengqi
b7e83a62ee aws_communication_success 2025-05-27 05:14:33 +00:00
adlsdztony
431a762421 feat&fix: add logging for setup function calls and include snapshot name in AWS provider configuration 2025-05-26 20:37:20 +08:00
adlsdztony
874878e882 feat&fix: update AWS VM management methods and add AWS provider configuration 2025-05-26 18:07:35 +08:00
uvheart
a845824f06 add azure_gpt_4o (#197) 2025-05-23 03:57:42 +08:00
Shihao Liang
119bef25e2 Dev/uitars 15 (#194)
* debug uitars1.0, add uitars1.5

* update pyautogui parser

* modify function name

* update parser

* update prompt

* FIX: bug in ui tars
2025-05-19 17:15:17 +08:00
MillanK
51f5ddea04 Add Jedi agent implementation to mm_agents (#192)
* feat: implement Jedi agent

* chore: code clean
2025-05-10 19:55:33 +08:00
Thomas Kuntz
5678b510d7 fix: Invalid escape sequence in prompts (#191)
Fixes the warning: SyntaxWarning: invalid escape sequence '\`'
2025-05-10 18:19:07 +08:00
Danyang Zhang
2d2e87d168 Merge pull request #187 from thomas-kuntz/fix-thund-path
fix: Broken profile path in 3 Thunderbird tasks
2025-05-06 16:33:44 +08:00
Danyang Zhang
7bf99cb823 Update 15c3b339-88f7-4a86-ab16-e71c58dcb01e.json 2025-05-06 16:29:35 +08:00
Danyang Zhang
e4097783bb Update dfac9ee8-9bc4-4cdc-b465-4a4bfcd2f397.json 2025-05-06 16:28:52 +08:00
Thomas Kuntz
7d88283f8a feat: Support newer Gemini models (#188) 2025-05-06 16:04:30 +08:00
Thomas Kuntz
af993b3a3d fix: Broken profile path in 3 Thunderbird tasks 2025-05-04 14:03:06 +02:00
Tianbao Xie
408ee1ba7d Update README.md 2025-05-01 21:51:14 +08:00
Tianbao Xie
2615d57344 Add file cache 2025-05-01 21:51:02 +08:00
Shihao Liang
b92c716df7 Dev/uitars 15 (#181)
* debug uitars1.0, add uitars1.5

* update pyautogui parser

* modify function name

* update parser

* update prompt
2025-04-21 13:44:08 +08:00
Shihao Liang
bd2e980666 Dev/uitars 15 (#178)
* debug uitars1.0, add uitars1.5

* update pyautogui parser

* modify function name

* update parser
2025-04-17 18:49:21 +08:00
Xubin Ren
1d10514125 Fix Search Engine Detection Discrepancy in Chrome Evaluation (#172)
* Update bb5e4c0d-f964-439c-97b6-bdb9747de3f4.json

* Update __init__.py

* Update general.py
2025-04-10 17:24:50 +08:00
MillanK0817
eb24584098 patch: fix the bug when expected getter is none 2025-04-08 15:35:29 +08:00
Parth A. Patel
bbfeecb475 fix: af2d657a-e6b3-4c6a-9f67-9e3ed015974c task config has type (#169)
Type on "examine_alignment" option results in false negatives
2025-04-06 02:20:51 +08:00
Timothyxxx
ec583d6f0c Enhance metric evaluation in DesktopEnv
- Add assertions to ensure the number of metrics matches the number of result and expected getters.
- Refactor metric calculation logic to handle cases with and without expected values more clearly.
- Improve comments for better understanding of single and multiple metric evaluations.
2025-04-02 23:45:56 +08:00
Timothyxxx
d373817edb Modify VLC launch command and fullscreen detection
- Add VLC_VERBOSE=-1 to suppress verbose logging in VLC launch commands across multiple example files
- Update is_vlc_fullscreen function to handle cases where screen size or window size is None
- Improve robustness of VLC-related metrics and example configurations
2025-03-06 22:11:42 +08:00