Commit Graph

950 Commits

Author SHA1 Message Date
Timothyxxx
2f0f3f31aa Fix Duplicate ids; Remove unused JSON files across multiple applications 2025-02-10 15:49:54 +08:00
Tianbao Xie
f4750701d4 Address https://github.com/xlang-ai/OSWorld/issues/130 2025-02-10 12:55:44 +08:00
Shihao Liang
0bc1e08440 Dev/uitars (#129)
* init uitars

* change agent class name
2025-02-08 12:49:40 +08:00
Eric Patey
bf3f054564 Fix crash caused by referencing an unbound local variable. (#128)
Co-authored-by: Eric Patey <>
2025-02-07 23:31:53 +08:00
Eric Patey
3ee6c34a36 Fix referenced before assignment regression introduced with #121. (#125)
Co-authored-by: Eric Patey <>
2025-02-05 10:51:59 +08:00
MillanK
983283a86a patch: minor bug fixes for evaluator and task configurations, documentation update (#121)
* fix: /cursor_position api return format fix

* chore: update README.md to remove deprecated command

* fix: add base score for evaluators and minor bug fixes

* fix: add base score for setup configurations

---------

Co-authored-by: Jiaqi Deng <jiaqideng@Jiaqis-MacBook-Pro.local>
2025-01-18 22:25:18 +08:00
Tianbao Xie
89426951c9 Fix contrib.rocks 2025-01-14 23:21:11 +08:00
YangJL2003
3148973ce9 Update c1fa57f3-c3db-4596-8f09-020701085416.json 2025-01-14 22:56:32 +08:00
Tianbao Xie
0a4d4ddd63 Update README.md 2024-12-19 20:37:46 +08:00
Timothyxxx
63e69cab08 Fix one instruction error in chrome 6766f2b8-8a72-417f-a9e5-56fcaa735837 2024-12-09 12:35:02 +08:00
Timothyxxx
2c8e8a58f6 Fix minor bug caused by new logging feat in aguvis agent traj 2024-12-05 15:45:09 +08:00
Tianbao Xie
9d6879d334 Fix chromium command for M-chip MacBook device 2024-11-29 20:00:01 +08:00
Tianbao Xie
afba17b510 Server setup readme revision (#108)
* Initialize

* add note for resolution

* Organize

* draft version and todos

* ver Nov24th

supplemented socat installation and switching off automatic suspend and screen-off

* Finish Tianbao todos

* Finish Tianbao todos

* Fix typos

* update font install

* Finish Xiaochuan's Part

* Finish Xiaochuan's Part update

* Update README.md

* Fix format

---------

Co-authored-by: zdy023 <zdy004007@126.com>
Co-authored-by: tsuky_chen <3107760494@qq.com>
Co-authored-by: Jason Lee <lixiaochuan20@gmail.com>
Co-authored-by: Siheng Zhao <77528902+sihengz02@users.noreply.github.com>
2024-11-25 16:30:59 +08:00
Junli Wang
1503eb3994 Finish Aguvis eval on OSWorld (#107)
* Initialize Aguvis eval on OSWorld

* Debug

* Debug

* v1, internal version

* Add experiments script

* Fix minor bugs

* Update new endpoint

* Update ip

* Update

* Update

* Update

* Update

* Update

* Update

* Update

* Update

* Fix model name

* Fix docker close issues; update prompting

* Fix missed

* Fix the default port to avoid crashing on examples like '_update_browse_history_setup'

* Fix server and chromium ports in setup

* Revert and add missed dependency

* Add VLC port for docker

* Update

* Aguvis Grounding

* Add Aguvis as planner

* fix parse bug

* fix pause

* fix planner prompt

* Aguvis Grounding

* fix

* fix

* fix

* add logger for each example

* Modify Aguvis Planner Prompts

* fix logger setup

* fix absolute coordinates

* Finish Aguvis Evaluation on OSWorld

* Merge origin/main into junli/aguvis

* Remove screenshot

---------

Co-authored-by: Tianbao Xie <tianbaoxie@U-492FC39R-0217.local>
Co-authored-by: Timothyxxx <384084775@qq.com>
Co-authored-by: FredWuCZ <fredwucz@outlook.com>
2024-11-24 16:43:25 +08:00
Tianbao Xie
7d84a21962 Fix minor problems when aggregating the results (#106) 2024-11-22 17:37:34 +08:00
MillanK
98f437613d chore: update amazon ami id (#101) 2024-11-12 16:46:46 +08:00
Tianbao Xie
5802065208 docs: add cleanup instruction for residual docker containers
docs: add note about cleaning up residual docker containers

Add note in README about cleaning up residual docker containers after abnormal experiment interruption to prevent performance issues
2024-11-11 12:41:25 +08:00
Tianbao Xie
20442244fa [Feature] Initialize and Implement Aguvis Evaluation on OSWorld (#98)
* Initialize Aguvis eval on OSWorld

* Debug

* Debug

* v1, internal version

* Add experiments script

* Fix minor bugs

* Update new endpoint

* Update ip

* Update

* Update

* Update

* Update

* Update

* Update

* Update

* Update

* Fix model name

* Fix docker close issues; update prompting

* Fix missed

* Fix the default port to avoid crashing on examples like '_update_browse_history_setup'

* Fix server and chromium ports in setup

* Revert and add missed dependency

* Add VLC port for docker

* Update

* Clean

---------

Co-authored-by: Tianbao Xie <tianbaoxie@U-492FC39R-0217.local>
Co-authored-by: FredWuCZ <fredwucz@outlook.com>
2024-11-11 12:36:16 +08:00
Pierre Carrier
b35dc40ff4 SetupController: no server_port for chrome (#96) 2024-11-07 00:33:03 +08:00
Pierre Carrier
1754f195b0 fix(server): run on non-Windows python (#94) 2024-11-06 15:18:13 +08:00
Tianbao Xie
3c458c63de Update README.md 2024-11-03 21:30:57 +08:00
Timothyxxx
5bc48e57d5 Clean on multi_env feat 2024-11-03 10:33:04 +08:00
Dunjie Lu
8be2a40967 Docker (#92)
* multi_env

* multi_env

---------

Co-authored-by: Timothyxxx <384084775@qq.com>
2024-11-02 22:28:23 +08:00
Timothyxxx
3933e0d303 fix(docker): add file lock for port allocation to prevent race conditions 2024-11-02 14:12:57 +08:00
Pierre Carrier
324371e78b requirements.txt: faster install on latest macOS (#86)
Prebuilt binaries are only available on latest macOS with an upgraded pandas.
2024-10-30 09:43:21 +08:00
Pierre Carrier
2b22d49c22 [completely optional] direnv+mise autosetup (#87)
Makes life a lot easier in my experience.
2024-10-30 09:43:10 +08:00
HappySix
900b511422 Add os_type param to VBox manager (#85) 2024-10-25 14:46:09 +08:00
Pierre Carrier
9229c44393 requirements.txt: Python 3.12 compatibility (#82) 2024-10-24 22:46:04 +08:00
Pierre Carrier
924e0fcd17 metrics: fix time regex (#81) 2024-10-24 22:45:42 +08:00
FredWuCZ
05b317f151 Fix minor error on docs 2024-10-23 09:02:12 +08:00
HappySix
f0ae387b39 Merge pull request #78 from xlang-ai/docker
Support Windows VM in Docker
2024-10-22 22:48:32 +08:00
FredWuCZ
954a78be36 Update Docker guidelines 2024-10-22 22:37:46 +08:00
FredWuCZ
278fe6b7c9 Merge Docker guidelines into Readme 2024-10-22 22:34:40 +08:00
Tianbao Xie
275de550b4 Set the default setting back to vmware and Ubuntu, since people may try from desktop first 2024-10-22 22:31:42 +08:00
Tianbao Xie
a895757450 Update README.md 2024-10-22 22:12:50 +08:00
FredWuCZ
6635e8f3fd Minor update on docs 2024-10-22 20:47:39 +08:00
FredWuCZ
e9dbc3c374 Update docs 2024-10-22 20:42:27 +08:00
FredWuCZ
82878c885c Update Ubuntu qcow2 link 2024-10-18 20:17:49 +08:00
FredWuCZ
b46b6f0649 Clean up 2024-10-18 18:47:10 +08:00
FredWuCZ
9e86f160e7 Capture cursor on Windows 2024-10-18 18:44:53 +08:00
FredWuCZ
7eaa4189ae Fix unzip 2024-10-17 19:15:37 +08:00
FredWuCZ
ec3671ae01 Update Docker image link 2024-10-17 14:55:20 +08:00
FredWuCZ
6e75e37eb0 Enable Windows VM in Docker 2024-10-17 13:05:29 +08:00
Tianbao Xie
5dc919cc14 Revert default provider to VMware as Docker is not fully ready 2024-10-15 14:12:39 +08:00
FredWuCZ
3cba868ff3 Update 2024-10-08 17:59:06 +08:00
FredWuCZ
b9339217ef Update 2024-10-03 16:09:12 +08:00
FredWuCZ
fd65cf47f6 Update Windows URL 2024-10-02 12:19:01 +08:00
FredWuCZ
6bb27d3ddd Merge branch 'main' into docker 2024-10-02 12:18:44 +08:00
FredWuCZ
24bad80b53 Add requirements for docker 2024-09-28 22:01:06 +08:00
HappySix
6419d707bc Support Docker VM manager and provider (#75)
* Add docker provider framework

* Update VM download link

* Add stop container

* Update docker manager & provider

* Update

* Update

* Update provider
2024-09-28 21:10:40 +08:00