Commit Graph

28 Commits

Author SHA1 Message Date
Timothyxxx
fb7bafb885 feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without 2025-06-05 18:46:53 +08:00
Timothyxxx
34748567a5 feat: Migrate OSWorld files to HuggingFace cache with comprehensive documentation
- Add detailed README for file cache repository
- Implement migration script with retry logic and browser simulation
- Support automatic file type detection and deduplication
- Ensure reliable hosting for OSWorld evaluation files
2025-05-28 04:29:37 +08:00
Timothyxxx
2f0f3f31aa Fix Duplicate ids; Remove unused JSON files across multiple applications 2025-02-10 15:49:54 +08:00
MillanK
983283a86a patch: minor bug fixes for evaluator and task configurations, documentation update (#121)
* fix: /cursor_position api return format fix

* chore: update README.md to remove deprecated command

* fix: add base score for evaluators and minor bug fixes

* fix: add base score for setup configurations

---------

Co-authored-by: Jiaqi Deng <jiaqideng@Jiaqis-MacBook-Pro.local>
2025-01-18 22:25:18 +08:00
Timothyxxx
25e808cc91 Fix known errors found from feedback (DBUS problems, pulseaudio start, one vlc example with error. typos) 2024-05-18 04:49:29 +08:00
Timothyxxx
792d8844c7 Fix examples, clean files, clean README 2024-02-25 00:39:38 +08:00
Timothyxxx
91bc795de1 Examine and load new batch of OS examples from NL2Bash 2024-02-22 00:04:02 +08:00
Timothyxxx
e1cf8da4e0 Fix the infeasible examples support 2024-02-21 21:22:12 +08:00
Jason Lee
e31e1dacde Merge branch 'xiaochuanli/addChromeExtensions' of github.com:xlang-ai/DesktopEnv into xiaochuanli/addChromeExtensions 2024-02-18 22:16:48 +08:00
Jason Lee
17cd897780 add new examples for chrome 2024-02-18 22:11:16 +08:00
Timothyxxx
543fa840f8 Update OS examples 2024-02-16 15:05:34 +08:00
Timothyxxx
1596770410 Add new os examples 2024-02-15 19:24:22 +08:00
Timothyxxx
3f59ff46dc Add infeasible support 2024-02-14 11:59:50 +08:00
Timothyxxx
66304b3bab Fix OS example 2024-02-07 00:20:44 +08:00
Timothyxxx
f162acbbe3 Fix 2 OS examples 2024-02-06 23:56:53 +08:00
BlankCheng
fe21c47533 Fix samples of Impress and OS 2024-02-01 16:27:24 +08:00
tsuky_chen
e9651b0a53 Update 7688b85f-87a4-4e4a-b2f8-f3d6c3f29b82.json 2024-01-31 19:34:20 +08:00
Tianbao Xie
0d135ced29 Merge pull request #3 from xlang-ai/os
Fix some examples in OS
2024-01-30 15:08:00 +08:00
Liu Yitao
8edd3b9aad Fix examples 2024-01-29 18:22:13 -05:00
Timothyxxx
909aa868f3 Improve on agent codes; add auto-running experiments code; Fix some examples 2024-01-27 19:47:47 +08:00
Timothyxxx
6952b45de4 Improve on agent and tasks configs 2024-01-26 23:30:04 +08:00
Timothyxxx
c346b4379d Make up one OS example 2024-01-26 20:33:24 +08:00
Liu Yitao
0fcdbf63d1 Update tasks 2024-01-26 04:53:41 -05:00
Timothyxxx
64594701ae Load OS Ubuntu examples batch 2 2024-01-26 00:15:04 +08:00
Timothyxxx
b9ae4174b1 Fix OS examples annotated by Yitao 2024-01-25 19:57:32 +08:00
Timothyxxx
0c34fccc15 Initialize Ubuntu OS examples 2024-01-25 16:18:10 +08:00
Liu Yitao
344e7db55c Update OS evals 2024-01-25 10:55:49 +08:00
Liu Yitao
93b4ff7d95 Update OS evals 2024-01-25 10:45:51 +08:00