Commit Graph

12 Commits

Author SHA1 Message Date
yuanmengqi
e433f35c1f feat: standardize configuration fields across all evaluation examples
- Add `fixed_ip` field to all 369 JSON files in examples directory
  - Set to `true` for 8 files listed in google_chrome.json multi_apps
  - Set to `false` for remaining 361 files
- Add `possibility_of_env_change` field to 363 JSON files missing this field
  - Set to "low" for newly added fields
  - Preserve existing values (4 medium, 2 high) for 6 files that already had this field

This ensures consistent configuration schema across all evaluation examples
while maintaining backward compatibility with existing settings.
2025-07-16 13:45:34 +00:00
Timothyxxx
fb7bafb885 feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without 2025-06-05 18:46:53 +08:00
Timothyxxx
34748567a5 feat: Migrate OSWorld files to HuggingFace cache with comprehensive documentation
- Add detailed README for file cache repository
- Implement migration script with retry logic and browser simulation
- Support automatic file type detection and deduplication
- Ensure reliable hosting for OSWorld evaluation files
2025-05-28 04:29:37 +08:00
MillanK
983283a86a patch: minor bug fixes for evaluator and task configurations, documentation update (#121)
* fix: /cursor_position api return format fix

* chore: update README.md to remove deprecated command

* fix: add base score for evaluators and minor bug fixes

* fix: add base score for setup configurations

---------

Co-authored-by: Jiaqi Deng <jiaqideng@Jiaqis-MacBook-Pro.local>
2025-01-18 22:25:18 +08:00
rhythmcao
8b42d699af fix Desktop path error, revise main.py and update google writer tutorial 2024-02-06 21:45:03 +08:00
Timothyxxx
e90bad645b Fix impress background color check func 2024-01-31 02:03:24 +08:00
Timothyxxx
f9d9895541 Fix some errors found in impress and thunderbird examples 2024-01-29 20:14:47 +08:00
tsuky_chen
394d6353fd update libreoffice impress eval examples 2024-01-28 12:18:25 +08:00
Timothyxxx
4eb27fb800 (makeup) Finish Impress v2 loading, some pre-process left 2024-01-24 11:34:38 +08:00
Timothyxxx
42b433dce8 Finish Impress v2 loading, some pre-process left 2024-01-23 15:29:23 +08:00
tsuky_chen
ba8ae104cf update impress eval examples 2024-01-17 18:00:20 +08:00
Timothyxxx
e891eedfde libreoffice impress and writer initialization 2023-12-25 01:40:39 +08:00