Commit Graph

12 Commits

Author SHA1 Message Date
Timothyxxx
fb7bafb885 feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without 2025-06-05 18:46:53 +08:00
Timothyxxx
34748567a5 feat: Migrate OSWorld files to HuggingFace cache with comprehensive documentation
- Add detailed README for file cache repository
- Implement migration script with retry logic and browser simulation
- Support automatic file type detection and deduplication
- Ensure reliable hosting for OSWorld evaluation files
2025-05-28 04:29:37 +08:00
David Chang
4897211a46 ver Jan31stv6
finished calc human evaluation
updated calc configs with an extra sleep to guarantee the integrity of
downloaded xlsx file
2024-01-31 22:55:47 +08:00
Timothyxxx
353ab6607d Fix some errors found in thunderbird examples 2024-01-28 16:51:38 +08:00
David Chang
8025bf19f0 ver Jan27th
corrected usage of pyautogui in calc postconfig
2024-01-27 19:46:06 +08:00
David Chang
7a85c76369 ver Jan22nd
updated all the existing calc configs
2024-01-22 12:42:50 +08:00
David Chang
ffc4c32bac ver Jan17th
updated the existing task configs
2024-01-17 17:27:08 +08:00
David Chang
5e2a03720d ver Jan10thv4
updated /home/david to /home/user
2024-01-10 22:33:33 +08:00
David Chang
e4fac09945 ver Dec29th
metric compare_with_formats
2023-12-29 21:19:52 +08:00
David Chang
5a14cf40db Merge branch 'main' into zdy 2023-12-28 21:20:57 +08:00
David Chang
fa6cccc26a Merge branch 'zdy' 2023-12-26 16:56:37 +08:00
David Chang
a6b6022ecb ver Dec26th
evaluation metric checking result file according to rules
2023-12-26 16:46:50 +08:00