Commit Graph

12 Commits

Author SHA1 Message Date
Timothyxxx
fb7bafb885 feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without 2025-06-05 18:46:53 +08:00
Timothyxxx
34748567a5 feat: Migrate OSWorld files to HuggingFace cache with comprehensive documentation
- Add detailed README for file cache repository
- Implement migration script with retry logic and browser simulation
- Support automatic file type detection and deduplication
- Ensure reliable hosting for OSWorld evaluation files
2025-05-28 04:29:37 +08:00
David Chang
4897211a46 ver Jan31stv6
finished calc human evaluation
updated calc configs with an extra sleep to guarantee the integrity of
downloaded xlsx file
2024-01-31 22:55:47 +08:00
Timothyxxx
cc21c3a6b1 Fix some errors found in calc examples 2024-01-28 21:19:18 +08:00
Timothyxxx
be17bd3307 Fix some errors found in thunderbird examples 2024-01-28 15:35:31 +08:00
Timothyxxx
c875cad3e5 Fix some errors found in thunderbird examples 2024-01-28 15:32:14 +08:00
David Chang
8025bf19f0 ver Jan27th
corrected usage of pyautogui in calc postconfig
2024-01-27 19:46:06 +08:00
David Chang
7a85c76369 ver Jan22nd
updated all the existing calc configs
2024-01-22 12:42:50 +08:00
David Chang
19214f2107 ver Jan17thv2
updated compare_table to compare the shown values through exported csv
2024-01-17 22:43:26 +08:00
David Chang
ffc4c32bac ver Jan17th
updated the existing task configs
2024-01-17 17:27:08 +08:00
David Chang
5e2a03720d ver Jan10thv4
updated /home/david to /home/user
2024-01-10 22:33:33 +08:00
David Chang
6e6ef03bc9 ver Jan2nd
calc metrics are prepared by and large
2024-01-02 21:03:57 +08:00