Danyang Zhang
adc9ad88c2
Thunderbird eval fix (#233)
* ver Jul2nd
updated the task that requires setting up a new email account
* ver Jul3rd
fixed several tasks
2025-07-03 21:55:55 +08:00
Timothyxxx
fb7bafb885
feat: Add proxy configuration to all 369 evaluation examples - 55 with proxy, 314 without
2025-06-05 18:46:53 +08:00
Timothyxxx
34748567a5
feat: Migrate OSWorld files to HuggingFace cache with comprehensive documentation
- Add detailed README for file cache repository
- Implement migration script with retry logic and browser simulation
- Support automatic file type detection and deduplication
- Ensure reliable hosting for OSWorld evaluation files
2025-05-28 04:29:37 +08:00
David Chang
f08fa4912c
ver Mar10th
changed AT element filtering
2024-03-10 18:03:02 +08:00
David Chang
d6cd0936b3
ver Mar7th
updated instructions and set-up configs
2024-03-07 16:54:06 +08:00
David Chang
5817403e2e
ver Mar6th
three new tasks
2024-03-06 15:06:08 +08:00