Files
sci-gui-agent-benchmark/evaluation_examples/examples
Danyang Zhang adc9ad88c2 Thunderbird eval fix (#233)
* ver Jul2nd

updated task requiring set up new email account

* ver Jul3rd

fixed several tasks
2025-07-03 21:55:55 +08:00
..
2025-07-03 21:32:41 +08:00
2025-07-03 16:59:05 +08:00
2025-06-30 18:23:09 +08:00
2025-06-30 18:23:09 +08:00
2025-07-03 21:55:55 +08:00
2025-07-03 16:59:05 +08:00
2025-07-03 21:55:55 +08:00
2025-06-29 20:18:44 +08:00
2025-06-24 17:08:09 +08:00