sci-gui-agent-benchmark/evaluation_examples/examples
MillanK cbc3b590ff Task fix batch (#383)
* update task eval for 873cafdd-a581-47f6-8b33-b9696ddb7b05

* fix c1fa57f3-c3db-4596-8f09-020701085416: add tolerance to URL matching

* 8df7e444-8e06-4f93-8a1a-c5c974269d82: add clearer instructions on the filename to use when compressing

* add address string normalization for 6f4073b8-d8ea-4ade-8a18-c5d1d5d5aa9a

---------

Co-authored-by: Jiaqi <dengjiaqi@moonshot.cn>
2025-11-19 17:24:25 +08:00