Files
sci-gui-agent-benchmark/evaluation_examples/examples
Tianbao Xie bba367b8bc fix: fix multiapps tasks (#231)
* Update JSON example for multi_apps: change snapshot name and specify presenter in instructions for clarity.

* Enhance PDF image comparison in chrome.py by adding existence checks for input files and improving image extraction logic. Introduce image hashing for similarity scoring with a configurable threshold. Update docs.py to support fuzzy matching in DOCX file comparisons, allowing for similarity scoring based on text content. Modify example JSON to enable fuzzy matching option.

---------

Co-authored-by: yuanmengqi <yuanmengqi@mail.ustc.edu.cn>
2025-07-03 16:58:43 +08:00
..
2025-06-07 05:21:04 +00:00
2025-06-30 18:23:09 +08:00
2025-06-30 18:23:09 +08:00
2025-06-29 20:18:44 +08:00
2025-06-24 17:08:09 +08:00