Files
sci-gui-agent-benchmark/desktop_env/evaluators/metrics
lizhanyuan d71f1f976d feat: vllm_eval 关键帧采样 + Gemini OpenAI 代理支持
- vllm_eval.py: 新增 _sample_key_frames 关键帧采样函数
- vllm_eval.py: 当截图超过 max_eval_images 时均匀采样
- vllm_eval.py: Gemini 模型支持通过 OpenAI 兼容代理调用
- test_single.json: 更新测试任务配置
2026-03-04 16:39:24 +08:00
..
2024-03-14 12:54:10 +08:00
2024-03-14 12:54:10 +08:00
2024-03-14 12:54:10 +08:00
2025-11-19 17:24:25 +08:00
2024-03-14 12:54:10 +08:00
2025-07-19 17:15:40 +08:00
2025-07-08 16:25:00 +08:00