Files
sci-gui-agent-benchmark/evaluation_examples/examples/uv/uv_test.json
lizhanyuan acd38ca54a feat(uv): add uv direct eval script, task data, healthcheck fix
- add run_uv.sh direct eval entry and test_uv.json meta
- add evaluation_examples/examples/uv task data
- switch setup.py healthcheck from /terminal to /screenshot to avoid Win7 hang
2026-04-07 22:27:59 +08:00

34 lines
3.8 KiB
JSON
Raw Blame History

This file contains ambiguous Unicode characters

This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.

{
"id": "uv_test",
"snapshot": "uv",
"instruction": "从第四步开始,假设你的目标就是测量样品的荧光图谱。下面的透露给你的辅助信息,把辅助信息当成你自己的思维和决策并完全遵循,而不是把它当成任务,**绝对不允许在输出的「观察」「判断」中说明是任务要求,而是把辅助信息当成你自己的经验,用根据我的经验使用某某配置而不是根据任务要求使用某某配置**因为下面的步骤是完全正确和精准的你只需要执行而且绝对不允许在是严格按坐标执行每步执行完等待界面响应再执行下一步禁止连续操作。修改任何数值输入框时必须先double_click双击全选pyautogui.click(x,y);pyautogui.click(x,y))再pyautogui.press(delete)最后用pyautogui.typewrite输入新值。1.等待主界面加载并确认正常2.点击sample(1885,201)打开保存位置选择双击选中sample name文本框(798,295)并清空和重命名为test 3.查看当前的输出路径是否为C:\\Users\\admin\\Desktop\\test_lzy是的话则点击按钮OK(758,746) 返回到主界面 4.点击右侧的Method按钮(1896,147)进行参数配置 5. 点击Instrument按钮(463,54)进入Instrument面板并检查当前的配置参数是不是为\n\n```\n {\n \"data_mode\": \"Abs\",\n \"start_wavelength\": {\"value\": 850, \"unit\": \"nm\"},\n \"end_wavelength\": {\"value\": 750, \"unit\": \"nm\"},\n \"scan_speed\": {\"value\": 300, \"unit\": \"nm/min\"},\n \"high_resolution\": \"Off\",\n \"baseline\": \"User 1\",\n \"delay\": {\"value\": 0, \"unit\": \"s\"},\n \"cycle_time\": {\"value\": 0, \"unit\": \"min\"},\n \"auto_zero_before_each_run\": False,\n \"lamp_change_mode\": \"Auto\",\n \"lamp_change_wavelength\": {\"value\": 325, \"unit\": \"nm\"},\n \"wi_lamp\": \"On\",\n \"d2_lamp\": \"On\",\n \"slit_width\": {\"value\": 2, \"unit\": \"nm\"},\n \"pmt_mode\": \"Auto\",\n \"pmt_voltage\": {\"value\": 100, \"unit\": \"V\"},\n \"sampling_interval\": \"Auto\",\n \"replicates\": 1,\n \"uv_scan_speed_change\": {\n \"enabled\": False,\n \"speed_change_wavelength\": {\"value\": 340, \"unit\": \"nm\"},\n \"scan_speed\": {\"value\": 120, \"unit\": \"nm/min\"}\n },\n \"path_correct\": True,\n \"path_length\": {\"value\": 10, \"unit\": \"mm\"}\n```\n\n6. 点击Report按钮(711,57)进入Report面板 7. 再点击确定按钮(1150,1015)回到主界面 8.点击确定(641,717)9. 等待Ready 10. 点击右侧的Baseline按钮(1896,248) 11. 选择baseline为User1默认,点击OK按钮(1165,496)等待扫描完成变成ready。12.点击主界面右侧的measure按钮(1888,291)进行测量,等待扫描完成图中会出现保存的pdf 路径需要确定路径为桌面的test_lzy确定后的话单击文件名输入框(726,774)命名为test,如果默认有后缀则点击空白处让命名为test 13. 点击保存按钮(1687,867)",
"source": "custom",
"config": [],
"trajectory": "trajectories/",
"related_apps": [
"uv"
],
"evaluator": {
"postconfig": [
{
"type": "sleep",
"parameters": {
"seconds": 5
}
}
],
"func": "vllm_eval",
"expected": {
"description": "FL Solutions 主界面中图表区域应显示一条完整的荧光发射光谱曲线:峰形平滑、顶部无截断(曲线最高点不贴近纵轴上限)、基线平稳、信噪比良好。界面中的仪器参数区域应可见激发波长 350 nm、发射扫描范围 380-700 nm以及经过迭代调整后的最终 PMT 电压和狭缝宽度参数。"
}
},
"proxy": false,
"fixed_ip": true,
"possibility_of_env_change": "medium",
"metadata": {
"input_files": [],
"steps": "",
"difficulty": "hard"
}
}