Commit Graph

78 Commits

Author SHA1 Message Date
Timothyxxx
d1e2b12b41 Fix GIMP bug; Speedup the environment, when there is not a11y tree needed, we can do no controller.get 2024-03-20 22:22:59 +08:00
Timothyxxx
ace5842505 Fix typo 2024-03-19 18:57:47 +08:00
David Chang
4df088e2ad ver Mar19thv2
supplemented at info back for som setting
2024-03-19 18:41:55 +08:00
David Chang
05336a8ecf Merge branch 'main' into zdy 2024-03-19 17:47:23 +08:00
David Chang
b5d58b8ecd ver Mar19th
a tiny fix
2024-03-19 17:43:34 +08:00
Fangyu Lei
41db4b44e7 Update agent.py mixtral 2024-03-19 12:06:33 +08:00
David Chang
3db0591868 ver Mar18th
checked Claude agent
2024-03-18 17:42:13 +08:00
Timothyxxx
d74ab1a44e Merge remote-tracking branch 'origin/main' 2024-03-18 14:56:37 +08:00
Timothyxxx
204a2b949f Update claude endpoint 2024-03-18 14:56:23 +08:00
lfy79001
b067d5a840 add cogagent server 2024-03-18 00:22:57 +08:00
Jason Lee
716cf7b9ff add wandb settings 2024-03-17 22:31:43 +08:00
Jason Lee
48aedb09a7 add wandb settings, remember to set WANDB_KEY 2024-03-17 22:30:29 +08:00
lfy79001
acc2d41bdb add mixtral cogagent 2024-03-17 22:27:59 +08:00
Timothyxxx
e156a20e3d Update new func 2024-03-17 22:25:13 +08:00
lfy79001
505e772463 claude3_agent_code 2024-03-16 11:57:49 +08:00
lfy79001
684b4a1b7b claude3_agnet_code 2024-03-16 11:27:09 +08:00
lfy79001
3b13046745 add claude3 agent code 2024-03-16 01:40:41 +08:00
lfy79001
017dde8966 add claude3 agent code 2024-03-16 01:37:42 +08:00
David Chang
57f2257254 ver Mar15th
fixed bugs about infeasible task evaluation
2024-03-15 22:49:35 +08:00
David Chang
6face585f3 Merge branch 'zdy' 2024-03-15 22:48:40 +08:00
David Chang
e166106b6a ver Mar15th
added an option to keep buttons without text information but with an
image for SoM setting
2024-03-15 22:46:14 +08:00
Timothyxxx
1ad4527e8b Change SoM input and output 2024-03-15 22:10:35 +08:00
Timothyxxx
4db207fc27 Merge remote-tracking branch 'origin/main'
# Conflicts:
#	mm_agents/agent.py
#	run.py
2024-03-15 21:10:32 +08:00
Timothyxxx
5cbf1b28ca Fix bugs 2024-03-15 21:06:50 +08:00
Jason Lee
815c7ab67c filter unfinished examples and add timer to ensure upper limit of each example 2024-03-15 16:52:17 +08:00
David Chang
f6b96165e2 Merge branch 'zdy' 2024-03-14 22:40:27 +08:00
Timothyxxx
44ff027801 Refactor experiments and agent implementation 2024-03-14 22:32:49 +08:00
Timothyxxx
71ca8fbe1c refactor on exp code 2024-03-14 19:25:25 +08:00
Timothyxxx
26d52a7231 Code clean 2024-03-14 11:52:38 +08:00
Timothyxxx
741e26c3f8 Update 2024-03-13 23:35:04 +08:00
Timothyxxx
c2aa009ed8 Update server script, baseline and running script 2024-03-13 15:04:19 +08:00
David Chang
0c9c2f214a ver Mar11thv2
minor adjustment
2024-03-11 22:45:16 +08:00
David Chang
e95e8e55ea ver Mar11th
updated filter_nodes
2024-03-11 12:33:47 +08:00
David Chang
f08fa4912c ver Mar10th
changed AT element filtering
2024-03-10 18:03:02 +08:00
Timothyxxx
030574e316 Improve on mmagents prompts; initialize online tasks from Mind2Web 2024-02-22 22:01:22 +08:00
Timothyxxx
068c6f5769 122324154 2024-02-02 14:36:53 +08:00
Timothyxxx
32bcdd0937 Modify the logic of SoM agent 2024-02-01 18:58:22 +08:00
Timothyxxx
c31c9f4e7d Merge remote-tracking branch 'origin/main'
# Conflicts:
#	mm_agents/gpt_4v_agent.py
2024-02-01 16:57:01 +08:00
Timothyxxx
59e2417a08 Add Mistral, Qwen, Gemini support; Fix minor bugs 2024-02-01 16:55:38 +08:00
David Chang
5d436a6b66 ver Feb1st
human evaluation and SoM experiments on Thunderbird
2024-02-01 11:38:46 +08:00
Timothyxxx
606fab4cfa Fix minor bug when get a11y tree and linearize for agent 2024-01-31 00:29:51 +08:00
David Chang
da306376da ver Jan30th
updated function to get AT on Windows
2024-01-30 20:06:58 +08:00
David Chang
9e91b8a5a8 ver Jan29thv2
check som implementation
2024-01-30 00:25:00 +08:00
David Chang
d8a497a417 ver Jan29th
updated the position of SoM marks
2024-01-29 21:49:53 +08:00
Timothyxxx
cc21c3a6b1 Fix some errors found in calc examples 2024-01-28 21:19:18 +08:00
David Chang
8525825fb2 Merge branch 'zdy' 2024-01-27 23:18:33 +08:00
David Chang
5a486b6b37 ver Jan27th
debugged at+screenshot implementation, no issues found
fixed a little bugs
2024-01-27 23:10:48 +08:00
Timothyxxx
909aa868f3 Improve on agent codes; add auto-running experiments code; Fix some examples 2024-01-27 19:47:47 +08:00
David Chang
eef5158663 ver Jan26thv3
fixed bug caused by an empty node.text
remove nodes whose name and text are all empty
2024-01-26 23:49:15 +08:00
David Chang
73de0e387a Merge branch 'zdy' 2024-01-26 23:31:41 +08:00