Commit Graph

16 Commits

Author SHA1 Message Date
Timothyxxx
71ca8fbe1c refactor on exp code 2024-03-14 19:25:25 +08:00
Timothyxxx
313521ac52 Add test meta data 2024-03-14 13:16:49 +08:00
Timothyxxx
a7ad0c70fa Update todos and fixmes 2024-03-13 23:42:31 +08:00
Timothyxxx
9fd63081ea Update todos and fixmes 2024-03-13 23:40:51 +08:00
Timothyxxx
5624bdf144 Merge remote-tracking branch 'origin/main' 2024-03-13 23:35:19 +08:00
Timothyxxx
741e26c3f8 Update 2024-03-13 23:35:04 +08:00
Jason Lee
8812cc9930 update some ids of "failed" examples 2024-03-13 21:45:50 +08:00
Jason Lee
cee3b93009 update all ids in experiment_screenshot.py 2024-03-13 21:06:55 +08:00
Timothyxxx
c2aa009ed8 Update server script, baseline and running script 2024-03-13 15:04:19 +08:00
Timothyxxx
068c6f5769 122324154 2024-02-02 14:36:53 +08:00
Timothyxxx
2292053698 Update some config 2024-01-31 23:50:45 +08:00
Timothyxxx
cc21c3a6b1 Fix some errors found in calc examples 2024-01-28 21:19:18 +08:00
Timothyxxx
c875cad3e5 Fix some errors found in thunderbird examples 2024-01-28 15:32:14 +08:00
Timothyxxx
909aa868f3 Improve on agent codes; add auto-running experiments code; Fix some examples 2024-01-27 19:47:47 +08:00
Timothyxxx
f88331416c Refactor baselines code implementations 2024-01-20 18:55:21 +08:00
Timothyxxx
09f3e776ae Initialize all baselines: screenshot, a11y tree, both, SoM, SeeAct 2024-01-20 00:13:46 +08:00