Commit Graph

11 Commits

Author | SHA1 | Message | Date
David Chang | 9df0854469 | ver Feb1stv3: rerun SoM experiment on thunderbird | 2024-02-01 22:56:09 +08:00
David Chang | be5d55a3f8 | ver Feb1stv2: failed to start up experiments of multi_apps | 2024-02-01 14:22:34 +08:00
David Chang | 5d436a6b66 | ver Feb1st: human evaluation and SoM experiments on Thunderbird | 2024-02-01 11:38:46 +08:00
David Chang | 3dce3ffe63 | ver Jan31stv5: parts of calc experiments | 2024-01-31 16:46:26 +08:00
David Chang | 8a62d96fd3 | ver Jan31stv4: evaluating som on calc | 2024-01-31 16:22:26 +08:00
David Chang | 29f2f3eaf8 | ver Jan31stv3: started to run SoM experiments on os tasks | 2024-01-31 11:11:23 +08:00
David Chang | 9e91b8a5a8 | ver Jan29thv2: check som implementation | 2024-01-30 00:25:00 +08:00
David Chang | d8a497a417 | ver Jan29th: updated the position of SoM marks | 2024-01-29 21:49:53 +08:00
Timothyxxx | c875cad3e5 | Fix some errors found in thunderbird examples | 2024-01-28 15:32:14 +08:00
Timothyxxx | 909aa868f3 | Improve on agent codes; add auto-running experiments code; Fix some examples | 2024-01-27 19:47:47 +08:00
Timothyxxx | 6f27c5bf50 | Wrap up SeeAct implementation | 2024-01-20 19:19:37 +08:00