Finish Aguvis eval on OSWorld (#107)

* Initialize Aguvis eval on OSWorld * Debug * Debug * v1, internal version * Add experiments script * Fix minor bugs * Update new endpoint * Update ip * Update * Update * Update * Update * Update * Update * Update * Update * Fix model name * Fix docker close issues; update prompting * Fix missed * Fix the default port to avoid crashing on examples like '_update_browse_history_setup' * Fix server and chromium ports in setup * Revert and add missed dependency * Add VLC port for docker * Update * Aguvis Grounding * Add Aguvis as planner * fix parse bug * fix pause * fix planner prompt * Aguvis Grounding * fix * fix * fix * add logger for each example * Modify Aguvis Planner Prompts * fix logger setup * fix absolute coordinates * Finish Aguvis Evaluation on OSWorld * Merge origin/main into junli/aguvis * Remove screenshot --------- Co-authored-by: Tianbao Xie <tianbaoxie@U-492FC39R-0217.local> Co-authored-by: Timothyxxx <384084775@qq.com> Co-authored-by: FredWuCZ <fredwucz@outlook.com>
2024-11-24 16:43:25 +08:00
parent 7d84a21962
commit 1503eb3994
6 changed files with 407 additions and 247 deletions
--- a/desktop_env/desktop_env.py
+++ b/desktop_env/desktop_env.py
@@ -223,7 +223,7 @@ class DesktopEnv(gym.Env):
                or (len(self.metric) == len(self.result_getter) == len(self.expected_getter) == len(
                    self.metric_options)))

-    def step(self, action, pause=0.5):
+    def step(self, action, pause=2):
        self._step_no += 1
        self.action_history.append(action)

@@ -252,6 +252,7 @@ class DesktopEnv(gym.Env):
                # the set of all possible python commands insides `pyautogui`
                self.controller.execute_python_command(action)

+        time.sleep(pause)
        observation = self._get_obs()

        return observation, reward, done, info