Feat/claude cua support (#253)

* feat: add claude support
* feat: add script for end-to-end evaluation with logging and task distribution
* feat&fix: add tool result handling and update model default in evaluation script
* chore: remove run_test_env.py script
* feat&fix: implement action parsing for tool calls and update default action space
* fix: update text formatting in action parsing and replace logger import
* feat&fix: implement action parsing for tool calls and add screen size handling
* feat: add setup instructions for Anthropic API integration
* feat: add notice about image size limitations for Anthropic API
* Delete test_env/logger.py
* Delete test_env/utils.py
2025-07-13 21:10:49 +08:00
parent 38a30734a6
commit 349f2fd9fe
13 changed files with 1975 additions and 4 deletions
--- a/mm_agents/anthropic/README.md
+++ b/mm_agents/anthropic/README.md
@@ -0,0 +1,18 @@
+# Anthropic Agent Integration
+> Notice: The Anthropic API scales down any image whose long edge exceeds 1568 pixels or that costs more than ~1,600 tokens, so we resize screenshots to 1280x720.
+## Setup
+To run with the Anthropic API, you need to set up your environment with the necessary API keys and configurations. Follow these steps:
+1. **Install Dependencies**: Ensure you have the required Python packages installed. You can do this by running:
+```bash
+pip install anthropic
+```
+2. **Set Environment Variables**: Set the environment variables that hold your credentials. You can do this in a `.env` file:
+For AWS Bedrock:
+```.env
+AWS_ACCESS_KEY_ID=your_access_key_id
+AWS_SECRET_ACCESS_KEY=your_secret_access_key
+```
+For the Anthropic API, set `APIProvider` to `anthropic` and set the API key:
+```.env
+ANTHROPIC_API_KEY=your_anthropic_api_key
+```
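
The setup steps in this README could be backed by a small preflight check that reports which credentials are missing before an evaluation run starts. The sketch below is not part of this commit. `REQUIRED_VARS`, `missing_credentials`, and the provider label `bedrock` are hypothetical names chosen for illustration; only the variable names and the `anthropic` provider value come from the README. It also assumes the `.env` values have already been loaded into the process environment.

```python
import os

# Hypothetical mapping from provider label to the variables it needs.
# The variable names come from the README; "bedrock" is an assumed label.
REQUIRED_VARS = {
    "anthropic": ["ANTHROPIC_API_KEY"],
    "bedrock": ["AWS_ACCESS_KEY_ID", "AWS_SECRET_ACCESS_KEY"],
}


def missing_credentials(provider, env=None):
    """Return the required variable names that are unset or empty."""
    env = os.environ if env is None else env
    if provider not in REQUIRED_VARS:
        raise ValueError(f"unknown provider: {provider!r}")
    return [name for name in REQUIRED_VARS[provider] if not env.get(name)]


if __name__ == "__main__":
    # Report the status of each provider against the current environment.
    for provider in REQUIRED_VARS:
        missing = missing_credentials(provider)
        status = "ok" if not missing else "missing " + ", ".join(missing)
        print(f"{provider}: {status}")
```

Passing `env` explicitly keeps the check testable without touching the real environment; in normal use it falls back to `os.environ`.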