Feat/claude cua support (#253)

* feat: add claude support

* feat: add script for end-to-end evaluation with logging and task distribution

* feat&fix: add tool result handling and update model default in evaluation script

* chore: remove run_test_env.py script

* feat&fix: implement action parsing for tool calls and update default action space

* fix: update text formatting in action parsing and replace logger import

* feat&fix: implement action parsing for tool calls and add screen size handling

* feat: add setup instructions for Anthropic API integration

* feat: add notice about image size limitations for Anthropic API

* Delete test_env/logger.py

* Delete test_env/utils.py
This commit is contained in:
Zilong Zhou
2025-07-13 21:10:49 +08:00
committed by GitHub
parent 38a30734a6
commit 349f2fd9fe
13 changed files with 1975 additions and 4 deletions

View File

@@ -0,0 +1,18 @@
# Anthropic Agent Integration
> Notice: As Anthropic API only supports images long edge is less than 1568 pixels and image is less than ~1,600 tokens, we resize the screenshot to 1280x720.
## Setup
To run with the Anthropic API, you need to set up your environment with the necessary API keys and configurations. Follow these steps:
1. **Install Dependencies**: Ensure you have the required Python packages installed. You can do this by running:
```bash
pip install anthropic
```
2. **Set Environment Variables**: You need to set the environment variable with your API key. You can do this in .env:
For aws bedrock:
```.env
AWS_ACCESS_KEY_ID=your_access_key_id
AWS_SECRET_ACCESS_KEY=your_secret_access_key
```
For anthropic, you need set APIProvider to `anthropic` and set the API key:
```.env
ANTHROPIC_API_KEY=your_anthropic_api_key
```