Files
sci-gui-agent-benchmark/mm_agents/anthropic
Zilong Zhou 74b7c189af Feat/monitor (#254)
* feat: add claude support

* feat: add script for end-to-end evaluation with logging and task distribution

* feat&fix: add tool result handling and update model default in evaluation script

* chore: remove run_test_env.py script

* feat&fix: implement action parsing for tool calls and update default action space

* fix: update text formatting in action parsing and replace logger import

* feat&fix: implement action parsing for tool calls and add screen size handling

* feat: add setup instructions for Anthropic API integration

* feat: add notice about image size limitations for Anthropic API

* Delete test_env/logger.py

* Delete test_env/utils.py

* fix: update logger usage to use global logger and improve error handling

* feat&fix: add configuration management API endpoints and update UI for configuration selection

* feat&fix: update environment configuration, enhance task statistics, and improve UI responsiveness

* feat&fix: add configuration toggle button in UI and improve task loading performance

* feat&fix: add accuracy percentage display to score and style updates for UI
2025-07-14 13:43:41 +08:00
..
2025-07-13 21:10:49 +08:00
2025-07-13 21:10:49 +08:00
2025-07-14 13:43:41 +08:00
2025-07-13 21:10:49 +08:00
2025-07-13 21:10:49 +08:00

Anthropic Agent Integration

Notice: As Anthropic API only supports images long edge is less than 1568 pixels and image is less than ~1,600 tokens, we resize the screenshot to 1280x720.

Setup

To run with the Anthropic API, you need to set up your environment with the necessary API keys and configurations. Follow these steps:

  1. Install Dependencies: Ensure you have the required Python packages installed. You can do this by running:
pip install anthropic
  1. Set Environment Variables: You need to set the environment variable with your API key. You can do this in .env: For aws bedrock:
AWS_ACCESS_KEY_ID=your_access_key_id
AWS_SECRET_ACCESS_KEY=your_secret_access_key

For anthropic, you need set APIProvider to anthropic and set the API key:

ANTHROPIC_API_KEY=your_anthropic_api_key