yuanmengqi
523d553e88
feat: add client password argument to multiple agents and scripts
...
- Introduced `--client_password` argument in `run_multienv_aguvis.py`, `run_multienv_claude.py`, and `run_multienv_gta1.py` for enhanced security and flexibility.
- Updated agent classes (`PromptAgent`, `AguvisAgent`, `GTA1Agent`) to accept and utilize `client_password` for improved configuration.
- Modified evaluation guidelines to reflect the new client password requirement.
- Ensured existing logic remains intact while enhancing functionality for better user experience.
2025-07-27 16:11:23 +00:00
yuanmengqi
73de48af75
Update Public Evaluation Guidelines and README to require Python 3.10 and enhance installation instructions. Added troubleshooting tips for environment issues and clarified access key creation process in AWS for better security practices.
2025-07-22 19:57:55 +00:00
yuanmengqi
921321c5df
Update Public Evaluation Guidelines to clarify proxy settings. Added information on automatic proxy wrapping for proxy-sensitive tasks and retained the recommendation for users to disable the proxy if not needed. Ensured existing content structure remains intact.
2025-07-22 05:59:57 +00:00
yuanmengqi
2727696835
Enhance Public Evaluation Guidelines by adding new images for AWS setup and monitoring instructions. Included additional contact information for leaderboard updates and error reporting. Ensured clarity and usability for users while preserving existing content structure.
2025-07-22 05:53:33 +00:00
yuanmengqi
05e25ba1b7
Enhance Public Evaluation Guidelines with detailed AWS setup instructions and security configurations. Added new sections for host and client machine setup, including recommended instance types, storage considerations, and security group rules. Updated existing content for clarity and added a new image for Google Drive authentication. Ensure all changes maintain original logic while improving usability for users with varying AWS experience.
2025-07-22 05:35:58 +00:00
Yuan Mengqi
b2fb8b4222
fix chrome tasks ( #230 )
...
* fix chrome
* fix: fix proxy setup
* feat&fix: add proxy support in setup and remove hardcoded proxy from example
* fix tasks
* fix chrome finished
* fix
* clean chrome_fix code
* clean chrome_fix code
---------
Co-authored-by: adlsdztony <zzl0712@connect.hku.hk >
2025-07-03 21:32:41 +08:00
Yuan Mengqi
40354322e8
fix pub eval readme typo ( #214 )
...
* update clean code
* fix pub eval readme typo
2025-06-10 22:57:16 +08:00
yuanmengqi
8a1fc5c385
edit pub eval readme
2025-06-10 13:37:26 +00:00
yuanmengqi
b8d229cdb3
edit pub eval readme
2025-06-10 13:36:48 +00:00
yuanmengqi
fbe88799cf
edit pub eval readme
2025-06-10 13:36:03 +00:00
yuanmengqi
3b5e4f3b15
edit pub eval readme
2025-06-10 13:34:42 +00:00
yuanmengqi
2d5439d062
edit pub eval readme
2025-06-10 13:32:24 +00:00
yuanmengqi
2d3347ca3e
edit pub eval readme
2025-06-10 13:28:54 +00:00
yuanmengqi
1b09d63cb2
edit pub eval readme
2025-06-10 13:27:53 +00:00
yuanmengqi
2bae228803
merge upstream
2025-06-10 13:23:03 +00:00