Commit Graph

66 Commits

Author SHA1 Message Date
yuanmengqi
144a87fd9a Merge remote-tracking branch 'upstream/fix/aws-proxy' 2025-07-01 16:07:41 +00:00
adlsdztony
64f47d1a32 fix: fix proxy setup 2025-07-01 13:20:26 +00:00
Tianbao Xie
30138c5db1 VLC fix (#224)
* Enhance SetupController with improved logging and error handling during setup and file upload processes. Update instance type to t3.xlarge and AMI ID for AWS configuration. Add download progress logging and exception handling for better debugging.

* Enhance VLC status evaluation by adding multiple paths for file and URL information extraction, improving robustness against varying VLC XML structures. Implement detailed logging for better debugging and error handling in case of mismatches or missing data. Update example JSON for VLC evaluation to use a valid HLS stream URL.

* Improve audio comparison robustness in VLC evaluator by adding error handling for audio file loading and extraction. Implement detailed logging for empty or corrupt files, and normalize DTW distance calculation for more accurate similarity scoring. Remove deprecated audio fingerprint comparison function.

---------

Co-authored-by: yuanmengqi <yuanmengqi@mail.ustc.edu.cn>
2025-06-29 20:18:44 +08:00
Tianbao Xie
0cc93543a8 Environment is_used flag; OS domain fix (#219)
* Refactor evaluator structure in LibreOffice Writer example JSON to support multiple expected and result files, enhancing evaluation flexibility.

* Update instance type to t3.large and add VNC access URL logging for allocated VMs, enhancing remote access capabilities.

* Update instance type to t3.large and add VNC access URL logging for allocated VMs, enhancing remote access capabilities.

* Update time format in get_vm_file function to include hours, minutes, and seconds for more precise file naming with time suffix.

* More delay for 936321ce-5236-426a-9a20-e0e3c5dc536f; support one more potential solutions.

* Enhance SetupController with configurable retry limit and improved error handling for file opening requests. Introduce new function to compare unique training records, and update logging for better debugging. Adjust JSON examples for evaluation to support multiple expected and result files.

* Clean debug code

* Enhance DesktopEnv to track environment usage for optimized snapshot management. Introduce is_environment_used flag to determine if a snapshot revert is necessary based on provider type. Update setup and step methods to mark environment usage appropriately. Add new execute_with_verification method in SetupController for command execution with result verification, improving reliability. Change AWS instance type to m5.large for better performance and update AMI ID for compatibility. Update file opening logic in main.py to handle both file paths and application commands more effectively.

---------

Co-authored-by: yuanmengqi <yuanmengqi@mail.ustc.edu.cn>
2025-06-28 00:45:53 +08:00
Zilong Zhou
634e1c3d6f Reduce the startup time of the software on AWS from one minute to five seconds. (#221)
* feat: use SSD with high throughput

* fix&refactor: update AMI ID and change EBS volume type to gp3 with adjusted IOPS and throughput
2025-06-24 15:35:38 +08:00
Zilong Zhou
3d8f1779a2 feat: use SSD with high throughput (#218) 2025-06-17 18:39:42 +08:00
Tianbao Xie
4e11eafd1d Robust Evaluation, Blocking File Open, Grader Sensitivity, and LibreOffice Writer Fixes (#217)
* Refactor evaluator structure in LibreOffice Writer example JSON to support multiple expected and result files, enhancing evaluation flexibility.

* Update instance type to t3.large and add VNC access URL logging for allocated VMs, enhancing remote access capabilities.

* Update instance type to t3.large and add VNC access URL logging for allocated VMs, enhancing remote access capabilities.

* Update time format in get_vm_file function to include hours, minutes, and seconds for more precise file naming with time suffix.

* More delay for 936321ce-5236-426a-9a20-e0e3c5dc536f; support one more potential solutions.

* Enhance SetupController with configurable retry limit and improved error handling for file opening requests. Introduce new function to compare unique training records, and update logging for better debugging. Adjust JSON examples for evaluation to support multiple expected and result files.

* Clean debug code

---------

Co-authored-by: yuanmengqi <yuanmengqi@mail.ustc.edu.cn>
2025-06-16 21:37:19 +08:00
Kaixin Li
347238e17e Get VM IP again when getting screenshot fails (#215)
In rare cases, the IP of the VM changes after it launches. We can get the IP every time we retry to ensure the correct connection.
2025-06-16 02:40:40 +08:00
yuanmengqi
7315aec6e6 clean code 2025-06-10 04:06:54 +00:00
yuanmengqi
caaa4e5baa fix: update AMI ID for us-east-1 region in AWS manager 2025-06-10 02:32:24 +00:00
yuanmengqi
ca65022137 fix: update AMI ID for us-east-1 region in AWS manager configuration 2025-06-07 21:16:26 +00:00
yuanmengqi
fc3ef6b2be fix: update AMI ID for us-east-1 region in AWS manager configuration 2025-06-07 11:40:09 +00:00
adlsdztony
71e9a1ead8 fix&refactor: improve error handling in download process and enhance start_emulator method signature 2025-06-06 09:08:14 +00:00
Timothyxxx
8373f7cff2 refactor: remove AWSVMManagerWithProxy and integrate proxy support directly into AWSVMManager for streamlined VM allocation;
minor fix on openai_cua_agent
2025-06-06 02:55:50 +08:00
Timothyxxx
8b7727d955 refactor: update proxy configuration script for AWSProviderWithProxy to enhance clarity and support multiple Firefox paths 2025-06-06 02:39:16 +08:00
Timothyxxx
bfd0a7ad0d feat: implement proxy management for AWS VM provider and enhance task configuration handling 2025-06-06 00:36:21 +08:00
adlsdztony
10153ffff6 feat&fix: add signal handling for VM allocation and improve cleanup on termination 2025-06-04 03:15:30 +00:00
adlsdztony
8d54d4302f feat&fix: enhance error handling during environment initialization and VM allocation 2025-06-03 13:38:47 +00:00
Zilong Zhou
1dcb3e069b Merge pull request #204 from yuanmengqi/main
edit operator
2025-06-02 20:25:00 +08:00
yuanmengqi
98a810d31e edit operator 2025-06-02 12:11:25 +00:00
adlsdztony
9c0cbebf9a refactor: simplify AWS VM management by removing unused methods and improving logging 2025-06-01 08:31:47 +00:00
adlsdztony
8b4600cb63 feat&refactor: update AWS configuration guidelines and improve environment variable handling 2025-05-28 13:28:29 +08:00
adlsdztony
d8ae209162 fix&refactor: improve connection retry logic and remove unnecessary wait time for AWS instance readiness 2025-05-28 13:05:32 +08:00
Zilong Zhou
c9fbea988c Update desktop_env/providers/aws/provider.py
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
2025-05-27 16:57:33 +08:00
Zilong Zhou
e0e2a33718 Merge branch 'feat/aws-provider-support' into main 2025-05-27 16:36:16 +08:00
yuanmengqi
b7e83a62ee aws_communication_success 2025-05-27 05:14:33 +00:00
adlsdztony
874878e882 feat&fix: update AWS VM management methods and add AWS provider configuration 2025-05-26 18:07:35 +08:00
MillanK
c179d0de12 Merge pull request #140 from xlang-ai/aws-maintain
chore: update expired ami ids
2025-02-26 18:01:02 +08:00
MillanK
98f437613d chore: update amazon ami id (#101) 2024-11-12 16:46:46 +08:00
Tianbao Xie
20442244fa [Feature] Initialize and Implement Aguvis Evaluation on OSWorld (#98)
* Initialize Aguvis eval on OSWorld

* Debug

* Debug

* v1, internal version

* Add experiments script

* Fix minor bugs

* Update new endpoint

* Update ip

* Update

* Update

* Update

* Update

* Update

* Update

* Update

* Update

* Fix model name

* Fix docker close issues; update prompting

* Fix missed

* Fix the default port to avoid crashing on examples like '_update_browse_history_setup'

* Fix server and chromium ports in setup

* Revert and add missed dependency

* Add VLC port for docker

* Update

* Clean

---------

Co-authored-by: Tianbao Xie <tianbaoxie@U-492FC39R-0217.local>
Co-authored-by: FredWuCZ <fredwucz@outlook.com>
2024-11-11 12:36:16 +08:00
Timothyxxx
5bc48e57d5 Clean on multi_env feat 2024-11-03 10:33:04 +08:00
Dunjie Lu
8be2a40967 Docker (#92)
* multi_env

* multi_env

---------

Co-authored-by: Timothyxxx <384084775@qq.com>
2024-11-02 22:28:23 +08:00
Timothyxxx
3933e0d303 fix(docker): add file lock for port allocation to prevent race conditions 2024-11-02 14:12:57 +08:00
HappySix
900b511422 Add os_type param to VBox manager (#85) 2024-10-25 14:46:09 +08:00
FredWuCZ
05b317f151 Fix minor error on docs 2024-10-23 09:02:12 +08:00
FredWuCZ
954a78be36 Update Docker guidelines 2024-10-22 22:37:46 +08:00
FredWuCZ
278fe6b7c9 Merge Docker guidelines into Readme 2024-10-22 22:34:40 +08:00
FredWuCZ
6635e8f3fd Minor update on docs 2024-10-22 20:47:39 +08:00
FredWuCZ
e9dbc3c374 Update docs 2024-10-22 20:42:27 +08:00
FredWuCZ
82878c885c Update Ubuntu qcow2 link 2024-10-18 20:17:49 +08:00
FredWuCZ
7eaa4189ae Fix unzip 2024-10-17 19:15:37 +08:00
FredWuCZ
ec3671ae01 Update Docker image link 2024-10-17 14:55:20 +08:00
FredWuCZ
6e75e37eb0 Enable Windows VM in Docker 2024-10-17 13:05:29 +08:00
FredWuCZ
3cba868ff3 Update 2024-10-08 17:59:06 +08:00
FredWuCZ
b9339217ef Update 2024-10-03 16:09:12 +08:00
FredWuCZ
fd65cf47f6 Update Windows URL 2024-10-02 12:19:01 +08:00
FredWuCZ
0a21e24a81 Update provider 2024-09-28 16:57:58 +08:00
FredWuCZ
2b4b6d8fdf Update 2024-09-28 15:07:56 +08:00
FredWuCZ
d0b37f0831 Update 2024-09-28 12:49:29 +08:00
FredWuCZ
5552ac9dcf Update docker manager & provider 2024-09-28 10:40:27 +08:00