Update README and ROADMAP; Fix typos; optimize the code for llm calling in agent.py
This commit is contained in:
13
README.md
13
README.md
@@ -63,6 +63,7 @@ pip install desktop-env
|
||||
vmrun -T ws list
|
||||
```
|
||||
If the installation along with the environment variable set is successful, you will see the message showing the current running virtual machines.
|
||||
> **Note:** We will also support using [VirtualBox](https://www.virtualbox.org/) in the near future if you have issues with VMware Pro. However, features such as parallelism and macOS on Apple chips are not supported.
|
||||
|
||||
All set! Our setup script will automatically download the necessary virtual machines and configure the environment for you.
|
||||
|
||||
@@ -135,6 +136,14 @@ Correctly implement the agent interface and import your customized version in th
|
||||
Afterward, you can execute a command similar to the one in the previous section to run the benchmark on your agent.
|
||||
|
||||
## ❓ FAQ
|
||||
### What is the username and password for the virtual machines?
|
||||
The username and password for the virtual machines are as follows:
|
||||
- **Ubuntu:** `user` / `password`
|
||||
|
||||
### How can I configure a proxy for the VM if I'm behind a GFW?
|
||||
|
||||
See [Proxy Guideline](PROXY_GUIDELINE.md).
|
||||
|
||||
### What are the running times and costs under different settings?
|
||||
| Setting | Expected Time* | Budget Cost (Full Test Set/Small Test Set) |
|
||||
| ------------------------------ | -------------- | ------------------------------------------ |
|
||||
@@ -145,10 +154,6 @@ Afterward, you can execute a command similar to the one in the previous section
|
||||
|
||||
\*No environment parallelism. Calculated in April 2024.
|
||||
|
||||
### How can I configure a proxy for the VM if I'm behind a GFW?
|
||||
|
||||
See [Proxy Guideline](PROXY_GUIDELINE.md)
|
||||
|
||||
## 📄 Citation
|
||||
If you find this environment useful, please consider citing our work:
|
||||
```
|
||||
|
||||
Reference in New Issue
Block a user