Adapt for Windows os; Refine README

This commit is contained in:
Timothyxxx
2023-11-27 00:29:09 +08:00
parent 6dee58252e
commit 8c0525c20e
6 changed files with 66 additions and 84 deletions

@@ -13,7 +13,7 @@
3. Set up bridge for connecting to VM
1. Option 1: Install [xdotool](https://github.com/jordansissel/xdotool) on VM
2. Option 2: Install [mouse](https://github.com/boppreh/mouse/)
4. Set up SSH server on VM | [Guide](./SSH_SERVER_SETUP.md)
4. Set up SSH server on VM | [Guide](./SERVER_SETUP.md)
5. Install screenshot tool (in the VM)
1. `sudo apt install imagemagick-6.q16hdri`
2. `DISPLAY=:0 import -window root screenshot.png`
@@ -22,12 +22,7 @@
2. `rm -rf ~/screenshot.png`
7. Set up Python and install [mouse](https://github.com/boppreh/mouse/) and [keyboard](https://github.com/jordansissel/xdotool)
## Windows setup guide
1. Copy the file `windows_server/main.py` to the Windows VM
2. Make sure `mouse` and `keyboard` are installed
3. Run `pythonw main.py`
4. Run `ipconfig /all` and find the IP address
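
As an alternative to reading the `ipconfig /all` output by hand, the VM's address can also be looked up from Python. The following is a minimal sketch, not part of this repository: the helper name `local_ipv4` and the probe address `8.8.8.8` are assumptions chosen for illustration. It reports the address of the interface the OS would use for outbound traffic, which may not be the adapter the host connects through if the VM has several.

```python
import socket


def local_ipv4(probe_host: str = "8.8.8.8", probe_port: int = 80) -> str:
    """Return the IPv4 address of the interface used for outbound traffic.

    Hypothetical helper for illustration; probe_host is an arbitrary
    routable address and is never actually contacted.
    """
    # Connecting a UDP socket sends no packets; it only asks the OS
    # to choose a route, which fixes the local address of the socket.
    with socket.socket(socket.AF_INET, socket.SOCK_DGRAM) as sock:
        try:
            sock.connect((probe_host, probe_port))
            return sock.getsockname()[0]
        except OSError:
            # No usable route (e.g. an isolated VM): fall back to loopback.
            return "127.0.0.1"


if __name__ == "__main__":
    print(local_ipv4())
```

If the printed address is `127.0.0.1` or belongs to the wrong adapter, fall back to `ipconfig /all` and pick the address of the adapter shared with the host.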
## Road map (Proposed)
@@ -36,6 +31,11 @@
- macOS is closed source and cannot be legally installed
- Windows is available legally and can be installed
- [x] Build gym-like python interface for controlling the VM
- [ ] Make configuration much easier from code perspective
- [ ] README
- [ ] Make it easier to install the dependencies
- [ ] Make it easier to install the VM
- [ ] Make it easier to set up the VM
- [ ] Record actions (mouse movement, clicks, keyboard input) so that humans can annotate them and we can replay them
- [ ] This part may conflict with work from [Aran Komatsuzaki](https://twitter.com/arankomatsuzaki)'s team, a.k.a. [Duck AI](https://duckai.org/)
- [ ] Build a simple task, e.g. open a browser, open a website, click on a button, and close the browser