Commit Graph

50 Commits

Author SHA1 Message Date
Simon Alibert
ab3cd3a7ba (WIP) Add gym-xarm 2024-04-05 15:35:20 +02:00
Cadene
1cdfbc8b52 WIP
WIP

WIP train.py works, loss going down

WIP eval.py

Fix

WIP (eval running, TODO: verify results reproduced)

Eval works! (testing reproducibility)

WIP

pretrained model pusht reproduces same results as torchrl

pretrained model pusht reproduces same results as torchrl

Remove AbstractPolicy, Move all queues in select_action

WIP test_datasets passed (TODO: re-enable NormalizeTransform)
2024-04-04 15:31:03 +00:00
Simon Alibert
6bddcb647e Add test_aloha env test 2024-03-28 10:35:11 +01:00
Alexander Soare
1a1308d62f fix environment seeding
add fixes for reproducibility

only try to start env if it is closed

revision

fix normalization and data type

Improve README

Improve README

Tests are passing, Eval pretrained model works, Add gif

Update gif

Update gif

Update gif

Update gif

Update README

Update README

update minor

Update README.md

Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com>

Update README.md

Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com>

Address suggestions

Update thumbnail + stats

Update thumbnail + stats

Update README.md

Co-authored-by: Alexander Soare <alexander.soare159@gmail.com>

Add more comments

Add test_examples.py
2024-03-26 10:10:43 +00:00
Simon Alibert
c5635b7d94 Minor fixes for #47 2024-03-25 18:50:47 +01:00
Simon Alibert
7cdd6d2450 Renamed set_seed -> set_global_seed 2024-03-25 17:19:28 +01:00
Simon Alibert
058ac991eb Add simxarm back into tests 2024-03-25 16:35:46 +01:00
Simon Alibert
dc89166bee Upgrade gym to gymnasium 2024-03-25 15:12:21 +01:00
Simon Alibert
5ef813ff1e Remove deprecated code 2024-03-25 13:22:49 +01:00
Simon Alibert
127de1258d WIP 2024-03-25 12:28:07 +01:00
Cadene
b905111895 fix render issue 2024-03-25 12:28:07 +01:00
Simon Alibert
0c41675986 fix __init__ import Base 2024-03-25 12:28:07 +01:00
Simon Alibert
1c24bbda3f WIP Upgrading simxarm from mujoco-py to mujoco Python bindings 2024-03-25 12:28:07 +01:00
Alexander Soare
15ff3b3af8 add fixes for reproducibility 2024-03-22 15:06:57 +00:00
Alexander Soare
b9047fbdd2 fix environment seeding 2024-03-22 13:25:23 +00:00
Alexander Soare
72d3c3120b Merge remote-tracking branch 'upstream/main' into fix_pusht_diffusion 2024-03-21 10:20:52 +00:00
Alexander Soare
acf1174447 ready for review 2024-03-21 10:18:50 +00:00
Alexander Soare
d323993569 backup wip 2024-03-20 15:01:27 +00:00
Alexander Soare
4b7ec81dde remove abstractmethods, fix online training 2024-03-20 14:49:41 +00:00
Alexander Soare
b1ec3da035 remove internal rendering hooks 2024-03-20 09:23:23 +00:00
Alexander Soare
c5010fee9a fix seeding 2024-03-20 08:21:33 +00:00
Alexander Soare
18fa88475b Move reset_warning_issued flag to class attribute 2024-03-20 08:09:38 +00:00
Alexander Soare
896a11f60e backup wip 2024-03-19 18:50:04 +00:00
Alexander Soare
ea17f4ce50 backup wip 2024-03-19 16:02:09 +00:00
Alexander Soare
ba91976944 wip: still needs batch logic for act and tdmpc 2024-03-14 15:24:10 +00:00
Remi Cadene
9d002032d1 Add Aloha env and ACT policy
WIP Aloha env tests pass

Rendering works (fps look fast tho? TODO action bounding is too wide [-1,1])

Update README

Copy paste from act repo

Remove download.py add a WIP for Simxarm

Remove download.py add a WIP for Simxarm

Add act yaml (TODO: try train.py)

Training can run (TODO: eval)

Add tasks without end_effector that are compatible with dataset, Eval can run (TODO: training and pretrained model)

Add AbstractEnv, Refactor AlohaEnv, Add rendering_hook in env, Minor modifications, (TODO: Refactor Pusht and Simxarm)

poetry lock

fix bug in compute_stats for action normalization

fix more bugs in normalization

fix training

fix import

PushtEnv inherits from AbstractEnv, Improve factory Normalization

Add _make_env to EnvAbstract

Add call_rendering_hooks to pusht env

SimxarmEnv inherits from AbstractEnv (NOT TESTED)

Add aloha tests artifacts + update pusht stats

fix image normalization: before env was in [0,1] but dataset in [0,255], and now both in [0,255]

Small fix on simxarm

Add next to obs

Add top camera to Aloha env (TODO: make it compatible with set of cameras)

Add top camera to Aloha env (TODO: make it compatible with set of cameras)
2024-03-12 10:27:48 +00:00
Remi
fab2b3240b Merge pull request #17 from Cadene/user/rcadene/2024_03_11_bugfix_compute_stats
Fix bugs with normalization
2024-03-11 13:44:07 +01:00
Cadene
84a1647c01 fix import 2024-03-11 12:41:14 +00:00
Cadene
ccd5dc5a42 fix training 2024-03-11 12:33:15 +00:00
Simon Alibert
f54ee7cda0 Fix paths 2024-03-10 16:51:50 +01:00
Simon Alibert
134009f337 Remove init files 2024-03-10 16:38:49 +01:00
Simon Alibert
6c867d78ef Integrate pusht env from diffusion 2024-03-10 16:33:03 +01:00
Remi Cadene
524d29aa80 fix tests 2024-03-07 13:23:22 +01:00
Remi Cadene
cfc304e870 Refactor env queue, Training diffusion works (Still not converging) 2024-03-04 11:00:51 +00:00
Remi Cadene
fddd9f0311 Add possibility for the policy to provide a sequence of actions to the env 2024-03-03 14:02:24 +00:00
Remi Cadene
0f2fa4d9ef Add obs queue to pusht, Set n_obs_steps=2 for diffusion (Not fully tested) 2024-03-03 13:21:31 +00:00
Remi Cadene
cbbed590a9 Add mode to NormalizeTransform with mean_std or min_max (Not fully tested) 2024-03-03 13:19:02 +00:00
Remi Cadene
45b4ecb727 pre-commit run -a 2024-03-02 15:58:21 +00:00
Remi Cadene
1ae6205269 Add Normalize, non_blocking=True in tdmpc, tdmpc run (TODO: diffusion) 2024-03-02 15:53:29 +00:00
Simon Alibert
2c05b75f45 Fixes for PR #3 2024-02-29 21:46:41 +01:00
Simon Alibert
7e024fdce6 Ran pre-commit run --all-files 2024-02-29 13:37:48 +01:00
Cadene
b16c334825 Refactor configs to have env in separate yaml + Fix training 2024-02-25 17:42:47 +00:00
Cadene
ed80db2846 Sanitize cfg.env 2024-02-25 12:02:29 +00:00
Cadene
63d18475cc fix simxarm factory 2024-02-22 13:04:24 +00:00
Cadene
e3643d6146 Wandb works, One output dir 2024-02-22 12:14:12 +00:00
Cadene
3dc14b5576 Add Prod transform, Add test_factory 2024-02-20 14:22:16 +00:00
Cadene
3da6ffb2cb Fix unit tests, Refactor, Add pusht env, (TODO pusht replay buffer, image preprocessing) 2024-02-20 12:26:57 +00:00
Cadene
c202c2b3c2 Online finetuning runs (sometimes crashes because of NaNs) 2024-02-16 15:13:24 +00:00
Cadene
228c045674 Eval reproduced! Train running (but not reproduced) 2024-02-10 15:46:24 +00:00
Cadene
5a5b190f70 Add common, refactor eval with eval_policy 2024-01-31 13:48:12 +00:00