Simon Alibert
ab3cd3a7ba
(WIP) Add gym-xarm
2024-04-05 15:35:20 +02:00
Cadene
c17dffe944
policies/utils.py
2024-04-05 11:47:15 +00:00
Cadene
a420714ee4
fix: action_is_pad was missing in compute_loss
2024-04-05 11:33:39 +00:00
Cadene
ad3379a73a
fix memory leak due to itertools.cycle
2024-04-05 10:59:32 +00:00
Cadene
5af00d0c1e
fix train.py, stats, eval.py (training is running)
2024-04-05 09:31:39 +00:00
Cadene
c93ce35d8c
WIP stats (TODO: run tests on stats + compute them)
2024-04-04 16:36:03 +00:00
Cadene
1cdfbc8b52
WIP
WIP
WIP train.py works, loss going down
WIP eval.py
Fix
WIP (eval running, TODO: verify results reproduced)
Eval works! (testing reproducibility)
WIP
pretrained model pusht reproduces same results as torchrl
pretrained model pusht reproduces same results as torchrl
Remove AbstractPolicy, Move all queues in select_action
WIP test_datasets passed (TODO: re-enable NormalizeTransform)
2024-04-04 15:31:03 +00:00
Alexander Soare
caf4ffcf65
add TODO
2024-04-03 09:56:46 +01:00
Alexander Soare
c50a62dd6d
clarifying math
2024-04-03 09:47:38 +01:00
Alexander Soare
e9eb262293
numerically sound mean computation
2024-04-03 09:44:20 +01:00
Alexander Soare
a6edb85da4
Remove random sampling
2024-04-02 16:52:38 +01:00
Alexander Soare
95293d459d
fix stats computation
2024-04-02 16:40:33 +01:00
Alexander Soare
f1148b8c2d
Merge remote-tracking branch 'upstream/main' into finish_examples
2024-04-01 11:31:31 +01:00
Simon Alibert
6bddcb647e
Add test_aloha env test
2024-03-28 10:35:11 +01:00
Alexander Soare
b7c9c33072
revision
2024-03-27 18:33:48 +00:00
Alexander Soare
120f0aef5c
Merge remote-tracking branch 'upstream/main' into finish_examples
2024-03-27 17:52:36 +00:00
Alexander Soare
68d02c80cf
Remove b/c workaround
2024-03-27 12:03:19 +00:00
Alexander Soare
011f2d27fe
fix tests
2024-03-26 16:40:54 +00:00
Alexander Soare
1ed0110900
finish examples 2 and 3
2024-03-26 16:13:40 +00:00
Cadene
9ced0cf1fb
unskip
2024-03-26 10:45:31 +00:00
Cadene
5a46b8a2a9
fix tests
2024-03-26 10:24:46 +00:00
Alexander Soare
1a1308d62f
fix environment seeding
add fixes for reproducibility
only try to start env if it is closed
revision
fix normalization and data type
Improve README
Improve README
Tests are passing, Eval pretrained model works, Add gif
Update gif
Update gif
Update gif
Update gif
Update README
Update README
update minor
Update README.md
Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com>
Update README.md
Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com>
Address suggestions
Update thumbnail + stats
Update thumbnail + stats
Update README.md
Co-authored-by: Alexander Soare <alexander.soare159@gmail.com>
Add more comments
Add test_examples.py
2024-03-26 10:10:43 +00:00
Simon Alibert
c5635b7d94
Minor fixes for #47
2024-03-25 18:50:47 +01:00
Simon Alibert
bcfdba109f
Update pre-commit & run on all files
2024-03-25 17:29:35 +01:00
Simon Alibert
7cdd6d2450
Renamed set_seed -> set_global_seed
2024-03-25 17:19:28 +01:00
Simon Alibert
058ac991eb
Add simxarm back into tests
2024-03-25 16:35:46 +01:00
Simon Alibert
d3adaf1379
Add stat.pth for xarm_lift_medium
2024-03-25 15:55:45 +01:00
Simon Alibert
dc89166bee
Upgrade gym to gymnasium
2024-03-25 15:12:21 +01:00
Simon Alibert
5ef813ff1e
Remove deprecated code
2024-03-25 13:22:49 +01:00
Simon Alibert
c0833f1c2d
Remove simxarm download and preproc hack
2024-03-25 12:41:17 +01:00
Simon Alibert
de5c30405e
fix wrong version
2024-03-25 12:35:06 +01:00
Simon Alibert
462e7469e8
Add xarm_lift_medium revision 1.0 to hub
2024-03-25 12:28:07 +01:00
Simon Alibert
127de1258d
WIP
2024-03-25 12:28:07 +01:00
Cadene
b905111895
fix render issue
2024-03-25 12:28:07 +01:00
Simon Alibert
0c41675986
fix __init__ import Base
2024-03-25 12:28:07 +01:00
Simon Alibert
1c24bbda3f
WIP Upgrading simxarm from mujoco-py to mujoco python bindings
2024-03-25 12:28:07 +01:00
Remi
f3cfc8b3b4
Merge pull request #46 from huggingface/user/rcadene/2024_03_23_update_stats_v1.2
Fix bug with stats.pth + Move from cadene to lerobot + Update datasets to v1.2
2024-03-24 17:53:32 +01:00
Cadene
d2ef43436c
move from cadene to lerobot
2024-03-23 13:34:35 +00:00
Cadene
40f3783fca
v1.2
2024-03-23 11:41:56 +00:00
Alexander Soare
e698d38a35
Merge remote-tracking branch 'upstream/main' into fix_environment_seeding
2024-03-22 15:11:15 +00:00
Alexander Soare
15ff3b3af8
add fixes for reproducibility
2024-03-22 15:06:57 +00:00
Alexander Soare
b9047fbdd2
fix environment seeding
2024-03-22 13:25:23 +00:00
Alexander Soare
8720c568d0
Add ability to eval hub model
2024-03-22 10:26:55 +00:00
Alexander Soare
72d3c3120b
Merge remote-tracking branch 'upstream/main' into fix_pusht_diffusion
2024-03-21 10:20:52 +00:00
Alexander Soare
acf1174447
ready for review
2024-03-21 10:18:50 +00:00
Simon Alibert
1bd50122be
Merge pull request #40 from huggingface/user/aliberts/2024_03_20_enable_mps_device
Enable mps backend for Apple silicon devices
2024-03-20 19:33:12 +01:00
Simon Alibert
4631d36c05
Add get_safe_torch_device in policies
2024-03-20 18:38:55 +01:00
Cadene
82e6e01651
v1.1
2024-03-20 17:34:00 +00:00
Alexander Soare
d323993569
backup wip
2024-03-20 15:01:27 +00:00
Alexander Soare
4b7ec81dde
remove abstractmethods, fix online training
2024-03-20 14:49:41 +00:00