Alexander Soare
03b08eb74e
backup wip
2024-04-16 12:51:32 +01:00
Alexander Soare
94cc22da9e
Merge remote-tracking branch 'upstream/main' into refactor_dp
2024-04-11 17:52:10 +01:00
Alexander Soare
976a197f98
backup wip
2024-04-11 17:51:35 +01:00
Cadene
c1a618e567
fix pusht images type from float32 to uint8, update gym-pusht dependencies
2024-04-11 14:29:16 +00:00
Cadene
657b27cc8f
fix load_data_with_delta_timestamps and add tests
2024-04-11 13:00:09 +00:00
Cadene
3914831585
remove __name__ outside script
2024-04-10 17:16:44 +00:00
Cadene
f8c5a2eb10
remove comment
2024-04-10 17:14:02 +00:00
Cadene
9874652c2f
enable test_compute_stats
...
enable test_compute_stats
2024-04-10 17:12:54 +00:00
Cadene
e8622154f8
Replace import gym_pusht in pusht dataset by dynamic import
2024-04-10 15:56:18 +00:00
Cadene
c08003278e
test_examples are passing
2024-04-10 13:45:45 +00:00
Cadene
7c8eb7ff19
Merge remote-tracking branch 'origin/user/rcadene/2024_03_31_remove_torchrl' into user/rcadene/2024_03_31_remove_torchrl
2024-04-10 11:34:51 +00:00
Cadene
06573d7f67
online training works (loss goes down), remove repeat_action, eval_policy outputs episodes data, eval_policy uses max_episodes_rendered
2024-04-10 11:34:01 +00:00
Alexander Soare
50e4c8050c
Merge remote-tracking branch 'upstream/user/rcadene/2024_03_31_remove_torchrl' into refactor_act
2024-04-08 17:13:11 +01:00
Alexander Soare
9c96349926
Merge remote-tracking branch 'upstream/user/rcadene/2024_03_31_remove_torchrl' into refactor_act
2024-04-08 15:44:00 +01:00
Simon Alibert
3f6dfa4916
Add gym-aloha, rename simxarm -> xarm, refactor
2024-04-08 16:24:11 +02:00
Cadene
70aaf1c4cb
test_datasets.py are passing!
2024-04-08 14:16:57 +00:00
Alexander Soare
863f28ffd8
ready for review
2024-04-08 13:10:19 +01:00
Alexander Soare
e982c732f1
Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl
2024-04-08 09:25:45 +01:00
Cadene
4371a5570d
Remove latency, tdmpc policy passes tests (TODO: make it work with online RL)
2024-04-07 16:01:22 +00:00
Cadene
44656d2706
test_envs are passing
2024-04-05 23:27:12 +00:00
Alexander Soare
1e71196fe3
backup wip
2024-04-05 17:38:29 +01:00
Alexander Soare
4863e54ce9
Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl
2024-04-05 12:00:31 +01:00
Cadene
ad3379a73a
fix memory leak due to itertools.cycle
2024-04-05 10:59:32 +00:00
Alexander Soare
9d77f5773d
Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl
2024-04-05 11:41:11 +01:00
Cadene
5af00d0c1e
fix train.py, stats, eval.py (training is running)
2024-04-05 09:31:39 +00:00
Cadene
c93ce35d8c
WIP stats (TODO: run tests on stats + cmpute them)
2024-04-04 16:36:03 +00:00
Cadene
1cdfbc8b52
WIP
...
WIP
WIP train.py works, loss going down
WIP eval.py
Fix
WIP (eval running, TODO: verify results reproduced)
Eval works! (testing reproducibility)
WIP
pretrained model pusht reproduces same results as torchrl
pretrained model pusht reproduces same results as torchrl
Remove AbstractPolicy, Move all queues in select_action
WIP test_datasets passed (TODO: re-enable NormalizeTransform)
2024-04-04 15:31:03 +00:00
Alexander Soare
c7d70a8db9
Merge remote-tracking branch 'upstream/main' into refactor_act
2024-04-03 10:08:12 +01:00
Alexander Soare
caf4ffcf65
add TODO
2024-04-03 09:56:46 +01:00
Alexander Soare
c50a62dd6d
clarifying math
2024-04-03 09:47:38 +01:00
Alexander Soare
e9eb262293
numerically sound mean computation
2024-04-03 09:44:20 +01:00
Alexander Soare
65ef8c30d0
backup wip
2024-04-02 19:13:49 +01:00
Alexander Soare
2b928eedd4
backup wip
2024-04-02 19:11:53 +01:00
Alexander Soare
a6edb85da4
Remove random sampling
2024-04-02 16:52:38 +01:00
Alexander Soare
95293d459d
fix stats computation
2024-04-02 16:40:33 +01:00
Alexander Soare
68d02c80cf
Remove b/c workaround
2024-03-27 12:03:19 +00:00
Cadene
9ced0cf1fb
unskip
2024-03-26 10:45:31 +00:00
Cadene
5a46b8a2a9
fix tests
2024-03-26 10:24:46 +00:00
Alexander Soare
1a1308d62f
fix environment seeding
...
add fixes for reproducibility
only try to start env if it is closed
revision
fix normalization and data type
Improve README
Improve README
Tests are passing, Eval pretrained model works, Add gif
Update gif
Update gif
Update gif
Update gif
Update README
Update README
update minor
Update README.md
Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com >
Update README.md
Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com >
Address suggestions
Update thumbnail + stats
Update thumbnail + stats
Update README.md
Co-authored-by: Alexander Soare <alexander.soare159@gmail.com >
Add more comments
Add test_examples.py
2024-03-26 10:10:43 +00:00
Simon Alibert
c5635b7d94
Minor fixes for #47
2024-03-25 18:50:47 +01:00
Simon Alibert
d3adaf1379
Add stat.pth for xarm_lift_medium
2024-03-25 15:55:45 +01:00
Simon Alibert
c0833f1c2d
Remove simxarm download and preproc hack
2024-03-25 12:41:17 +01:00
Simon Alibert
de5c30405e
fix wrong version
2024-03-25 12:35:06 +01:00
Simon Alibert
462e7469e8
Add xarm_lift_medium revision 1.0 to hub
2024-03-25 12:28:07 +01:00
Cadene
b905111895
fix render issue
2024-03-25 12:28:07 +01:00
Simon Alibert
1c24bbda3f
WIP Upgrading simxam from mujoco-py to mujoco python bindings
2024-03-25 12:28:07 +01:00
Cadene
d2ef43436c
move from cadene to lerobot
2024-03-23 13:34:35 +00:00
Cadene
40f3783fca
v1.2
2024-03-23 11:41:56 +00:00
Alexander Soare
8720c568d0
Add ability to eval hub model
2024-03-22 10:26:55 +00:00
Cadene
82e6e01651
v1.1
2024-03-20 17:34:00 +00:00