Simon Alibert | 7ad1909641 | Tests cleaning & simplification (#81) | 2024-04-18 14:47:42 +02:00
Alexander Soare | dd9c6eed15 | Add temporary patch in TD-MPC | 2024-04-17 16:27:57 +01:00
Alexander Soare | 2298ddf226 | wip | 2024-04-17 16:21:37 +01:00
Alexander Soare | 63e5ec6483 | revert some formatting changes | 2024-04-17 11:40:49 +01:00
Alexander Soare | c50a13ab31 | draft | 2024-04-17 10:50:54 +01:00
Alexander Soare | cb3978b5f3 | backup wip | 2024-04-16 18:12:39 +01:00
Alexander Soare | 0eb899de73 | Merge remote-tracking branch 'upstream/main' into unify_policy_api | 2024-04-16 17:30:41 +01:00
Alexander Soare | a9496fde39 | revision 1 | 2024-04-16 17:15:51 +01:00
Alexander Soare | 23be5e1e7b | backup wip | 2024-04-16 16:31:44 +01:00
Alexander Soare | 9c2f10bd04 | ready for review | 2024-04-16 13:43:58 +01:00
Alexander Soare | 03b08eb74e | backup wip | 2024-04-16 12:51:32 +01:00
Alexander Soare | 5608e659e6 | backup wip | 2024-04-15 19:06:44 +01:00
Alexander Soare | 14f3ffb412 | Merge remote-tracking branch 'upstream/main' into refactor_dp | 2024-04-15 17:08:28 +01:00
Alexander Soare | 30023535f9 | revision 1 | 2024-04-15 10:56:43 +01:00
Alexander Soare | 40d417ef60 | Make sure to make remove all traces of omegaconf from policy config | 2024-04-15 09:59:18 +01:00
Alexander Soare | ef4bd9e25c | Use dataclass config for ACT | 2024-04-15 09:39:23 +01:00
Alexander Soare | 34f00753eb | remove policy.py | 2024-04-12 17:13:25 +01:00
Alexander Soare | 55e484124a | draft pr | 2024-04-12 17:03:59 +01:00
Alexander Soare | 6d0a45a97d | ready for review | 2024-04-12 11:36:52 +01:00
Alexander Soare | 5666ec3ec7 | backup wip | 2024-04-11 18:33:54 +01:00
Alexander Soare | 94cc22da9e | Merge remote-tracking branch 'upstream/main' into refactor_dp | 2024-04-11 17:52:10 +01:00
Alexander Soare | 976a197f98 | backup wip | 2024-04-11 17:51:35 +01:00
Cadene | 7c8eb7ff19 | Merge remote-tracking branch 'origin/user/rcadene/2024_03_31_remove_torchrl' into user/rcadene/2024_03_31_remove_torchrl | 2024-04-10 11:34:51 +00:00
Cadene | 06573d7f67 | online training works (loss goes down), remove repeat_action, eval_policy outputs episodes data, eval_policy uses max_episodes_rendered | 2024-04-10 11:34:01 +00:00
Alexander Soare | e6c6c2367f | Merge remote-tracking branch 'upstream/user/rcadene/2024_03_31_remove_torchrl' into refactor_act | 2024-04-09 08:36:28 +01:00
Cadene | 6902e01db0 | tests are passing for aloha/act policies, removes abstract policy | 2024-04-09 03:28:56 +00:00
Cadene | 73dfa3c8e3 | tests for tdmpc and diffusion policy are passing | 2024-04-09 02:50:32 +00:00
Alexander Soare | 9c96349926 | Merge remote-tracking branch 'upstream/user/rcadene/2024_03_31_remove_torchrl' into refactor_act | 2024-04-08 15:44:00 +01:00
Cadene | 70aaf1c4cb | test_datasets.py are passing! | 2024-04-08 14:16:57 +00:00
Alexander Soare | 0b4c42f4ff | typos | 2024-04-08 14:59:37 +01:00
Alexander Soare | 62b18a7607 | Add type hints | 2024-04-08 14:51:45 +01:00
Alexander Soare | 86365adf9f | revision | 2024-04-08 14:44:46 +01:00
Alexander Soare | 863f28ffd8 | ready for review | 2024-04-08 13:10:19 +01:00
Alexander Soare | 1bab4a1dd5 | Eval reproduction works with gym_aloha | 2024-04-08 10:23:26 +01:00
Alexander Soare | e982c732f1 | Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl | 2024-04-08 09:25:45 +01:00
Cadene | 4371a5570d | Remove latency, tdmpc policy passes tests (TODO: make it work with online RL) | 2024-04-07 16:01:22 +00:00
Alexander Soare | 8d2463f45b | backup wip | 2024-04-05 18:46:30 +01:00
Alexander Soare | ab2286025b | Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl | 2024-04-05 18:06:00 +01:00
Alexander Soare | 1e71196fe3 | backup wip | 2024-04-05 17:38:29 +01:00
Cadene | f56b1a0e16 | WIP tdmpc | 2024-04-05 13:40:31 +00:00
Alexander Soare | 0b8d27ff2c | Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl | 2024-04-05 12:48:11 +01:00
Cadene | c17dffe944 | policies/utils.py | 2024-04-05 11:47:15 +00:00
Alexander Soare | 8ba88ba250 | Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl | 2024-04-05 12:34:14 +01:00
Cadene | a420714ee4 | fix: action_is_pad was missing in compute_loss | 2024-04-05 11:33:39 +00:00
Alexander Soare | 9d77f5773d | Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl | 2024-04-05 11:41:11 +01:00
Alexander Soare | edb125b351 | backup wip | 2024-04-05 11:03:28 +01:00
Cadene | 5af00d0c1e | fix train.py, stats, eval.py (training is running) | 2024-04-05 09:31:39 +00:00
Alexander Soare | 3a4dfa82fe | backup wip | 2024-04-04 18:34:41 +01:00
Cadene | 1cdfbc8b52 | WIP; WIP; WIP train.py works, loss going down; WIP eval.py; Fix; WIP (eval running, TODO: verify results reproduced); Eval works! (testing reproducibility); WIP; pretrained model pusht reproduces same results as torchrl; pretrained model pusht reproduces same results as torchrl; Remove AbstractPolicy, Move all queues in select_action; WIP test_datasets passed (TODO: re-enable NormalizeTransform) | 2024-04-04 15:31:03 +00:00
Alexander Soare | 278336a39a | backup wip | 2024-04-03 19:23:22 +01:00