Alexander Soare
|
45f351c618
|
Make sure targets are normalized too (#106)
|
2024-04-26 11:18:39 +01:00 |
|
Remi
|
e760e4cd63
|
Move normalization to policy for act and diffusion (#90)
Co-authored-by: Alexander Soare <alexander.soare159@gmail.com>
|
2024-04-25 11:47:38 +02:00 |
|
Simon Alibert
|
7ad1909641
|
Tests cleaning & simplification (#81)
|
2024-04-18 14:47:42 +02:00 |
|
Alexander Soare
|
dd9c6eed15
|
Add temporary patch in TD-MPC
|
2024-04-17 16:27:57 +01:00 |
|
Alexander Soare
|
2298ddf226
|
wip
|
2024-04-17 16:21:37 +01:00 |
|
Alexander Soare
|
63e5ec6483
|
revert some formatting changes
|
2024-04-17 11:40:49 +01:00 |
|
Alexander Soare
|
c50a13ab31
|
draft
|
2024-04-17 10:50:54 +01:00 |
|
Alexander Soare
|
cb3978b5f3
|
backup wip
|
2024-04-16 18:12:39 +01:00 |
|
Alexander Soare
|
0eb899de73
|
Merge remote-tracking branch 'upstream/main' into unify_policy_api
|
2024-04-16 17:30:41 +01:00 |
|
Alexander Soare
|
a9496fde39
|
revision 1
|
2024-04-16 17:15:51 +01:00 |
|
Alexander Soare
|
23be5e1e7b
|
backup wip
|
2024-04-16 16:31:44 +01:00 |
|
Alexander Soare
|
9c2f10bd04
|
ready for review
|
2024-04-16 13:43:58 +01:00 |
|
Alexander Soare
|
03b08eb74e
|
backup wip
|
2024-04-16 12:51:32 +01:00 |
|
Alexander Soare
|
5608e659e6
|
backup wip
|
2024-04-15 19:06:44 +01:00 |
|
Alexander Soare
|
14f3ffb412
|
Merge remote-tracking branch 'upstream/main' into refactor_dp
|
2024-04-15 17:08:28 +01:00 |
|
Alexander Soare
|
30023535f9
|
revision 1
|
2024-04-15 10:56:43 +01:00 |
|
Alexander Soare
|
40d417ef60
|
Make sure to make remove all traces of omegaconf from policy config
|
2024-04-15 09:59:18 +01:00 |
|
Alexander Soare
|
ef4bd9e25c
|
Use dataclass config for ACT
|
2024-04-15 09:39:23 +01:00 |
|
Alexander Soare
|
34f00753eb
|
remove policy.py
|
2024-04-12 17:13:25 +01:00 |
|
Alexander Soare
|
55e484124a
|
draft pr
|
2024-04-12 17:03:59 +01:00 |
|
Alexander Soare
|
6d0a45a97d
|
ready for review
|
2024-04-12 11:36:52 +01:00 |
|
Alexander Soare
|
5666ec3ec7
|
backup wip
|
2024-04-11 18:33:54 +01:00 |
|
Alexander Soare
|
94cc22da9e
|
Merge remote-tracking branch 'upstream/main' into refactor_dp
|
2024-04-11 17:52:10 +01:00 |
|
Alexander Soare
|
976a197f98
|
backup wip
|
2024-04-11 17:51:35 +01:00 |
|
Cadene
|
7c8eb7ff19
|
Merge remote-tracking branch 'origin/user/rcadene/2024_03_31_remove_torchrl' into user/rcadene/2024_03_31_remove_torchrl
|
2024-04-10 11:34:51 +00:00 |
|
Cadene
|
06573d7f67
|
online training works (loss goes down), remove repeat_action, eval_policy outputs episodes data, eval_policy uses max_episodes_rendered
|
2024-04-10 11:34:01 +00:00 |
|
Alexander Soare
|
e6c6c2367f
|
Merge remote-tracking branch 'upstream/user/rcadene/2024_03_31_remove_torchrl' into refactor_act
|
2024-04-09 08:36:28 +01:00 |
|
Cadene
|
6902e01db0
|
tests are passing for aloha/act policies, removes abstract policy
|
2024-04-09 03:28:56 +00:00 |
|
Cadene
|
73dfa3c8e3
|
tests for tdmpc and diffusion policy are passing
|
2024-04-09 02:50:32 +00:00 |
|
Alexander Soare
|
9c96349926
|
Merge remote-tracking branch 'upstream/user/rcadene/2024_03_31_remove_torchrl' into refactor_act
|
2024-04-08 15:44:00 +01:00 |
|
Cadene
|
70aaf1c4cb
|
test_datasets.py are passing!
|
2024-04-08 14:16:57 +00:00 |
|
Alexander Soare
|
0b4c42f4ff
|
typos
|
2024-04-08 14:59:37 +01:00 |
|
Alexander Soare
|
62b18a7607
|
Add type hints
|
2024-04-08 14:51:45 +01:00 |
|
Alexander Soare
|
86365adf9f
|
revision
|
2024-04-08 14:44:46 +01:00 |
|
Alexander Soare
|
863f28ffd8
|
ready for review
|
2024-04-08 13:10:19 +01:00 |
|
Alexander Soare
|
1bab4a1dd5
|
Eval reproduction works with gym_aloha
|
2024-04-08 10:23:26 +01:00 |
|
Alexander Soare
|
e982c732f1
|
Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl
|
2024-04-08 09:25:45 +01:00 |
|
Cadene
|
4371a5570d
|
Remove latency, tdmpc policy passes tests (TODO: make it work with online RL)
|
2024-04-07 16:01:22 +00:00 |
|
Alexander Soare
|
8d2463f45b
|
backup wip
|
2024-04-05 18:46:30 +01:00 |
|
Alexander Soare
|
ab2286025b
|
Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl
|
2024-04-05 18:06:00 +01:00 |
|
Alexander Soare
|
1e71196fe3
|
backup wip
|
2024-04-05 17:38:29 +01:00 |
|
Cadene
|
f56b1a0e16
|
WIP tdmpc
|
2024-04-05 13:40:31 +00:00 |
|
Alexander Soare
|
0b8d27ff2c
|
Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl
|
2024-04-05 12:48:11 +01:00 |
|
Cadene
|
c17dffe944
|
policies/utils.py
|
2024-04-05 11:47:15 +00:00 |
|
Alexander Soare
|
8ba88ba250
|
Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl
|
2024-04-05 12:34:14 +01:00 |
|
Cadene
|
a420714ee4
|
fix: action_is_pad was missing in compute_loss
|
2024-04-05 11:33:39 +00:00 |
|
Alexander Soare
|
9d77f5773d
|
Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl
|
2024-04-05 11:41:11 +01:00 |
|
Alexander Soare
|
edb125b351
|
backup wip
|
2024-04-05 11:03:28 +01:00 |
|
Cadene
|
5af00d0c1e
|
fix train.py, stats, eval.py (training is running)
|
2024-04-05 09:31:39 +00:00 |
|
Alexander Soare
|
3a4dfa82fe
|
backup wip
|
2024-04-04 18:34:41 +01:00 |
|