Jack Vial
27ba2951d1
fix(tdmpc): Add missing save_freq to tdmpc policy config ( #404 )
...
Co-authored-by: Alexander Soare <alexander.soare159@gmail.com >
2024-09-02 19:04:41 +01:00
Alexander Soare
f8a6574698
Add online training with TD-MPC as proof of concept ( #338 )
2024-07-25 11:16:38 +01:00
Alexander Soare
c0101f0948
Fix ACT temporal ensembling ( #319 )
2024-07-16 10:27:21 +01:00
Remi
5e54e39795
Add real robot devices and scripts to control real robot ( #288 )
...
Co-authored-by: Simon Alibert <alibert.sim@gmail.com >
2024-07-15 17:43:10 +02:00
Alexander Soare
cc2f6e7404
Train diffusion pusht_keypoints ( #307 )
...
Co-authored-by: Remi <re.cadene@gmail.com >
2024-07-09 12:35:50 +01:00
Simon Alibert
964f9e86d6
Cleanup config defaults ( #300 )
2024-07-04 11:53:29 +02:00
Alexander Soare
342f429f1c
Add test to make sure policy dataclass configs match yaml configs ( #292 )
2024-06-26 09:09:40 +01:00
Seungjae Lee
7d1542cae1
Add VQ-BeT ( #166 )
2024-06-26 08:55:02 +01:00
Remi
d585c73f9f
Add real-world support for ACT on Aloha/Aloha2 ( #228 )
...
Co-authored-by: Alexander Soare <alexander.soare159@gmail.com >
2024-05-31 15:31:02 +02:00
Radek Osmulski
504d2aaf48
add EpisodeAwareSampler ( #217 )
...
Co-authored-by: Alexander Soare <alexander.soare159@gmail.com >
2024-05-31 13:43:47 +01:00
Alexander Soare
e3b9f1c19b
Add resume training ( #205 )
...
Co-authored-by: Remi <re.cadene@gmail.com >
2024-05-28 12:04:23 +01:00
Alexander Soare
2b270d085b
Disable online training ( #202 )
...
Co-authored-by: Remi <re.cadene@gmail.com >
2024-05-20 18:27:54 +01:00
Alexander Soare
5ec0af62c6
Explain why n_encoder_layers=1 ( #193 )
2024-05-17 15:05:40 +01:00
Alexander Soare
625f0557ef
Act temporal ensembling ( #186 )
2024-05-17 14:57:49 +01:00
Alexander Soare
473345fdf6
Fix stats override in ACT config ( #161 )
2024-05-09 15:16:47 +01:00
Alexander Soare
e89521dfa0
Enable tests for TD-MPC ( #160 )
2024-05-09 13:42:12 +01:00
Akshay Kashyap
460df2ccea
Support for DDIMScheduler in Diffusion Policy ( #146 )
2024-05-08 18:05:16 +01:00
Alexander Soare
a8e245fb31
Remove loss masking from diffusion policy ( #135 )
2024-05-06 07:27:01 +01:00
Alexander Soare
f3bba0270d
Remove EMA model from Diffusion Policy ( #134 )
2024-05-05 11:26:12 +01:00
Simon Alibert
c77633c38c
Add regression tests ( #119 )
...
- Add `tests/scripts/save_policy_to_safetensor.py` to generate test artifacts
- Add `test_backward_compatibility to test generated outputs from the policies against artifacts
2024-05-04 16:20:30 +02:00
Alexander Soare
bccee745c3
Refactor eval.py ( #127 )
2024-05-03 17:33:16 +01:00
Alexander Soare
d1855a202a
Refactor TD-MPC ( #103 )
...
Co-authored-by: Cadene <re.cadene@gmail.com >
Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com >
2024-05-01 16:40:04 +01:00
Alexander Soare
a4891095e4
Use PytorchModelHubMixin to save models as safetensors ( #125 )
...
Co-authored-by: Remi <re.cadene@gmail.com >
2024-05-01 16:17:18 +01:00
Alexander Soare
9d60dce6f3
Tidy up yaml configs ( #121 )
2024-04-30 16:08:59 +01:00
Simon Alibert
791506dfb8
Remove warnings ( #111 )
...
- Replace `use_pretrained_backbone` with `pretrained_backbone_weights`
- Bump diffusers' minimum version `0.26.3` -> `0.27.2`
- Add ignore flags in CI's pytest
- Change Box observation spaces in simulation environments
- Set `version_base="1.2"` in Hydra initializations
- Bump einops' minimum version `0.7.0` -> `0.8.0`
2024-04-29 00:31:33 +02:00
Alexander Soare
45f351c618
Make sure targets are normalized too ( #106 )
2024-04-26 11:18:39 +01:00
Remi
e760e4cd63
Move normalization to policy for act and diffusion ( #90 )
...
Co-authored-by: Alexander Soare <alexander.soare159@gmail.com >
2024-04-25 11:47:38 +02:00
Alexander Soare
c50a13ab31
draft
2024-04-17 10:50:54 +01:00
Alexander Soare
03b08eb74e
backup wip
2024-04-16 12:51:32 +01:00
Alexander Soare
5608e659e6
backup wip
2024-04-15 19:06:44 +01:00
Alexander Soare
14f3ffb412
Merge remote-tracking branch 'upstream/main' into refactor_dp
2024-04-15 17:08:28 +01:00
Alexander Soare
30023535f9
revision 1
2024-04-15 10:56:43 +01:00
Alexander Soare
ef4bd9e25c
Use dataclass config for ACT
2024-04-15 09:39:23 +01:00
Alexander Soare
55e484124a
draft pr
2024-04-12 17:03:59 +01:00
Alexander Soare
94cc22da9e
Merge remote-tracking branch 'upstream/main' into refactor_dp
2024-04-11 17:52:10 +01:00
Alexander Soare
976a197f98
backup wip
2024-04-11 17:51:35 +01:00
Cadene
7c8eb7ff19
Merge remote-tracking branch 'origin/user/rcadene/2024_03_31_remove_torchrl' into user/rcadene/2024_03_31_remove_torchrl
2024-04-10 11:34:51 +00:00
Cadene
06573d7f67
online training works (loss goes down), remove repeat_action, eval_policy outputs episodes data, eval_policy uses max_episodes_rendered
2024-04-10 11:34:01 +00:00
Alexander Soare
e6c6c2367f
Merge remote-tracking branch 'upstream/user/rcadene/2024_03_31_remove_torchrl' into refactor_act
2024-04-09 08:36:28 +01:00
Cadene
6902e01db0
tests are passing for aloha/act policies, removes abstract policy
2024-04-09 03:28:56 +00:00
Cadene
73dfa3c8e3
tests for tdmpc and diffusion policy are passing
2024-04-09 02:50:32 +00:00
Alexander Soare
9c96349926
Merge remote-tracking branch 'upstream/user/rcadene/2024_03_31_remove_torchrl' into refactor_act
2024-04-08 15:44:00 +01:00
Cadene
70aaf1c4cb
test_datasets.py are passing!
2024-04-08 14:16:57 +00:00
Alexander Soare
86365adf9f
revision
2024-04-08 14:44:46 +01:00
Alexander Soare
863f28ffd8
ready for review
2024-04-08 13:10:19 +01:00
Alexander Soare
1bab4a1dd5
Eval reproduction works with gym_aloha
2024-04-08 10:23:26 +01:00
Alexander Soare
e982c732f1
Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl
2024-04-08 09:25:45 +01:00
Cadene
4371a5570d
Remove latency, tdmpc policy passes tests (TODO: make it work with online RL)
2024-04-07 16:01:22 +00:00
Alexander Soare
ab2286025b
Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl
2024-04-05 18:06:00 +01:00
Cadene
f56b1a0e16
WIP tdmpc
2024-04-05 13:40:31 +00:00