Remi
638d411cd3
Add Pi0 ( #681 )
...
Co-authored-by: Simon Alibert <simon.alibert@huggingface.co >
Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com >
Co-authored-by: Pablo <pablo.montalvo.leroux@gmail.com >
2025-02-04 18:01:04 +01:00
Simon Alibert
3c0a209f9f
Simplify configs ( #550 )
...
Co-authored-by: Remi <remi.cadene@huggingface.co >
Co-authored-by: HUANG TZU-CHUN <137322177+tc-huang@users.noreply.github.com >
2025-01-31 13:57:37 +01:00
NielsRogge
86bbd16d43
Improve discoverability on the hub ( #325 )
...
Co-authored-by: Lucain <lucainp@gmail.com >
Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com >
2024-08-19 15:16:46 +02:00
Alexander Soare
f8a6574698
Add online training with TD-MPC as proof of concept ( #338 )
2024-07-25 11:16:38 +01:00
Alexander Soare
abbb1d2367
Make sure policies don't mutate the batch ( #323 )
2024-07-22 20:38:33 +01:00
Wael Karkoub
54c9776bde
Improves Type Annotations ( #252 )
2024-06-10 19:09:48 +01:00
Remi
d585c73f9f
Add real-world support for ACT on Aloha/Aloha2 ( #228 )
...
Co-authored-by: Alexander Soare <alexander.soare159@gmail.com >
2024-05-31 15:31:02 +02:00
Alexander Soare
68c1b13406
Make policies compatible with other/multiple image keys ( #149 )
2024-05-16 13:51:53 +01:00
Simon Alibert
f52f4f2cd2
Add copyrights ( #157 )
2024-05-15 12:13:09 +02:00
Ikko Eltociear Ashimine
0ea7a8b2a3
refactor: update configuration_tdmpc.py ( #153 )
...
Co-authored-by: Alexander Soare <alexander.soare159@gmail.com >
2024-05-08 18:13:51 +01:00
Simon Alibert
c77633c38c
Add regression tests ( #119 )
...
- Add `tests/scripts/save_policy_to_safetensor.py` to generate test artifacts
- Add `test_backward_compatibility to test generated outputs from the policies against artifacts
2024-05-04 16:20:30 +02:00
Simon Alibert
c015252e20
Remove batch length restrictions in select_action ( #123 )
2024-05-04 15:33:55 +02:00
Alexander Soare
d1855a202a
Refactor TD-MPC ( #103 )
...
Co-authored-by: Cadene <re.cadene@gmail.com >
Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com >
2024-05-01 16:40:04 +01:00
Simon Alibert
7ad1909641
Tests cleaning & simplification ( #81 )
2024-04-18 14:47:42 +02:00
Alexander Soare
dd9c6eed15
Add temporary patch in TD-MPC
2024-04-17 16:27:57 +01:00
Cadene
06573d7f67
online training works (loss goes down), remove repeat_action, eval_policy outputs episodes data, eval_policy uses max_episodes_rendered
2024-04-10 11:34:01 +00:00
Cadene
6902e01db0
tests are passing for aloha/act policies, removes abstract policy
2024-04-09 03:28:56 +00:00
Cadene
73dfa3c8e3
tests for tdmpc and diffusion policy are passing
2024-04-09 02:50:32 +00:00
Cadene
70aaf1c4cb
test_datasets.py are passing!
2024-04-08 14:16:57 +00:00
Cadene
4371a5570d
Remove latency, tdmpc policy passes tests (TODO: make it work with online RL)
2024-04-07 16:01:22 +00:00
Cadene
f56b1a0e16
WIP tdmpc
2024-04-05 13:40:31 +00:00
Alexander Soare
1a1308d62f
fix environment seeding
...
add fixes for reproducibility
only try to start env if it is closed
revision
fix normalization and data type
Improve README
Improve README
Tests are passing, Eval pretrained model works, Add gif
Update gif
Update gif
Update gif
Update gif
Update README
Update README
update minor
Update README.md
Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com >
Update README.md
Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com >
Address suggestions
Update thumbnail + stats
Update thumbnail + stats
Update README.md
Co-authored-by: Alexander Soare <alexander.soare159@gmail.com >
Add more comments
Add test_examples.py
2024-03-26 10:10:43 +00:00
Simon Alibert
4631d36c05
Add get_safe_torch_device in policies
2024-03-20 18:38:55 +01:00
Alexander Soare
896a11f60e
backup wip
2024-03-19 18:50:04 +00:00
Alexander Soare
88347965c2
revert dp changes, make act and tdmpc batch friendly
2024-03-18 19:18:21 +00:00
Alexander Soare
ba91976944
wip: still needs batch logic for act and tdmp
2024-03-14 15:24:10 +00:00
Simon Alibert
134009f337
Remove init files
2024-03-10 16:38:49 +01:00
Simon Alibert
59397fb44a
Move tdmpc files
2024-03-09 18:44:36 +01:00