Simon Alibert
c4c2ce04e7
Update pre-commits ( #733 )
2025-02-15 15:51:17 +01:00
Simon Alibert
90e099b39f
Remove offline training, refactor train.py and logging/checkpointing ( #670 )
...
Co-authored-by: Remi <remi.cadene@huggingface.co >
2025-02-11 10:36:06 +01:00
Remi
638d411cd3
Add Pi0 ( #681 )
...
Co-authored-by: Simon Alibert <simon.alibert@huggingface.co >
Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com >
Co-authored-by: Pablo <pablo.montalvo.leroux@gmail.com >
2025-02-04 18:01:04 +01:00
Simon Alibert
3c0a209f9f
Simplify configs ( #550 )
...
Co-authored-by: Remi <remi.cadene@huggingface.co >
Co-authored-by: HUANG TZU-CHUN <137322177+tc-huang@users.noreply.github.com >
2025-01-31 13:57:37 +01:00
NielsRogge
86bbd16d43
Improve discoverability on the hub ( #325 )
...
Co-authored-by: Lucain <lucainp@gmail.com >
Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com >
2024-08-19 15:16:46 +02:00
Alexander Soare
f8a6574698
Add online training with TD-MPC as proof of concept ( #338 )
2024-07-25 11:16:38 +01:00
Alexander Soare
abbb1d2367
Make sure policies don't mutate the batch ( #323 )
2024-07-22 20:38:33 +01:00
Wael Karkoub
54c9776bde
Improves Type Annotations ( #252 )
2024-06-10 19:09:48 +01:00
Alexander Soare
68c1b13406
Make policies compatible with other/multiple image keys ( #149 )
2024-05-16 13:51:53 +01:00
Simon Alibert
f52f4f2cd2
Add copyrights ( #157 )
2024-05-15 12:13:09 +02:00
Simon Alibert
c77633c38c
Add regression tests ( #119 )
...
- Add `tests/scripts/save_policy_to_safetensor.py` to generate test artifacts
- Add `test_backward_compatibility to test generated outputs from the policies against artifacts
2024-05-04 16:20:30 +02:00
Simon Alibert
c015252e20
Remove batch length restrictions in select_action ( #123 )
2024-05-04 15:33:55 +02:00
Alexander Soare
d1855a202a
Refactor TD-MPC ( #103 )
...
Co-authored-by: Cadene <re.cadene@gmail.com >
Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com >
2024-05-01 16:40:04 +01:00