NielsRogge
86bbd16d43
Improve discoverability on the hub ( #325 )
...
Co-authored-by: Lucain <lucainp@gmail.com >
Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com >
2024-08-19 15:16:46 +02:00
Alexander Soare
f8a6574698
Add online training with TD-MPC as proof of concept ( #338 )
2024-07-25 11:16:38 +01:00
Alexander Soare
abbb1d2367
Make sure policies don't mutate the batch ( #323 )
2024-07-22 20:38:33 +01:00
Wael Karkoub
54c9776bde
Improves Type Annotations ( #252 )
2024-06-10 19:09:48 +01:00
Alexander Soare
68c1b13406
Make policies compatible with other/multiple image keys ( #149 )
2024-05-16 13:51:53 +01:00
Simon Alibert
f52f4f2cd2
Add copyrights ( #157 )
2024-05-15 12:13:09 +02:00
Simon Alibert
c77633c38c
Add regression tests ( #119 )
...
- Add `tests/scripts/save_policy_to_safetensor.py` to generate test artifacts
- Add `test_backward_compatibility to test generated outputs from the policies against artifacts
2024-05-04 16:20:30 +02:00
Simon Alibert
c015252e20
Remove batch length restrictions in select_action ( #123 )
2024-05-04 15:33:55 +02:00
Alexander Soare
d1855a202a
Refactor TD-MPC ( #103 )
...
Co-authored-by: Cadene <re.cadene@gmail.com >
Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com >
2024-05-01 16:40:04 +01:00