Commit Graph

30 Commits

Author SHA1 Message Date
Remi Cadene
f95ecd66fc Improve visualize_dataset, Improve AbstractReplayBuffer, Small improvements 2024-03-06 10:15:57 +00:00
Remi Cadene
2bcf2631b9 minor comment 2024-03-04 22:34:44 +00:00
Remi
e990f3e148 Merge pull request #6 from Cadene/user/rcadene/2024_03_04_diffusion
Make diffusion work
2024-03-04 18:30:40 +01:00
Remi Cadene
cfc304e870 Refactor env queue, Training diffusion works (Still not converging) 2024-03-04 11:00:51 +00:00
Remi Cadene
4c400b41a5 Improve log msg in train.py 2024-03-03 13:22:09 +00:00
Simon Alibert
b859e89936 Fix for PR #5 2024-03-03 13:05:21 +01:00
Simon Alibert
b33ec5a630 Add run on cpu-only compatibility 2024-03-03 12:47:26 +01:00
Remi Cadene
80785f8d0e Small fix, Refactor diffusion, Diffusion runs (TODO: remove normalization in diffusion) 2024-03-02 17:04:39 +00:00
Remi Cadene
1ae6205269 Add Normalize, non_blocking=True in tdmpc, tdmpc run (TODO: diffusion) 2024-03-02 15:53:29 +00:00
Cadene
ae050d2e94 Solve conflicts + pre-commit run -a 2024-02-29 23:31:32 +00:00
Cadene
0b9027f05e Clean logging, Refactor 2024-02-29 23:21:27 +00:00
Simon Alibert
7e024fdce6 Ran pre-commit run --all-files 2024-02-29 13:37:48 +01:00
Cadene
cf5063e50e Add diffusion policy (train and eval works, TODO: reproduce results) 2024-02-28 15:21:42 +00:00
Cadene
e543c9a42c small fix % 2024-02-27 11:54:31 +00:00
Cadene
7df542445c Small fix and improve logging message 2024-02-27 11:44:26 +00:00
Cadene
21670dce90 Refactor train, eval_policy, logger, Add diffusion.yaml (WIP) 2024-02-26 01:10:09 +00:00
Cadene
b16c334825 Refactor configs to have env in seperate yaml + Fix training 2024-02-25 17:42:47 +00:00
Cadene
ed80db2846 Sanitize cfg.env 2024-02-25 12:02:29 +00:00
Cadene
598bb496b0 Add policies/factory, Add test, Add _self_ in config 2024-02-25 10:50:23 +00:00
Cadene
aed02dc7c6 Add multithreading for video generation, Speed policy sampling 2024-02-24 18:18:39 +00:00
Cadene
63d18475cc fix simxarm factory 2024-02-22 13:04:24 +00:00
Cadene
e3643d6146 Wandb works, One output dir 2024-02-22 12:14:12 +00:00
Cadene
3dc14b5576 Add Prod transform, Add test_factory 2024-02-20 14:22:16 +00:00
Cadene
3da6ffb2cb Fix unit tests, Refactor, Add pusht env, (TODO pusht replay buffer, image preprocessing) 2024-02-20 12:26:57 +00:00
Cadene
a5c305a7a4 offline training + online finetuning converge to 33 reward! 2024-02-18 01:23:44 +00:00
Cadene
0b4084f0f8 Clean + alpha beta corresponds to config (before 0.7 and 0.9) 2024-02-16 16:27:54 +00:00
Cadene
c202c2b3c2 Online finetuning runs (sometimes crash because of nans) 2024-02-16 15:13:24 +00:00
Cadene
228c045674 Eval reproduced! Train running (but not reproduced) 2024-02-10 15:46:24 +00:00
Cadene
5a5b190f70 Add common, refactor eval with eval_policy 2024-01-31 13:48:12 +00:00
Cadene
1144819c29 First real commit, simxarm env added with torchrl! 2024-01-29 12:49:30 +00:00