Remi Cadene | f95ecd66fc | Improve visualize_dataset, Improve AbstractReplayBuffer, Small improvements | 2024-03-06 10:15:57 +00:00
Remi Cadene | cfc304e870 | Refactor env queue, Training diffusion works (Still not converging) | 2024-03-04 11:00:51 +00:00
Remi Cadene | cbbed590a9 | Add mode to NormalizeTransform with mean_std or min_max (Not fully tested) | 2024-03-03 13:19:02 +00:00
Remi Cadene | 45b4ecb727 | pre-commit run -a | 2024-03-02 15:58:21 +00:00
Remi Cadene | 1ae6205269 | Add Normalize, non_blocking=True in tdmpc, tdmpc run (TODO: diffusion) | 2024-03-02 15:53:29 +00:00
Cadene | 0b9027f05e | Clean logging, Refactor | 2024-02-29 23:21:27 +00:00
Simon Alibert | 7e024fdce6 | Ran pre-commit run --all-files | 2024-02-29 13:37:48 +01:00
Simon Alibert | f1708c8a37 | install fix | 2024-02-28 12:35:49 +01:00
Cadene | 7df542445c | Small fix and improve logging message | 2024-02-27 11:44:26 +00:00
Cadene | 21670dce90 | Refactor train, eval_policy, logger, Add diffusion.yaml (WIP) | 2024-02-26 01:10:09 +00:00
Cadene | ed80db2846 | Sanitize cfg.env | 2024-02-25 12:02:29 +00:00
Cadene | 598bb496b0 | Add policies/factory, Add test, Add _self_ in config | 2024-02-25 10:50:23 +00:00
Cadene | 64b5920e94 | format | 2024-02-24 18:19:18 +00:00
Cadene | aed02dc7c6 | Add multithreading for video generation, Speed policy sampling | 2024-02-24 18:18:39 +00:00
Cadene | e3643d6146 | Wandb works, One output dir | 2024-02-22 12:14:12 +00:00
Cadene | ece89730e6 | Add pusht dataset (TODO verify reward is aligned), Refactor visualize_dataset, Add video_dir, fps, state_dim, action_dim to config (Training works) | 2024-02-21 00:49:40 +00:00
Cadene | 3da6ffb2cb | Fix unit tests, Refactor, Add pusht env, (TODO pusht replay buffer, image preprocessing) | 2024-02-20 12:26:57 +00:00
Cadene | a5c305a7a4 | offline training + online finetuning converge to 33 reward! | 2024-02-18 01:23:44 +00:00
Cadene | 228c045674 | Eval reproduced! Train running (but not reproduced) | 2024-02-10 15:46:24 +00:00
Cadene | 937b2f8cba | Add option for random policy | 2024-01-31 13:54:32 +00:00
Cadene | 5a5b190f70 | Add common, refactor eval with eval_policy | 2024-01-31 13:48:12 +00:00
Cadene | 1e52499490 | eval.mp4 works! | 2024-01-30 23:30:14 +00:00
Cadene | 1144819c29 | First real commit, simxarm env added with torchrl! | 2024-01-29 12:49:30 +00:00