Commit Graph

51 Commits

Author SHA1 Message Date
Remi Cadene
524d29aa80 fix tests 2024-03-07 13:23:22 +01:00
Remi Cadene
d782b029e1 Add aloha dataset 2024-03-06 10:26:32 +00:00
Remi
49c0955f97 Merge pull request #7 from Cadene/user/rcadene/2024_03_05_abstract_replay_buffer
Add AbstractReplayBuffer
2024-03-06 11:25:24 +01:00
Remi Cadene
eed24b083a small fix 2024-03-06 10:21:22 +00:00
Remi Cadene
f95ecd66fc Improve visualize_dataset, Improve AbstractReplayBuffer, Small improvements 2024-03-06 10:15:57 +00:00
Simon Alibert
a6d353c419 Fix 2024-03-05 17:00:17 +01:00
Remi Cadene
2f80d71c3e Remove noqa-F821 2024-03-05 10:22:21 +00:00
Remi Cadene
d4e0849970 Refactor datasets with abstract class 2024-03-05 10:20:57 +00:00
Remi Cadene
a027f4edfb Add cfg.offline_prioritized_sampler 2024-03-04 23:08:52 +00:00
Remi
e990f3e148 Merge pull request #6 from Cadene/user/rcadene/2024_03_04_diffusion
Make diffusion work
2024-03-04 18:30:40 +01:00
Remi Cadene
e29fbb50e8 Fix grad_clip_norm 0 -> 10, Fix normalization min_max to be per channel 2024-03-04 17:26:34 +00:00
Remi Cadene
cfc304e870 Refactor env queue, Training diffusion works (Still not converging) 2024-03-04 11:00:51 +00:00
Remi Cadene
fddd9f0311 Add possibility for the policy to provide a sequence of actions to the env 2024-03-03 14:02:24 +00:00
Remi Cadene
0f2fa4d9ef Add obs queue to pusht, Set n_obs_steps=2 for diffusion (Not fully tested) 2024-03-03 13:21:31 +00:00
Remi Cadene
cbbed590a9 Add mode to NormalizeTransform with mean_std or min_max (Not fully tested) 2024-03-03 13:19:02 +00:00
Simon Alibert
b33ec5a630 Add run on cpu-only compatibility 2024-03-03 12:47:26 +01:00
Remi Cadene
48ded3dbc7 fix 2024-03-02 18:11:50 +00:00
Remi Cadene
80785f8d0e Small fix, Refactor diffusion, Diffusion runs (TODO: remove normalization in diffusion) 2024-03-02 17:04:39 +00:00
Remi Cadene
45b4ecb727 pre-commit run -a 2024-03-02 15:58:21 +00:00
Remi Cadene
1ae6205269 Add Normalize, non_blocking=True in tdmpc, tdmpc run (TODO: diffusion) 2024-03-02 15:53:29 +00:00
Remi Cadene
b5a2f460ea fix bus error 2024-03-01 14:22:05 +00:00
Simon Alibert
c1942d45d3 Fixes for PR #4 2024-03-01 14:59:05 +01:00
Simon Alibert
b862145e22 Added pusht dataset auto-download 2024-03-01 14:31:54 +01:00
Cadene
ca948c1e5b fix zip strict=False 2024-03-01 00:45:23 +00:00
Cadene
ae050d2e94 Solve conflicts + pre-commit run -a 2024-02-29 23:31:32 +00:00
Cadene
0b9027f05e Clean logging, Refactor 2024-02-29 23:21:27 +00:00
Simon Alibert
2c05b75f45 Fixes for PR #3 2024-02-29 21:46:41 +01:00
Simon Alibert
7e024fdce6 Ran pre-commit run --all-files 2024-02-29 13:37:48 +01:00
Cadene
ac90b9c3ee Fix diffusion (rm transpose), Add prefetch 2024-02-28 17:45:01 +00:00
Cadene
cf5063e50e Add diffusion policy (train and eval works, TODO: reproduce results) 2024-02-28 15:21:42 +00:00
Simon Alibert
98f8869743 WIP 2024-02-28 10:59:06 +01:00
Cadene
21670dce90 Refactor train, eval_policy, logger, Add diffusion.yaml (WIP) 2024-02-26 01:10:09 +00:00
Cadene
b16c334825 Refactor configs to have env in seperate yaml + Fix training 2024-02-25 17:42:47 +00:00
Cadene
ed80db2846 Sanitize cfg.env 2024-02-25 12:02:29 +00:00
Cadene
0eb9b5d1a5 Sanitize cfg.wandb 2024-02-25 11:15:09 +00:00
Cadene
e765e26b0b Sanitize cfg.policy, Fix skip_frame pusht.yaml 2024-02-25 11:09:02 +00:00
Cadene
598bb496b0 Add policies/factory, Add test, Add _self_ in config 2024-02-25 10:50:23 +00:00
Cadene
64b5920e94 format 2024-02-24 18:19:18 +00:00
Cadene
aed02dc7c6 Add multithreading for video generation, Speed policy sampling 2024-02-24 18:18:39 +00:00
Cadene
591985c67d Fix done in pusht, Fix --time in sbatch 2024-02-22 17:58:26 +00:00
Cadene
63d18475cc fix simxarm factory 2024-02-22 13:04:24 +00:00
Cadene
96c53ad06f remove comments 2024-02-22 12:15:14 +00:00
Cadene
e3643d6146 Wandb works, One output dir 2024-02-22 12:14:12 +00:00
Cadene
ece89730e6 Add pusht dataset (TODO verify reward is aligned), Refactor visualize_dataset, Add video_dir, fps, state_dim, action_dim to config (Training works) 2024-02-21 00:49:40 +00:00
Cadene
3dc14b5576 Add Prod transform, Add test_factory 2024-02-20 14:22:16 +00:00
Cadene
3da6ffb2cb Fix unit tests, Refactor, Add pusht env, (TODO pusht replay buffer, image preprocessing) 2024-02-20 12:26:57 +00:00
Cadene
fdfb2010fd black 2024-02-18 01:24:19 +00:00
Cadene
a5c305a7a4 offline training + online finetuning converge to 33 reward! 2024-02-18 01:23:44 +00:00
Cadene
c202c2b3c2 Online finetuning runs (sometimes crash because of nans) 2024-02-16 15:13:24 +00:00
Cadene
228c045674 Eval reproduced! Train running (but not reproduced) 2024-02-10 15:46:24 +00:00