Alexander Soare
94cc22da9e
Merge remote-tracking branch 'upstream/main' into refactor_dp
2024-04-11 17:52:10 +01:00
Alexander Soare
976a197f98
backup wip
2024-04-11 17:51:35 +01:00
Cadene
c1a618e567
fix pusht images type from float32 to uint8, update gym-pusht dependencies
2024-04-11 14:29:16 +00:00
Cadene
3914831585
remove __name__ outside script
2024-04-10 17:16:44 +00:00
Cadene
e8622154f8
Replace import gym_pusht in pusht dataset by dynamic import
2024-04-10 15:56:18 +00:00
Cadene
06573d7f67
online training works (loss goes down), remove repeat_action, eval_policy outputs episodes data, eval_policy uses max_episodes_rendered
2024-04-10 11:34:01 +00:00
Cadene
70aaf1c4cb
test_datasets.py are passing!
2024-04-08 14:16:57 +00:00
Cadene
44656d2706
test_envs are passing
2024-04-05 23:27:12 +00:00
Cadene
5af00d0c1e
fix train.py, stats, eval.py (training is running)
2024-04-05 09:31:39 +00:00
Cadene
1cdfbc8b52
WIP
WIP
WIP train.py works, loss going down
WIP eval.py
Fix
WIP (eval running, TODO: verify results reproduced)
Eval works! (testing reproducibility)
WIP
pretrained model pusht reproduces same results as torchrl
pretrained model pusht reproduces same results as torchrl
Remove AbstractPolicy, Move all queues in select_action
WIP test_datasets passed (TODO: re-enable NormalizeTransform)
2024-04-04 15:31:03 +00:00
Alexander Soare
1a1308d62f
fix environment seeding
add fixes for reproducibility
only try to start env if it is closed
revision
fix normalization and data type
Improve README
Improve README
Tests are passing, Eval pretrained model works, Add gif
Update gif
Update gif
Update gif
Update gif
Update README
Update README
update minor
Update README.md
Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com>
Update README.md
Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com>
Address suggestions
Update thumbnail + stats
Update thumbnail + stats
Update README.md
Co-authored-by: Alexander Soare <alexander.soare159@gmail.com>
Add more comments
Add test_examples.py
2024-03-26 10:10:43 +00:00
Cadene
40f3783fca
v1.2
2024-03-23 11:41:56 +00:00
Cadene
82e6e01651
v1.1
2024-03-20 17:34:00 +00:00
Cadene
6a1a29386a
Add replay_buffer directory in pusht datasets + aloha (WIP)
2024-03-19 15:49:45 +00:00
Cadene
5805a7ffb1
small fix in type + comments
2024-03-15 12:44:52 +00:00
Cadene
41521f7e96
self.root is Path or None + The following packages are already present in the pyproject.toml and will be skipped:
- huggingface-hub
If you want to update it to the latest compatible version, you can use `poetry update package`.
If you prefer to upgrade it to the latest available version, you can use `poetry add package@latest`.
Nothing to add.
2024-03-15 10:56:46 +00:00
Cadene
a311d38796
Add aloha + improve readme
2024-03-15 00:30:11 +00:00
Cadene
19730b3412
Add pusht on hf dataset (WIP)
2024-03-14 16:59:37 +00:00
Simon Alibert
6c867d78ef
Integrate pusht env from diffusion
2024-03-10 16:33:03 +01:00
Simon Alibert
302b78962c
Integrate diffusion policy
2024-03-10 15:31:17 +01:00
Simon Alibert
89eaab140b
Add pusht test artifact
2024-03-09 15:36:20 +01:00
Simon Alibert
f1e2837d63
fix pusht data_dir path
2024-03-08 12:26:15 +01:00
Remi Cadene
f95ecd66fc
Improve visualize_dataset, Improve AbstractReplayBuffer, Small improvements
2024-03-06 10:15:57 +00:00
Remi Cadene
2f80d71c3e
Remove noqa-F821
2024-03-05 10:22:21 +00:00
Remi Cadene
d4e0849970
Refactor datasets with abstract class
2024-03-05 10:20:57 +00:00
Remi Cadene
e29fbb50e8
Fix grad_clip_norm 0 -> 10, Fix normalization min_max to be per channel
2024-03-04 17:26:34 +00:00
Remi Cadene
cfc304e870
Refactor env queue, Training diffusion works (Still not converging)
2024-03-04 11:00:51 +00:00
Remi Cadene
cbbed590a9
Add mode to NormalizeTransform with mean_std or min_max (Not fully tested)
2024-03-03 13:19:02 +00:00
Remi Cadene
45b4ecb727
pre-commit run -a
2024-03-02 15:58:21 +00:00
Remi Cadene
1ae6205269
Add Normalize, non_blocking=True in tdmpc, tdmpc run (TODO: diffusion)
2024-03-02 15:53:29 +00:00
Remi Cadene
b5a2f460ea
fix bus error
2024-03-01 14:22:05 +00:00
Simon Alibert
c1942d45d3
Fixes for PR #4
2024-03-01 14:59:05 +01:00
Simon Alibert
b862145e22
Added pusht dataset auto-download
2024-03-01 14:31:54 +01:00
Cadene
ae050d2e94
Solve conflicts + pre-commit run -a
2024-02-29 23:31:32 +00:00
Cadene
0b9027f05e
Clean logging, Refactor
2024-02-29 23:21:27 +00:00
Simon Alibert
7e024fdce6
Ran pre-commit run --all-files
2024-02-29 13:37:48 +01:00
Cadene
591985c67d
Fix done in pusht, Fix --time in sbatch
2024-02-22 17:58:26 +00:00
Cadene
ece89730e6
Add pusht dataset (TODO verify reward is aligned), Refactor visualize_dataset, Add video_dir, fps, state_dim, action_dim to config (Training works)
2024-02-21 00:49:40 +00:00
Cadene
3da6ffb2cb
Fix unit tests, Refactor, Add pusht env, (TODO pusht replay buffer, image preprocessing)
2024-02-20 12:26:57 +00:00