Alexander Soare
a9496fde39
revision 1
2024-04-16 17:15:51 +01:00
Alexander Soare
9c2f10bd04
ready for review
2024-04-16 13:43:58 +01:00
Alexander Soare
03b08eb74e
backup wip
2024-04-16 12:51:32 +01:00
Alexander Soare
5608e659e6
backup wip
2024-04-15 19:06:44 +01:00
Alexander Soare
6d0a45a97d
ready for review
2024-04-12 11:36:52 +01:00
Alexander Soare
5666ec3ec7
backup wip
2024-04-11 18:33:54 +01:00
Alexander Soare
976a197f98
backup wip
2024-04-11 17:51:35 +01:00
Alexander Soare
863f28ffd8
ready for review
2024-04-08 13:10:19 +01:00
Alexander Soare
1e71196fe3
backup wip
2024-04-05 17:38:29 +01:00
Cadene
a420714ee4
fix: action_is_pad was missing in compute_loss
2024-04-05 11:33:39 +00:00
Cadene
5af00d0c1e
fix train.py, stats, eval.py (training is running)
2024-04-05 09:31:39 +00:00
Cadene
1cdfbc8b52
WIP
...
WIP
WIP train.py works, loss going down
WIP eval.py
Fix
WIP (eval running, TODO: verify results reproduced)
Eval works! (testing reproducibility)
WIP
pretrained model pusht reproduces same results as torchrl
pretrained model pusht reproduces same results as torchrl
Remove AbstractPolicy, Move all queues in select_action
WIP test_datasets passed (TODO: re-enable NormalizeTransform)
2024-04-04 15:31:03 +00:00
Alexander Soare
1a1308d62f
fix environment seeding
...
add fixes for reproducibility
only try to start env if it is closed
revision
fix normalization and data type
Improve README
Improve README
Tests are passing, Eval pretrained model works, Add gif
Update gif
Update gif
Update gif
Update gif
Update README
Update README
update minor
Update README.md
Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com >
Update README.md
Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com >
Address suggestions
Update thumbnail + stats
Update thumbnail + stats
Update README.md
Co-authored-by: Alexander Soare <alexander.soare159@gmail.com >
Add more comments
Add test_examples.py
2024-03-26 10:10:43 +00:00
Simon Alibert
bcfdba109f
Update pre-commit & run on all files
2024-03-25 17:29:35 +01:00
Alexander Soare
72d3c3120b
Merge remote-tracking branch 'upstream/main' into fix_pusht_diffusion
2024-03-21 10:20:52 +00:00
Alexander Soare
acf1174447
ready for review
2024-03-21 10:18:50 +00:00
Simon Alibert
4631d36c05
Add get_safe_torch_device in policies
2024-03-20 18:38:55 +01:00
Alexander Soare
d323993569
backup wip
2024-03-20 15:01:27 +00:00
Alexander Soare
32e3f71dd1
backup wip
2024-03-20 09:49:16 +00:00
Alexander Soare
896a11f60e
backup wip
2024-03-19 18:50:04 +00:00
Alexander Soare
88347965c2
revert dp changes, make act and tdmpc batch friendly
2024-03-18 19:18:21 +00:00
Alexander Soare
a222c88c99
Merge branch 'user/alexander-soare/train_pusht' into user/alexander-soare/multistep_policy_and_serial_env
2024-03-14 16:06:21 +00:00
Alexander Soare
ba91976944
wip: still needs batch logic for act and tdmp
2024-03-14 15:24:10 +00:00
Alexander Soare
98484ac68e
ready for review
2024-03-12 21:59:01 +00:00
Alexander Soare
87fcc536f9
wip - still need to verify full training run
2024-03-11 18:45:21 +00:00
Alexander Soare
2a01487494
early training loss as expected
2024-03-11 13:34:04 +00:00
Simon Alibert
6c867d78ef
Integrate pusht env from diffusion
2024-03-10 16:33:03 +01:00
Simon Alibert
302b78962c
Integrate diffusion policy
2024-03-10 15:31:17 +01:00
Simon Alibert
a6d353c419
Fix
2024-03-05 17:00:17 +01:00
Remi Cadene
cfc304e870
Refactor env queue, Training diffusion works (Still not converging)
2024-03-04 11:00:51 +00:00
Remi Cadene
fddd9f0311
Add possibility for the policy to provide a sequence of actions to the env
2024-03-03 14:02:24 +00:00
Remi Cadene
0f2fa4d9ef
Add obs queue to pusht, Set n_obs_steps=2 for diffusion (Not fully tested)
2024-03-03 13:21:31 +00:00
Remi Cadene
80785f8d0e
Small fix, Refactor diffusion, Diffusion runs (TODO: remove normalization in diffusion)
2024-03-02 17:04:39 +00:00