lerobot_piper

Author	SHA1	Message	Date
Alexander Soare	a9496fde39	revision 1	2024-04-16 17:15:51 +01:00
Alexander Soare	9c2f10bd04	ready for review	2024-04-16 13:43:58 +01:00
Alexander Soare	03b08eb74e	backup wip	2024-04-16 12:51:32 +01:00
Alexander Soare	5608e659e6	backup wip	2024-04-15 19:06:44 +01:00
Alexander Soare	6d0a45a97d	ready for review	2024-04-12 11:36:52 +01:00
Alexander Soare	5666ec3ec7	backup wip	2024-04-11 18:33:54 +01:00
Alexander Soare	976a197f98	backup wip	2024-04-11 17:51:35 +01:00
Alexander Soare	863f28ffd8	ready for review	2024-04-08 13:10:19 +01:00
Alexander Soare	1e71196fe3	backup wip	2024-04-05 17:38:29 +01:00
Cadene	a420714ee4	fix: action_is_pad was missing in compute_loss	2024-04-05 11:33:39 +00:00
Cadene	5af00d0c1e	fix train.py, stats, eval.py (training is running)	2024-04-05 09:31:39 +00:00
Cadene	1cdfbc8b52	WIP WIP WIP train.py works, loss going down WIP eval.py Fix WIP (eval running, TODO: verify results reproduced) Eval works! (testing reproducibility) WIP pretrained model pusht reproduces same results as torchrl pretrained model pusht reproduces same results as torchrl Remove AbstractPolicy, Move all queues in select_action WIP test_datasets passed (TODO: re-enable NormalizeTransform)	2024-04-04 15:31:03 +00:00
Alexander Soare	1a1308d62f	fix environment seeding add fixes for reproducibility only try to start env if it is closed revision fix normalization and data type Improve README Improve README Tests are passing, Eval pretrained model works, Add gif Update gif Update gif Update gif Update gif Update README Update README update minor Update README.md Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com> Update README.md Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com> Address suggestions Update thumbnail + stats Update thumbnail + stats Update README.md Co-authored-by: Alexander Soare <alexander.soare159@gmail.com> Add more comments Add test_examples.py	2024-03-26 10:10:43 +00:00
Simon Alibert	bcfdba109f	Update pre-commit & run on all files	2024-03-25 17:29:35 +01:00
Alexander Soare	72d3c3120b	Merge remote-tracking branch 'upstream/main' into fix_pusht_diffusion	2024-03-21 10:20:52 +00:00
Alexander Soare	acf1174447	ready for review	2024-03-21 10:18:50 +00:00
Simon Alibert	4631d36c05	Add get_safe_torch_device in policies	2024-03-20 18:38:55 +01:00
Alexander Soare	d323993569	backup wip	2024-03-20 15:01:27 +00:00
Alexander Soare	32e3f71dd1	backup wip	2024-03-20 09:49:16 +00:00
Alexander Soare	896a11f60e	backup wip	2024-03-19 18:50:04 +00:00
Alexander Soare	88347965c2	revert dp changes, make act and tdmpc batch friendly	2024-03-18 19:18:21 +00:00
Alexander Soare	a222c88c99	Merge branch 'user/alexander-soare/train_pusht' into user/alexander-soare/multistep_policy_and_serial_env	2024-03-14 16:06:21 +00:00
Alexander Soare	ba91976944	wip: still needs batch logic for act and tdmp	2024-03-14 15:24:10 +00:00
Alexander Soare	98484ac68e	ready for review	2024-03-12 21:59:01 +00:00
Alexander Soare	87fcc536f9	wip - still need to verify full training run	2024-03-11 18:45:21 +00:00
Alexander Soare	2a01487494	early training loss as expected	2024-03-11 13:34:04 +00:00
Simon Alibert	6c867d78ef	Integrate pusht env from diffusion	2024-03-10 16:33:03 +01:00
Simon Alibert	302b78962c	Integrate diffusion policy	2024-03-10 15:31:17 +01:00
Simon Alibert	a6d353c419	Fix	2024-03-05 17:00:17 +01:00
Remi Cadene	cfc304e870	Refactor env queue, Training diffusion works (Still not converging)	2024-03-04 11:00:51 +00:00
Remi Cadene	fddd9f0311	Add possibility for the policy to provide a sequence of actions to the env	2024-03-03 14:02:24 +00:00
Remi Cadene	0f2fa4d9ef	Add obs queue to pusht, Set n_obs_steps=2 for diffusion (Not fully tested)	2024-03-03 13:21:31 +00:00
Remi Cadene	80785f8d0e	Small fix, Refactor diffusion, Diffusion runs (TODO: remove normalization in diffusion)	2024-03-02 17:04:39 +00:00

33 Commits