Commit Graph

  • 96b2b62377 Fix missing local_files_only in record/replay (#540) Remi 2024-12-03 10:53:21 +01:00
  • b5c98bbfd3 Refactor OpenX (#505) Michel Aractingi 2024-12-03 00:51:55 +01:00
  • 58e12cf2e8 Fixup Eugene Mironov 2024-12-17 02:42:53 +07:00
  • d8b5fae622 Add human intervention mechanism and eval_robot script to evaluate policy on the robot (#541) Michel Aractingi 2024-12-09 19:17:47 +01:00
  • 67ac81d728 Reward classifier and training (#528) Yoel 2024-12-09 10:21:50 +01:00
  • b5f1ea3140 nit Michel Aractingi 2024-11-27 15:19:20 +01:00
  • 4d854a1513 Stable version of rlpd + drq AdilZouitine 2025-01-22 09:00:16 +00:00
  • 87da655eab Add type annotations and restructure SACConfig class fields AdilZouitine 2025-01-21 09:51:12 +00:00
  • 6cc2cc896a set num workers to zero in compute stats, temporary fix to hanging code when iterating over dataloader user/michel-aractingi/2025-01-18-port-rlds-example Michel Aractingi 2025-01-20 11:02:37 +00:00
  • d8560b8d5f Bump rerun-sdk dependency to 0.21.0 (#618) Jochen Görtler 2025-01-20 09:50:11 +01:00
  • acc433d25d Added example script to port rlds/openx dataset formats Michel Aractingi 2025-01-18 10:49:26 +00:00
  • a8fda9c61a Change SAC policy implementation with configuration and modeling classes Adil Zouitine 2025-01-17 09:39:04 +01:00
  • 55505ff817 Add rlpd tricks Adil Zouitine 2025-01-15 15:49:24 +01:00
  • 20d31ab8e0 SAC works Adil Zouitine 2025-01-14 11:34:52 +01:00
  • e5b83aab5e remove breakpoint Adil Zouitine 2025-01-13 17:58:00 +01:00
  • a9d5f62304 [WIP] correct sac implementation Adil Zouitine 2025-01-13 17:54:11 +01:00
  • 72e1ed7058 Add rlpd tricks Adil Zouitine 2025-01-15 15:49:24 +01:00
  • d8e67a2609 SAC works Adil Zouitine 2025-01-14 11:34:52 +01:00
  • 50e12376de remove breakpoint Adil Zouitine 2025-01-13 17:58:00 +01:00
  • 73aa6c25f3 [WIP] correct sac implementation Adil Zouitine 2025-01-13 17:54:11 +01:00
  • 278b56bce9 Add rlpd tricks user/adil-zouitine/2025-1-7-port-hil-serl Adil Zouitine 2025-01-15 15:49:24 +01:00
  • 380b836eee Fix for the issue https://github.com/huggingface/lerobot/issues/638 (#639) Pradeep Kadubandi 2025-01-15 01:50:38 -08:00
  • f945641de9 added SM8512BL nepyope 2025-01-14 14:19:38 +01:00
  • 0ffc0a7170 SAC works Adil Zouitine 2025-01-14 11:34:52 +01:00
  • 43d9890489 remove breakpoint Adil Zouitine 2025-01-13 17:58:00 +01:00
  • 963be41003 [WIP] correct sac implementation Adil Zouitine 2025-01-13 17:54:11 +01:00
  • 3bb5ed5e91 Extend reward classifier for multiple camera views (#626) user/michel-aractingi/2024-11-27-port-hil-serl Michel Aractingi 2025-01-13 13:57:49 +01:00
  • eec6796cb8 fixes to SO-100 readme (#600) Philip Fung 2025-01-10 02:30:01 -08:00
  • 5bd41a3dca added feetech failsafe comment nepyope 2025-01-09 18:14:01 +01:00
  • 25a8597680 [viz] Fixes & updates to html visualizer (#617) Mishig 2025-01-09 11:39:54 +01:00
  • b8b368310c typo fix: batch_convert_dataset_v1_to_v2.py (#615) CharlesCNorton 2025-01-09 03:57:45 -05:00
  • 5097cd900e fix(visualise): use correct language description for each episode id (#604) Ville Kuosmanen 2025-01-09 08:39:48 +00:00
  • bc16e1b497 fix(docs): typos in benchmark readme.md (#614) CharlesCNorton 2025-01-09 03:35:27 -05:00
  • 8f821ecad0 Fix Quality workflow (#622) Simon Alibert 2025-01-08 13:35:11 +01:00
  • 9edae4a8de Correct losses and factorisation Adil Zouitine 2025-01-07 17:07:55 +01:00
  • c5bca1cf0f [Port HIL_SERL] Final fixes for the Reward Classifier (#598) Eugene Mironov 2025-01-06 17:34:00 +07:00
  • 89d8189d8b remove unused debug lines Ke-Wang1017 2025-01-06 10:18:40 +00:00
  • 8b70b129dc improvements from JClinton to speed up loading offline data Ke-Wang1017 2025-01-06 09:50:08 +00:00
  • db3925df28 Potential fixes for SAC instability and NAN bug joeclinton1 2025-01-03 21:12:59 +00:00
  • f99e670976 Refactor SACPolicy and configuration for improved training dynamics Ke-Wang1017 2025-01-02 22:13:58 +00:00
  • eec28baa63 fix the bug of target critic updates, roll back to origial temperature implementation, added debug logging info KeWang1017 2025-01-02 21:05:14 +00:00
  • f1f04eb4f9 use mean instead of sampled action for the inference KeWang1017 2024-12-31 10:48:06 +00:00
  • 77a7f92139 1, add input normalization in configuration_sac.py 2, add masking on loss computation KeWang1017 2024-12-30 18:46:25 +00:00
  • 4519016e67 Update README.md (#612) CharlesCNorton 2025-01-03 10:19:37 -05:00
  • 245e844d83 Update factory.py user/michel-aractingi/2024-11-26-Maniskill-support Michel Aractingi 2024-12-31 11:38:53 +01:00
  • 35de91ef2b added temporary fix for missing task_index key in online environment Michel Aractingi 2024-12-30 13:47:28 +00:00
  • ee306e2f9b split encoder for critic and actor Michel Aractingi 2024-12-29 23:59:39 +00:00
  • bae3b02928 style fixes Michel Aractingi 2024-12-29 14:35:21 +00:00
  • 5b4adc00bb Refactor SAC configuration and policy for improved action sampling and stability KeWang1017 2024-12-29 12:30:39 +00:00
  • 22fbc9ea4a Refine SAC configuration and policy for enhanced performance KeWang1017 2024-12-28 22:11:34 +00:00
  • ca74a13d61 Refactor SACPolicy for improved action sampling and standard deviation handling KeWang1017 2024-12-28 18:07:15 +00:00
  • 18a4598986 trying to get sac running KeWang1017 2024-12-26 23:38:46 +00:00
  • dc54d357ca Added normalization schemes and style checks Michel Aractingi 2024-12-29 12:51:21 +00:00
  • f996a13f70 Add test3 test4 user/rcadene/2024_11_26_hope_junior Remi Cadene 2024-12-29 13:30:17 +01:00
  • 49ae3e19e1 Add clone, delete, WIP on remove_episode, drop_frame user/rcadene/2024_12_03_manage_dataset Remi Cadene 2024-12-27 17:49:47 +01:00
  • 743ebfa7c1 Cremaillaire HF Remi Cadene 2024-12-24 11:33:55 +01:00
  • 59e2757434 Fix broken create_lerobot_dataset_card (#590) Eugene Mironov 2024-12-23 21:05:59 +07:00
  • 08ec971086 added optimizer and sac to factory.py Michel Aractingi 2024-12-23 14:12:03 +01:00
  • b53d6e0ff2 [HIL-SERL PORT] Fix linter issues (#588) Eugene Mironov 2024-12-23 16:44:29 +07:00
  • 70b652f791 [Port Hil-SERL] Add unit tests for the reward classifier & fix imports & check script (#578) Eugene Mironov 2024-12-23 16:43:55 +07:00
  • 73b64c3089 [vizualizer] for LeRobodDataset V2 (#576) Mishig 2024-12-20 16:26:23 +01:00
  • 7b68bfb73b added comments from kewang Michel Aractingi 2024-12-17 18:03:46 +01:00
  • 7e0f20fbf2 Enhance SAC configuration and policy with new parameters and subsampling logic KeWang1017 2024-12-17 15:58:04 +00:00
  • def42ff487 Port SAC WIP (#581) temp_branch_hil_serl KeWang 2024-12-17 13:26:17 +00:00
  • c9af8e36a7 completed losses Michel Aractingi 2024-12-12 11:45:30 +01:00
  • 5ae5dcb30f Port SAC WIP (#581) user/michel-aractingi/2024-12-07-soft-actor-critic KeWang 2024-12-17 13:26:17 +00:00
  • ed66c92383 nit in control_robot.py Michel Aractingi 2024-12-11 00:30:33 +01:00
  • 668d493bf9 Update lerobot/scripts/train_hilserl_classifier.py Michel Aractingi 2024-12-11 00:22:10 +01:00
  • 67f4d7ea7a LerobotDataset pushable to HF from any folder (#563) Claudio Coppola 2024-12-09 10:32:25 +00:00
  • 4b0c88ff8e Update 7_get_started_with_real_robot.md (#559) berjaoui 2024-12-09 00:17:49 +01:00
  • b19fef9d18 Control simulated robot with real leader (#514) Michel Aractingi 2024-12-03 12:20:05 +01:00
  • 1612e00e63 Fix missing local_files_only in record/replay (#540) Remi 2024-12-03 10:53:21 +01:00
  • c3bc136420 Refactor OpenX (#505) Michel Aractingi 2024-12-03 00:51:55 +01:00
  • 1020bc3108 Fixup Eugene Mironov 2024-12-17 02:42:53 +07:00
  • 7fcf638c0d Add human intervention mechanism and eval_robot script to evaluate policy on the robot (#541) Michel Aractingi 2024-12-09 19:17:47 +01:00
  • e35546f58e Reward classifier and training (#528) Yoel 2024-12-09 10:21:50 +01:00
  • 1aa8d4ac91 nit Michel Aractingi 2024-11-27 15:19:20 +01:00
  • 972bac98b4 completed losses Michel Aractingi 2024-12-12 11:45:30 +01:00
  • ebe0bfad77 WIP Remi Cadene 2024-12-11 09:09:14 -08:00
  • 66f8736598 fixing typo from 'teloperation' to 'teleoperation' (#566) s1lent4gnt 2024-12-11 14:57:52 +01:00
  • 4c41f6fcc6 Fix example 6 (#572) Simon Alibert 2024-12-11 10:32:18 +01:00
  • 458c427e0c adding sac implementation Michel Aractingi 2024-12-11 00:34:43 +01:00
  • 3d7e74d162 nit in control_robot.py Michel Aractingi 2024-12-11 00:30:33 +01:00
  • e9ef46f134 Update lerobot/scripts/train_hilserl_classifier.py Michel Aractingi 2024-12-11 00:22:10 +01:00
  • d26a9b1363 fixed merge conflicts Michel Aractingi 2024-12-10 16:59:43 +01:00
  • ceda66e2bc incomplete merge Michel Aractingi 2024-12-10 16:55:55 +01:00
  • 839dac9cd4 Merge branch 'main' into user/michel-aractingi/2024-11-27-port-hil-serl Michel Aractingi 2024-12-10 16:02:49 +01:00
  • abcf17a070 Add human intervention mechanism and eval_robot script to evaluate policy on the robot (#541) Michel Aractingi 2024-12-09 19:17:47 +01:00
  • 44f9b21e74 LerobotDataset pushable to HF from any folder (#563) Claudio Coppola 2024-12-09 10:32:25 +00:00
  • 6490927b3d Reward classifier and training (#528) Yoel 2024-12-09 10:21:50 +01:00
  • 03f49ceaf0 Update 7_get_started_with_real_robot.md (#559) berjaoui 2024-12-09 00:17:49 +01:00
  • 16b905ee67 adding sac implemenation user/michel-aractingi/2024-12-06-sac-implementation Michel Aractingi 2024-12-06 01:32:22 +01:00
  • c6e9a3dc24 nit Remi Cadene 2024-12-03 17:24:55 +01:00
  • afbd42d082 Add manage_dataset Remi Cadene 2024-12-03 17:16:47 +01:00
  • 8e7d6970ea Control simulated robot with real leader (#514) Michel Aractingi 2024-12-03 12:20:05 +01:00
  • 286bca37cc Fix missing local_files_only in record/replay (#540) Remi 2024-12-03 10:53:21 +01:00
  • a2c181992a Refactor OpenX (#505) Michel Aractingi 2024-12-03 00:51:55 +01:00
  • 32eb0cec8f Dataset v2.0 (#461) Simon Alibert 2024-11-29 19:04:00 +01:00
  • aa03a27f0a fix user/rcadene/2024_11_27_reachy2_viz_inference Remi Cadene 2024-11-27 16:45:07 +01:00
  • a97c1cb1af [viz tool] add policy pred column Mishig Davaadorj 2024-11-27 15:32:48 +01:00