lerobot

tangger/lerobot

Fork 1

Commit Graph

Select branches

Hide Pull Requests

2025_04_11_vla_eval

Cadene-patch-1

alexander-soare/add_drop_last_keyframes

chore/bump_pythonversion_precommit

depth

feat/add_rerun

feat/autopolicy

fix/lint_warnings

fix_aloha_conversion_nan

fix_path

hf-papers

lerobot-xh

main

mindbotv1

my-fix-based-on-pr-1175

origin/fix/record_policy_action_dict

origin/user/rcadene/2024_04_30_refactor_push_dataset_to_hub_video

pre-commit-ci-update-config

qgallouedec-patch-1

qgallouedec/macos_ci

qgallouedec/move_update_optim_schedul_outside_policy

qgallouedec/use_pretrainedconfig

realman

realman-dual

realman-dual-arm

realman-dual-arm-bak

realman-dual-xh

realman-dual-yehao

realman-single-arm

recovered-commit

refactor/updated_api_docs_rebased

revert-1160-mshukor-patch-1

robot_client

smolvla_doc

tdmpc23

temp_branch_hil_serl

test/add_cameras_di_tests

test/add_cameras_di_tests_no_tmp_connect

test/add_cameras_patch_tests

test/add_cameras_patch_tests_no_tmp_connect

test/robot_refactor_experiments

thom-act

thom-fixes

thom-proposals

thom_arm

thomwolf_2024_06_04_nan_fixes

thomwolf_2024_06_04_nans

thomwolf_2024_06_04_simple_gym

thomwolf_2024_06_12_deep_dive_ACT

thomwolf_2024_06_18_fix_normalization

torchcodec-cpu

user/adil-zouitine/2025-1-7-port-hil-serl

user/adil-zouitine/2025-1-7-port-hil-serl-new

user/alexander/multistep_policy_rollout

user/aliberts/2024_03_15_notebook_example

user/aliberts/2024_03_16_update_torchrl

user/aliberts/2024_03_22_fix_simxarm

user/aliberts/2024_03_26_add_ci_tests

user/aliberts/2024_03_28_package_envs

user/aliberts/2024_03_31_fix_simxarm_render

user/aliberts/2024_05_06_add_coverage

user/aliberts/2024_05_07_dev_docker

user/aliberts/2024_05_12_add_metrics_logging_config

user/aliberts/2024_05_14_compare_policies

user/aliberts/2024_05_20_add_gym_dora

user/aliberts/2024_07_26_remove_dataset_artifacts

user/aliberts/2024_09_10_fix_docker_dev

user/aliberts/2025_02_25_refactor_robots

user/aliberts/2025_02_27_fix_pr_style_bot

user/aliberts/2025_03_08_register_configs

user/aliberts/2025_03_10_new_feetech_calibration

user/aliberts/2025_04_03_add_hope_jr

user/aliberts/2025_04_08_fix_install_instruction

user/aliberts/2025_04_19_refactor_cameras

user/aliberts/2025_06_02_cap_pymunk

user/azouitine/2025-04-24-hot-fix-ci

user/azouitine/2025-4-8-softmax-grasp-critic

user/fracapuano/2024-04-24-mocking-robots

user/fracapuano/2025-04-23-fixing-mock-robot

user/fracapuano/2025-04-23-predicting-chunks

user/fracapuano/2025_05_21_dataset_streaming

user/fracapuano/2025_05_29_dataset_streaming_delta_timesteps

user/fracapuano/async-inf

user/jmoss/2024_08_19_add_hiwonder_motors

user/marinabar/2024_06_10_evaluation_stats

user/michel-aractingi/2024-02-18-hilserl-cache-embedding

user/michel-aractingi/2024-02-21-hilserl-threading-to-mp

user/michel-aractingi/2024-08-26-tdmpc2-implementation

user/michel-aractingi/2024-09-02/move-make-optimizer-to-policy

user/michel-aractingi/2024-10-15-control-sim

user/michel-aractingi/2024-11-20-new-tdmpc2-impl

user/michel-aractingi/2024-11-26-Maniskill-support

user/michel-aractingi/2024-11-27-port-hil-serl

user/michel-aractingi/2024-12-06-sac-implementation

user/michel-aractingi/2024-12-07-soft-actor-critic

user/michel-aractingi/2025-01-18-port-rlds-example

user/michel-aractingi/2025-01-21-server-client-arch

user/michel-aractingi/2025-05-20-hil-rebase-robots

user/michel-aractingi/2025-06-02-docs-for-hil-serl

user/michel-aractingi/tmp-hil-serl-rebase

user/michel-aractingi/tmp-port-hil-serl-new

user/michel_adil/dump_hil_serl

user/michel_aractingi/2024-04-04-grasp-critic

user/mrussi/2025_01_09_hope_junior

user/mrussi/2025_04_12_hopejr

user/mshukor/2025_02_25_accelerate

user/pepijn/2025_01_24_save_autocorrect_calibration_to_json

user/pepijn/2025_01_27_steps_assembly_intructions

user/pepijn/2025_03_11_focal_loss

user/pepijn/2025_03_11_weighted_sampling

user/pepijn/2025_03_18_fix_rotation_so100_docs

user/pepijn/2025_03_20_dana_workshop

user/pepijn/2025_05_05_lekiwi_dual_teleop

user/rcadene/2024_03_04_diffusion_pretrained_from_repo

user/rcadene/2024_03_04_sbatch_hopper

user/rcadene/2024_03_08_act_policy

user/rcadene/2024_03_11_aloha_load_pretrained_from_act_repo

user/rcadene/2024_03_14_video_dataset

user/rcadene/2024_03_22_pretrained

user/rcadene/2024_03_25_readme

user/rcadene/2024_03_31_pretrained_models_aloha_simxarm

user/rcadene/2024_04_20_mobile_aloha_rerun

user/rcadene/2024_04_30_refactor_push_dataset_to_hub_video

user/rcadene/2024_05_08_test

user/rcadene/2024_05_20_dora_parquet_format_viz

user/rcadene/2024_06_01_custom_visualize_dataset

user/rcadene/2024_06_03_fix_delta_timestamps

user/rcadene/2024_06_03_reachy2

user/rcadene/2024_06_07_faster_act

user/rcadene/2024_06_08_add_data_augmentation

user/rcadene/2024_06_22_control_robot_other

user/rcadene/2024_07_09_bill_of_materials

user/rcadene/2024_07_16_control_robot_v2_aloha

user/rcadene/2024_07_16_vqbet_multi_images

user/rcadene/2024_07_26_control_robot_v2_train

user/rcadene/2024_07_27_nightly

user/rcadene/2024_07_29_control_robot_linux

user/rcadene/2024_08_06_improve_visualize_dataset_inference

user/rcadene/2024_08_22_control_aloha

user/rcadene/2024_08_24_edit_dataset_remove_episodes

user/rcadene/2024_09_10_train_aloha_debug

user/rcadene/2024_09_18_async_inference

user/rcadene/2024_10_24_feetech_dataset_v2

user/rcadene/2024_11_26_hope_junior

user/rcadene/2024_11_27_reachy2_viz_inference

user/rcadene/2024_12_03_manage_dataset

user/rcadene/2025_01_28_aloha_hdf5

user/rcadene/2025_02_07_improve_pi0

user/rcadene/2025_02_17_streaming

user/rcadene/2025_02_17_visualize_inference

user/rcadene/2025_02_19_port_openx

user/rcadene/2025_04_11_dataset_v3

user/youliang/2024_06_20_oxe_data_format_to_hg

#10

#1001

#1002

#1006

#1006

#1008

#1008

#1009

#101

#1011

#1014

#1014

#1016

#1017

#1018

#1020

#1020

#1021

#1022

#1023

#1024

#1024

#1025

#1027

#1028

#1029

#103

#1030

#1031

#1031

#1034

#1035

#1035

#1036

#1036

#1037

#1037

#1040

#1040

#1046

#1047

#1047

#1048

#1051

#1052

#1052

#1053

#1053

#1054

#1055

#1055

#1056

#1056

#1057

#1057

#106

#1060

#1060

#1062

#1062

#1063

#1064

#1067

#107

#1070

#1073

#1074

#1075

#1076

#1077

#108

#1081

#1087

#1089

#109

#1092

#1092

#11

#110

#1103

#1104

#1108

#111

#1112

#1113

#1113

#1115

#1117

#1120

#1120

#1122

#1123

#1125

#1127

#1128

#1128

#113

#1131

#1132

#1132

#1133

#1134

#1136

#1137

#1138

#1138

#1139

#114

#1140

#1141

#1143

#1146

#1148

#1149

#1149

#115

#1150

#1150

#1151

#1151

#1152

#1154

#1155

#1156

#1157

#1158

#1158

#1159

#116

#1160

#1162

#1163

#1164

#1165

#1165

#1167

#1168

#117

#1172

#1172

#1173

#1173

#1175

#1178

#118

#1181

#1182

#1183

#1184

#1185

#1185

#1187

#1187

#1188

#119

#1190

#1191

#1192

#1193

#1194

#1196

#1196

#1197

#1197

#1198

#1198

#12

#120

#1200

#1200

#1201

#1201

#121

#122

#123

#124

#125

#126

#127

#128

#129

#13

#130

#131

#132

#133

#134

#135

#136

#137

#138

#138

#139

#14

#140

#144

#145

#146

#147

#148

#149

#15

#150

#151

#153

#154

#154

#155

#157

#158

#16

#160

#161

#162

#163

#163

#164

#165

#166

#169

#17

#171

#172

#174

#175

#176

#177

#179

#18

#181

#181

#182

#184

#184

#185

#186

#188

#189

#19

#190

#191

#192

#193

#195

#196

#197

#198

#199

#2

#20

#200

#201

#202

#203

#204

#205

#206

#207

#208

#209

#209

#21

#210

#213

#215

#216

#217

#218

#219

#22

#220

#221

#222

#223

#224

#225

#225

#228

#229

#23

#230

#231

#232

#233

#234

#235

#236

#237

#239

#240

#241

#242

#242

#243

#244

#245

#245

#246

#247

#249

#25

#252

#253

#254

#257

#26

#260

#262

#262

#264

#265

#267

#268

#269

#27

#270

#271

#272

#272

#273

#275

#276

#277

#278

#279

#28

#280

#281

#281

#282

#283

#284

#286

#288

#29

#290

#290

#292

#295

#298

#299

#3

#30

#300

#301

#302

#303

#306

#307

#308

#309

#31

#310

#312

#314

#316

#317

#317

#318

#319

#32

#322

#323

#325

#326

#327

#328

#329

#329

#33

#330

#331

#332

#333

#337

#338

#339

#340

#343

#344

#344

#345

#346

#347

#349

#349

#35

#350

#350

#351

#353

#353

#354

#355

#356

#356

#357

#358

#359

#36

#361

#362

#363

#364

#364

#365

#37

#370

#371

#372

#372

#373

#375

#376

#378

#378

#379

#38

#380

#380

#381

#381

#382

#384

#385

#386

#387

#388

#389

#39

#391

#391

#392

#392

#395

#396

#397

#398

#399

#4

#40

#400

#401

#401

#402

#403

#404

#409

#41

#410

#411

#411

#412

#414

#415

#416

#417

#418

#419

#42

#420

#422

#423

#424

#426

#428

#428

#429

#430

#431

#433

#434

#437

#437

#44

#442

#443

#445

#445

#447

#447

#448

#45

#450

#453

#455

#455

#457

#457

#459

#459

#46

#460

#461

#463

#464

#465

#465

#466

#467

#468

#47

#471

#473

#476

#476

#479

#48

#480

#481

#481

#482

#484

#485

#486

#487

#487

#488

#489

#49

#490

#491

#493

#493

#494

#495

#498

#499

#5

#50

#503

#505

#506

#506

#507

#507

#508

#512

#513

#514

#516

#516

#518

#519

#519

#521

#522

#523

#523

#524

#528

#529

#529

#53

#531

#531

#535

#535

#539

#54

#540

#541

#543

#543

#546

#548

#548

#550

#551

#551

#554

#556

#557

#558

#559

#56

#562

#563

#565

#565

#566

#567

#569

#569

#57

#570

#572

#573

#576

#577

#577

#578

#579

#58

#580

#581

#583

#584

#586

#586

#587

#588

#589

#590

#591

#591

#592

#592

#593

#594

#595

#598

#599

#6

#60

#600

#601

#602

#603

#604

#609

#610

#612

#614

#615

#616

#617

#618

#619

#620

#621

#622

#626

#627

#628

#628

#629

#63

#631

#634

#637

#637

#639

#64

#640

#641

#642

#643

#644

#644

#645

#645

#646

#647

#648

#649

#65

#651

#653

#654

#655

#656

#657

#657

#658

#659

#659

#66

#660

#662

#663

#664

#665

#666

#667

#668

#669

#67

#670

#675

#676

#677

#68

#680

#680

#681

#682

#682

#684

#684

#688

#69

#690

#691

#691

#693

#695

#696

#697

#7

#70

#701

#702

#703

#703

#704

#704

#708

#709

#71

#710

#711

#712

#714

#715

#716

#719

#72

#720

#722

#724

#725

#726

#728

#729

#729

#73

#730

#731

#732

#733

#734

#737

#739

#739

#74

#740

#740

#743

#744

#744

#747

#747

#75

#751

#752

#753

#754

#756

#756

#757

#758

#758

#759

#76

#760

#760

#762

#763

#765

#766

#767

#767

#768

#768

#77

#770

#772

#774

#775

#776

#777

#777

#778

#778

#780

#781

#782

#783

#784

#785

#786

#787

#79

#790

#791

#791

#792

#793

#794

#794

#795

#796

#798

#799

#8

#80

#800

#801

#803

#804

#807

#809

#81

#810

#811

#812

#814

#815

#818

#82

#820

#820

#824

#824

#827

#829

#83

#830

#831

#831

#832

#833

#834

#834

#835

#835

#838

#839

#84

#840

#841

#842

#843

#844

#845

#848

#85

#855

#856

#857

#859

#86

#861

#863

#865

#866

#866

#868

#869

#87

#870

#871

#872

#872

#873

#873

#874

#874

#875

#877

#878

#878

#88

#880

#880

#882

#882

#883

#885

#885

#886

#887

#889

#89

#891

#893

#896

#897

#899

#899

#9

#90

#900

#902

#902

#903

#907

#908

#91

#911

#913

#913

#914

#915

#917

#918

#92

#921

#922

#924

#924

#925

#929

#931

#931

#934

#935

#935

#936

#937

#939

#94

#943

#943

#944

#945

#945

#946

#947

#948

#95

#950

#953

#954

#957

#958

#96

#962

#965

#965

#966

#967

#967

#969

#969

#97

#972

#972

#974

#974

#976

#977

#978

#98

#980

#983

#984

#984

#986

#986

#987

#988

#989

#99

#990

#991

#992

#993

#994

#995

#995

#998

#999

aebea08a99 Added support for checkpointing the policy. We can save and load the policy state dict, optimizers state, optimization step and interaction step Added functions for converting the replay buffer from and to LeRobotDataset. When we want to save the replay buffer, we convert it first to LeRobotDataset format and save it locally and vice-versa. Michel Aractingi 2025-01-30 17:39:41 +00:00
03616db82c Removed unnecessary time.sleep in the streaming server on the learner side Michel Aractingi 2025-01-29 16:31:38 +00:00
93c4fc198f Added missing config files env/maniskill_example.yaml and policy/sac_maniskill.yaml that are necessary to run the lerobot implementation of sac with the maniskill baselines. Michel Aractingi 2025-01-29 16:07:32 +00:00
8cd44ae163 - Added additional logging information in wandb around the timings of the policy loop and optimization loop. - Optimized critic design that improves the performance of the learner loop by a factor of 2 - Cleaned the code and fixed style issues Michel Aractingi 2025-01-29 15:50:46 +00:00
2ae657f568 FREEDOM, added back the optimization loop code in learner_server.py Ran experiment with pushcube env from maniskill. The learning seem to work. Michel Aractingi 2025-01-28 17:25:49 +00:00
508f5d1407 Added server directory in lerobot/scripts that contains scripts and the protobuf message types to split training into two processes, acting and learning. The actor rollouts the policy and collects interaction data while the learner recieves the data, trains the policy and sends the updated parameters to the actor. The two scripts are ran simultaneously Co-authored-by: Adil Zouitine <adilzouitinegm@gmail.com> Michel Aractingi 2025-01-28 15:52:03 +00:00
c8b1132846 Stable version of rlpd + drq AdilZouitine 2025-01-22 09:00:16 +00:00
ef777993cd Add type annotations and restructure SACConfig class fields AdilZouitine 2025-01-21 09:51:12 +00:00
760d60ad4b Change SAC policy implementation with configuration and modeling classes Adil Zouitine 2025-01-17 09:39:04 +01:00
875c0271b7 SAC works Adil Zouitine 2025-01-14 11:34:52 +01:00
57344bfde5 [WIP] correct sac implementation Adil Zouitine 2025-01-13 17:54:11 +01:00
46827fb002 Add rlpd tricks Adil Zouitine 2025-01-15 15:49:24 +01:00
2fd78879f6 SAC works Adil Zouitine 2025-01-14 11:34:52 +01:00
e8449e9630 remove breakpoint Adil Zouitine 2025-01-13 17:58:00 +01:00
a0e2be8b92 [WIP] correct sac implementation Adil Zouitine 2025-01-13 17:54:11 +01:00
181727c0fe Extend reward classifier for multiple camera views (#626) Michel Aractingi 2025-01-13 13:57:49 +01:00
d1d6ffd23c [Port HIL_SERL] Final fixes for the Reward Classifier (#598) Eugene Mironov 2025-01-06 17:34:00 +07:00
e5801f467f added temporary fix for missing task_index key in online environment Michel Aractingi 2024-12-30 13:47:28 +00:00
c6ca9523de split encoder for critic and actor Michel Aractingi 2024-12-29 23:59:39 +00:00
642e3a3274 style fixes Michel Aractingi 2024-12-29 14:35:21 +00:00
146148c48c Refactor SAC configuration and policy for improved action sampling and stability KeWang1017 2024-12-29 12:30:39 +00:00
8f15835daa Refine SAC configuration and policy for enhanced performance KeWang1017 2024-12-28 22:11:34 +00:00
022bd65125 Refactor SACPolicy for improved action sampling and standard deviation handling KeWang1017 2024-12-28 18:07:15 +00:00
63d8c96514 trying to get sac running KeWang1017 2024-12-26 23:38:46 +00:00
4624a836e5 Added normalization schemes and style checks Michel Aractingi 2024-12-29 12:51:21 +00:00
ad7eea132d added optimizer and sac to factory.py Michel Aractingi 2024-12-23 14:12:03 +01:00
22a1899ff4 [HIL-SERL PORT] Fix linter issues (#588) Eugene Mironov 2024-12-23 16:44:29 +07:00
17a3a31b5f [Port Hil-SERL] Add unit tests for the reward classifier & fix imports & check script (#578) Eugene Mironov 2024-12-23 16:43:55 +07:00
1a8b99e360 added comments from kewang Michel Aractingi 2024-12-17 18:03:46 +01:00
6db2154f28 Enhance SAC configuration and policy with new parameters and subsampling logic KeWang1017 2024-12-17 15:58:04 +00:00
be3adda95f Port SAC WIP (#581) KeWang 2024-12-17 13:26:17 +00:00
9d48d236c1 completed losses Michel Aractingi 2024-12-12 11:45:30 +01:00
b57d6a7776 nit in control_robot.py Michel Aractingi 2024-12-11 00:30:33 +01:00
d1f76cba8e Update lerobot/scripts/train_hilserl_classifier.py Michel Aractingi 2024-12-11 00:22:10 +01:00
d78cef1fee Fixup Eugene Mironov 2024-12-17 02:42:53 +07:00
30a808c0ae Add human intervention mechanism and eval_robot script to evaluate policy on the robot (#541) Michel Aractingi 2024-12-09 19:17:47 +01:00
4a7f85a6ec Reward classifier and training (#528) Yoel 2024-12-09 10:21:50 +01:00
a22fe8a6de Refactor SACObservationEncoder to improve modularity and readability. Split initialization into dedicated methods for image and state layers, and enhance caching logic for image features. Update forward method to streamline feature encoding and ensure proper normalization handling. user/michel-aractingi/tmp-port-hil-serl-new AdilZouitine 2025-04-18 12:22:14 +00:00
b6b9635be6 Remove names Simon Alibert 2025-04-18 09:48:16 +02:00
21b1026872 Remove deprecated dynamixel_calibration Simon Alibert 2025-04-18 09:34:46 +02:00
8c3eab32b0 Remove deprecated configure_motor Simon Alibert 2025-04-18 09:19:43 +02:00
29633865c7 Fix _find_single_motor Simon Alibert 2025-04-18 09:18:56 +02:00
0fc9a4341f fix: separate threads for obs streaming, action receiving & execution + action queue reconciliation Francesco Capuano 2025-04-17 21:09:58 +02:00
d40e74f371 fix: streams inference process using LIFO on obs Francesco Capuano 2025-04-17 21:09:04 +02:00
40237f5ea3 fix: ruff, get your hands off compiled files Francesco Capuano 2025-04-17 20:33:54 +02:00
2bcdb57854 fix: bus ids Francesco Capuano 2025-04-17 20:02:59 +02:00
e9ca1b612d fix: send obs, receives and queues actions chunk, overwrites queue periodically Francesco Capuano 2025-04-15 12:00:33 +02:00
169babd621 fix: server predicts multiple actions for a given observation, VLA-like Francesco Capuano 2025-04-15 11:59:59 +02:00
a9031ee1be add: server computes action, robot's daemon constantly reads it Francesco Capuano 2025-04-14 19:25:44 +02:00
fc107a2c6e add: robot can send observations Francesco Capuano 2025-04-14 17:29:21 +02:00
84fabbf4af add: grpc service between robot and remote policy server Francesco Capuano 2025-04-14 15:40:15 +02:00
49b5f379a7 Refactor SACPolicy initialization by breaking down the constructor into smaller methods for normalization, encoders, critics, actor, and temperature setup. This enhances readability and maintainability. AdilZouitine 2025-04-17 16:37:43 +00:00
7a3d8756b4 Refactor input and output normalization handling in SACPolicy for improved clarity and efficiency. Consolidate encoder initialization logic and remove redundant else statements. AdilZouitine 2025-04-17 16:05:11 +00:00
702749b7d3 Fix setup_motor & add it to robots Simon Alibert 2025-04-17 16:56:23 +02:00
b43ece8934 Add pythno3-dev in Dockerfile to build and modify Readme.md , python-dev to python3-dev (#987) k1000dai 2025-04-17 16:17:07 +02:00
c10c5a0e64 Fix --width --height type parsing on opencv and intelrealsense scripts (#556) Alex Thiele 2025-04-17 06:19:23 -07:00
a8db91c40e Fix Windows HTML visualization to make videos could be seen (#647) Junshan Huang 2025-04-17 21:07:28 +08:00
0f5f7ac780 Fix broken links in examples/4_train_policy_with_script.md (#697) HUANG TZU-CHUN 2025-04-17 20:59:43 +08:00
bf1c737858 Fix calibration msg display Simon Alibert 2025-04-17 13:18:32 +02:00
d07c7347f8 Add setup_motor Simon Alibert 2025-04-17 13:14:06 +02:00
54b5c805bf Revert mistake convert_dataset_v20_to_v21.py Remi Cadene 2025-04-17 04:47:00 +02:00
eab5543750 Merge (No verify) Remi Cadene 2025-04-17 04:46:09 +02:00
e42485c837 refactor(cameras): remove tmp video capture in connect test/add_cameras_di_tests_no_tmp_connect Steven Palma 2025-04-17 00:51:24 +02:00
cdcb27f908 test(cameras): add opencv camera dependency injection tests suite test/add_cameras_di_tests Steven Palma 2025-04-16 22:13:22 +02:00
79498ab967 refactor(cameras): remove tmp video capture in connect test/add_cameras_patch_tests_no_tmp_connect Steven Palma 2025-04-17 00:33:31 +02:00
cb10f97ccc test(cameras): add opencv camera patch tests suite test/add_cameras_patch_tests Steven Palma 2025-04-15 17:47:51 +02:00
6b6a990f4c most unit tests passing (TODO: convert datasets) Remi Cadene 2025-04-16 21:30:58 +02:00
dc1548fe1a Fix init temp Co-authored-by: s1lent4gnt <kmeftah.khalil@gmail.com> AdilZouitine 2025-04-16 14:43:47 +00:00
23c9441d5f Update log_std_min type to float in PolicyConfig for consistency AdilZouitine 2025-04-15 14:02:24 +00:00
870e3efb92 fix caching AdilZouitine 2025-04-15 13:16:22 +00:00
bfd48a8b70 Handle caching AdilZouitine 2025-04-15 13:02:31 +00:00
5dc7ff6d3c change the tanh distribution to match hil serl Co-authored-by: s1lent4gnt <kmeftah.khalil@gmail.com> AdilZouitine 2025-04-15 08:31:14 +00:00
ee4ebeac9b match target entropy hil serl AdilZouitine 2025-04-15 08:00:38 +00:00
fe7b47f459 stick to hil serl nn architecture AdilZouitine 2025-04-15 07:44:32 +00:00
044ca3b039 Refactor modeling_sac and parameter handling for clarity and reusability. AdilZouitine 2025-04-14 14:00:57 +00:00
bc36c69b71 fix encoder training AdilZouitine 2025-04-11 11:50:46 +00:00
2b9b05f1ba [pre-commit.ci] auto fixes from pre-commit.com hooks pre-commit-ci[bot] 2025-04-09 15:05:17 +00:00
9eec7b8bb0 General fixes in code, removed delta action, fixed grasp penalty, added logic to put gripper reward in info Michel Aractingi 2025-04-09 17:04:43 +02:00
a80a9cf379 [pre-commit.ci] auto fixes from pre-commit.com hooks pre-commit-ci[bot] 2025-04-09 13:51:31 +00:00
7a42af835e fix caching and dataset stats is optional AdilZouitine 2025-04-09 13:20:51 +00:00
9751328783 Add rounding for safety AdilZouitine 2025-04-08 08:50:02 +00:00
7225bc74a3 [pre-commit.ci] auto fixes from pre-commit.com hooks pre-commit-ci[bot] 2025-04-07 15:48:39 +00:00
03b1644bf7 fix sign issue AdilZouitine 2025-04-07 15:44:06 +00:00
9b6e5a383f Refactor complementary_info handling in ReplayBuffer AdilZouitine 2025-04-07 14:48:42 +00:00
86466b025f Handle gripper penalty AdilZouitine 2025-04-07 08:23:49 +00:00
54745f111d fix caching AdilZouitine 2025-04-04 14:29:38 +00:00
82584cca78 [pre-commit.ci] auto fixes from pre-commit.com hooks pre-commit-ci[bot] 2025-04-04 07:59:22 +00:00
d3a8c2c247 fix indentation issue AdilZouitine 2025-04-03 16:05:29 +00:00
74c11c4a75 Enhance SAC configuration and replay buffer with asynchronous prefetching support AdilZouitine 2025-04-03 14:23:50 +00:00
2d932b710c Enhance SACPolicy to support shared encoder and optimize action selection AdilZouitine 2025-04-03 07:44:46 +00:00
a54baceabb Enhance SACPolicy and learner server for improved grasp critic integration AdilZouitine 2025-04-02 15:50:39 +00:00
077d18b439 Refactor SACPolicy for improved readability and action dimension handling AdilZouitine 2025-04-01 15:43:29 +00:00
c6cd1475a7 Add mock gripper support and enhance SAC policy action handling AdilZouitine 2025-04-01 14:22:08 +00:00
e35ee47b07 Refactor SAC policy and training loop to enhance discrete action support AdilZouitine 2025-04-01 11:42:28 +00:00
c3f2487026 Refactor SAC configuration and policy to support discrete actions AdilZouitine 2025-04-01 09:30:32 +00:00
c621077b62 Added Gripper quantization wrapper and grasp penalty removed complementary info from buffer and learner server removed get_gripper_action function added gripper parameters to common/envs/configs.py Michel Aractingi 2025-04-01 11:08:15 +02:00
f5cfd9fd48 [pre-commit.ci] auto fixes from pre-commit.com hooks pre-commit-ci[bot] 2025-03-31 16:10:00 +00:00
22da1739b1 Add grasp critic to the training loop s1lent4gnt 2025-03-31 18:06:21 +02:00
d38d5f988d Add get_gripper_action method to GamepadController s1lent4gnt 2025-03-31 17:40:00 +02:00
8d1936ffe0 Add gripper penalty wrapper s1lent4gnt 2025-03-31 17:38:16 +02:00