lerobot

Files

Ke-Wang1017 f99e670976 Refactor SACPolicy and configuration for improved training dynamics

- Introduced target critic networks in SACPolicy to enhance stability during training.
- Updated TD target calculation to incorporate entropy adjustments, improving robustness.
- Increased online buffer capacity in configuration from 10,000 to 40,000 for better data handling.
- Adjusted learning rates for critic, actor, and temperature to 3e-4 for optimized training performance.

These changes aim to refine the SAC implementation, enhancing its robustness and performance during training and inference.

2025-01-06 10:14:34 +00:00

datasets

Reward classifier and training (#528 )

2024-12-17 02:41:29 +07:00

envs

small fix: assertion error message in envs/utils.py (#426 )

2024-09-12 18:03:34 +02:00

policies

Refactor SACPolicy and configuration for improved training dynamics

2025-01-06 10:14:34 +00:00

robot_devices

Fixup

2024-12-17 02:42:53 +07:00

utils

Make say(blocking=True) work for Linux (#460 )

2024-10-17 15:22:21 +01:00

logger.py

Added normalization schemes and style checks

2024-12-29 12:51:21 +00:00