FREEDOM, added back the optimization loop code in learner_server.py

Ran an experiment with the pushcube env from maniskill. Learning seems to work.

Co-authored-by: Adil Zouitine <adilzouitinegm@gmail.com>
Michel Aractingi
2025-01-28 17:25:49 +00:00
committed by AdilZouitine
parent ef64ba91d9
commit a0a81c0c12
3 changed files with 85 additions and 80 deletions


@@ -82,7 +82,7 @@ policy:
temperature_lr: 3e-4
# critic_target_update_weight: 0.005
critic_target_update_weight: 0.01
-utd_ratio: 1
+utd_ratio: 2
# # Loss coefficients.
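
For readers unfamiliar with the two settings touched in this hunk, here is a minimal sketch of one common reading of them: `utd_ratio` as the number of optimization updates per unit of newly collected data, and `critic_target_update_weight` as the interpolation weight of a soft (Polyak) target update. The function names `soft_update` and `run_learner` and the list-based "parameters" are hypothetical stand-ins for illustration; they are not taken from learner_server.py, whose actual loop may be structured differently.

```python
# Hypothetical sketch: none of these names come from learner_server.py.

def soft_update(target, online, weight):
    """Move each target parameter a fraction `weight` toward the online one."""
    return [(1.0 - weight) * t + weight * o for t, o in zip(target, online)]


def run_learner(num_data_steps, utd_ratio, critic_target_update_weight):
    """Count updates and track a toy target parameter over a training run."""
    online = [1.0]  # stand-in for the critic parameters (held fixed here)
    target = [0.0]  # stand-in for the target-critic parameters
    num_updates = 0
    for _ in range(num_data_steps):
        # Assumption: utd_ratio optimization updates per unit of new data.
        for _ in range(utd_ratio):
            # A real learner would sample a batch and take a gradient step here.
            target = soft_update(target, online, critic_target_update_weight)
            num_updates += 1
    return num_updates, target


if __name__ == "__main__":
    updates, target = run_learner(
        num_data_steps=10, utd_ratio=2, critic_target_update_weight=0.01
    )
    print(updates, target)
```

Under this reading, raising `utd_ratio` from 1 to 2 doubles the number of updates performed for the same amount of collected data, which also doubles how often the target parameters are nudged toward the online ones.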