FREEDOM, added back the optimization loop code in learner_server.py

Ran experiment with pushcube env from maniskill. The learning seem to work.

Co-authored-by: Adil Zouitine <adilzouitinegm@gmail.com>
This commit is contained in:
Michel Aractingi
2025-01-28 17:25:49 +00:00
parent 508f5d1407
commit 2ae657f568
3 changed files with 85 additions and 80 deletions

View File

@@ -82,7 +82,7 @@ policy:
temperature_lr: 3e-4
# critic_target_update_weight: 0.005
critic_target_update_weight: 0.01
utd_ratio: 1
utd_ratio: 2
# # Loss coefficients.