Added additional wrappers for the environment: Action repeat, keyboard interface, reset wrapper

Tested the reset mechanism and keyboard interface and the convert wrapper on the robots.

Co-authored-by: Adil Zouitine <adilzouitinegm@gmail.com>
This commit is contained in:
Michel Aractingi
2025-02-04 17:41:14 +00:00
parent ca45c34ad5
commit d143043037
3 changed files with 262 additions and 52 deletions

View File

@@ -273,6 +273,9 @@ def act_with_policy(cfg: DictConfig):
# TODO (michel-aractingi): Label the reward
# if config.label_reward_on_actor:
# reward = reward_classifier(obs)
if info["is_intervention"]:
# TODO: Check the shape
action = info["action_intervention"]
list_transition_to_send_to_learner.append(
Transition(
@@ -281,7 +284,7 @@ def act_with_policy(cfg: DictConfig):
reward=reward,
next_state=next_obs,
done=done,
complementary_info=None,
complementary_info=info, # TODO Handle information for the transition, is_demonstraction: bool
)
)