LGR2: Language Guided Reward Relabeling for Accelerating Hierarchical Reinforcement Learning.
Utsav Singh
Pramit Bhattacharyya
Vinay P. Namboodiri
Published in:
CoRR (2024)
Keyphrases
hierarchical reinforcement learning
reinforcement learning
reward function
average reward
state abstraction
optimal policy
long run
machine learning
probability distribution
Markov decision processes
model free