Login / Signup
Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs.
Qingyang Zhang
Yiming Yang
Jingqing Ruan
Xuantang Xiong
Dengpeng Xing
Bo Xu
Published in:
IJCNN (2023)
Keyphrases
</>
hierarchical reinforcement learning
balancing exploration and exploitation
reinforcement learning
model free
state abstraction
latent variables
learning algorithm
dynamic programming
learning to rank
weighted graph
average reward