Login / Signup
Sample and Feedback Efficient Hierarchical Reinforcement Learning from Human Preferences.
Robert Pinsler
Riad Akrour
Takayuki Osa
Jan Peters
Gerhard Neumann
Published in:
ICRA (2018)
Keyphrases
</>
hierarchical reinforcement learning
data mining
neural network
reinforcement learning
least squares
user preferences
maximum entropy