Login / Signup
HIRL: Hierarchical Inverse Reinforcement Learning for Long-Horizon Tasks with Delayed Rewards.
Sanjay Krishnan
Animesh Garg
Richard Liaw
Lauren Miller
Florian T. Pokorny
Ken Goldberg
Published in:
CoRR (2016)
Keyphrases
</>
inverse reinforcement learning
reward function
bayesian nonparametric
preference elicitation
reinforcement learning
markov decision processes
artificial intelligence
np hard
simple examples