Login / Signup
Extrapolating Beyond Suboptimal Demonstrations via Inverse Reinforcement Learning from Observations.
Daniel S. Brown
Wonjoon Goo
Prabhat Nagarajan
Scott Niekum
Published in:
ICML (2019)
Keyphrases
</>
inverse reinforcement learning
bayesian nonparametric
partially observable environments
preference elicitation
reward function
graphical models
multi agent
search space
decision makers
gaussian process