Extrapolating Beyond Suboptimal Demonstrations via Inverse Reinforcement Learning from Observations.
Daniel S. Brown
Wonjoon Goo
Prabhat Nagarajan
Scott Niekum
Published in: CoRR (2019)
Keyphrases
inverse reinforcement learning
Bayesian nonparametric
partially observable environments
preference elicitation
reward function
multiple agents
artificial intelligence
topic models