C
search
search
reviewers
reviewers
feeds
feeds
assignments
assignments
settings
logout
Extrapolating Beyond Suboptimal Demonstrations via Inverse Reinforcement Learning from Observations.
Daniel S. Brown
Wonjoon Goo
Prabhat Nagarajan
Scott Niekum
Published in:
ICML (2019)
Keyphrases
</>
inverse reinforcement learning
bayesian nonparametric
partially observable environments
preference elicitation
reward function
graphical models
multi agent
search space
decision makers
gaussian process