Inverse Reinforcement Learning Under Noisy Observations.
Shervin Shahryari
Prashant Doshi
Published in:
AAMAS (2017)
Keyphrases
noisy observations
inverse reinforcement learning
Bayesian nonparametric
partially observable environments
preference elicitation
temporal difference
reward function
utility function
Bayesian networks