Online Observer-Based Inverse Reinforcement Learning.
Ryan Self
Kevin Coleman
He Bai
Rushikesh Kamalapurkar
Published in:
CoRR (2020)
Keyphrases
inverse reinforcement learning
partially observable environments
bayesian nonparametric
preference elicitation
artificial intelligence
bayesian networks
cost function
reward function