Infinite time horizon maximum causal entropy inverse reinforcement learning.
Michael Bloem
Nicholas Bambos
Published in:
CDC (2014)
Keyphrases
inverse reinforcement learning
partially observable environments
Bayesian nonparametric
preference elicitation
reward function
Bayesian networks
artificial intelligence
multi-agent
semi-supervised
mixture model
decision problems