Infinite Time Horizon Maximum Causal Entropy Inverse Reinforcement Learning.

Zhengyuan Zhou Michael Bloem Nicholas Bambos

Published in: IEEE Trans. Autom. Control. (2018)

Keyphrases

inverse reinforcement learning
bayesian nonparametric
partially observable environments
preference elicitation
reward function
bayesian networks
special case
temporal difference
sufficient conditions
machine learning
convergence rate
decision theory
gaussian process
dynamic programming
multi objective
control system
artificial intelligence