Infinite Time Horizon Maximum Causal Entropy Inverse Reinforcement Learning.
Zhengyuan ZhouMichael BloemNicholas BambosPublished in: IEEE Trans. Autom. Control. (2018)
Keyphrases
- inverse reinforcement learning
- bayesian nonparametric
- partially observable environments
- preference elicitation
- reward function
- bayesian networks
- special case
- temporal difference
- sufficient conditions
- machine learning
- convergence rate
- decision theory
- gaussian process
- dynamic programming
- multi objective
- control system
- artificial intelligence