Scalable Bayesian Inverse Reinforcement Learning.
Alex James ChanMihaela van der SchaarPublished in: ICLR (2021)
Keyphrases
- inverse reinforcement learning
- bayesian nonparametric
- partially observable environments
- preference elicitation
- reward function
- bayesian networks
- maximum likelihood
- mixture model
- decision theory
- temporal difference
- gaussian process
- variational inference
- artificial intelligence
- dynamic systems
- dynamic programming
- objective function
- decision making