Is Inverse Reinforcement Learning Harder than Standard Reinforcement Learning?
Lei Zhao, Mengdi Wang, Yu Bai
Published in: CoRR (2023)
Keyphrases
- inverse reinforcement learning
- reinforcement learning
- partially observable environments
- reward function
- temporal difference
- bayesian nonparametric
- np complete
- artificial intelligence
- preference elicitation
- markov decision processes
- function approximation
- reinforcement learning algorithms
- dynamical systems
- markov chain
- simple examples
- special case
- objective function
- machine learning