Inverse Reinforcement Learning without Reinforcement Learning.
Gokul SwamySanjiban ChoudhuryJ. Andrew BagnellZhiwei Steven WuPublished in: CoRR (2023)
Keyphrases
- inverse reinforcement learning
- partially observable environments
- reinforcement learning
- reward function
- temporal difference
- bayesian nonparametric
- reinforcement learning algorithms
- preference elicitation
- state space
- markov decision processes
- partially observable
- markov decision process
- function approximation
- multiple agents
- multi agent
- random walk
- learning algorithm
- control policies
- partial observability
- bayesian networks