RLIF: Interactive Imitation Learning as Reinforcement Learning.
Jianlan Luo, Perry Dong, Yuexiang Zhai, Yi Ma, Sergey Levine
Published in: ICLR (2024)
Keyphrases
- imitation learning
- reinforcement learning
- reinforcement learning methods
- function approximation
- state space
- reinforcement learning algorithms
- learning algorithm
- maximum margin
- control problems
- Markov decision processes
- temporal difference
- model-free
- learning process
- optimal policy
- learning problems
- machine learning
- action selection
- multi-agent
- dynamic programming
- optimal control
- average reward