RLIF: Interactive Imitation Learning as Reinforcement Learning.

Jianlan Luo Perry Dong Yuexiang Zhai Yi Ma Sergey Levine

Published in: ICLR (2024)

Keyphrases

imitation learning
reinforcement learning
reinforcement learning methods
function approximation
state space
reinforcement learning algorithms
learning algorithm
maximum margin
control problems
markov decision processes
temporal difference
model free
learning process
optimal policy
learning problems
machine learning
action selection
multi agent
dynamic programming
optimal control
average reward