Login / Signup
Improved Policy Optimization for Online Imitation Learning.
Jonathan Wilder Lavington
Sharan Vaswani
Mark Schmidt
Published in:
CoLLAs (2022)
Keyphrases
</>
feature selection
imitation learning
reinforcement learning
real time
optimal policy
robotic systems
computer vision
dynamic programming
multi modal
multi task
markov decision process