Minimax Optimal Online Imitation Learning via Replay Estimation.
Gokul Swamy
Nived Rajaraman
Matthew Peng
Sanjiban Choudhury
J. Andrew Bagnell
Zhiwei Steven Wu
Jiantao Jiao
Kannan Ramchandran
Published in:
CoRR (2022)
Keyphrases
imitation learning
worst case
computer vision
real time
image sequences
reinforcement learning
dynamic programming
upper bound
optimal control