Login / Signup
Adversarial Soft Advantage Fitting: Imitation Learning without Policy Optimization.
Paul Barde
Julien Roy
Wonseok Jeon
Joelle Pineau
Chris Pal
Derek Nowrouzezahrai
Published in:
NeurIPS (2020)
Keyphrases
</>
imitation learning
optimal policy
reinforcement learning
multi agent
maximum likelihood
feature selection
robotic systems