Login / Signup
Adversarial Soft Advantage Fitting: Imitation Learning without Policy Optimization.
Paul Barde
Julien Roy
Wonseok Jeon
Joelle Pineau
Christopher J. Pal
Derek Nowrouzezahrai
Published in:
CoRR (2020)
Keyphrases
</>
imitation learning
reinforcement learning
maximum margin
real time
multi agent
logic programs
optimal policy
pattern classification
robotic systems
hyperplane