A functional mirror ascent view of policy gradient methods with function approximation.

Sharan Vaswani Olivier Bachem Simone Totaro Robert Mueller Matthieu Geist Marlos C. Machado Pablo Samuel Castro Nicolas Le Roux

Published in: CoRR (2021)

Keyphrases