On the Theory of Policy Gradient Methods: Optimality, Approximation, and Distribution Shift.

Published in: J. Mach. Learn. Res. (2021)

Keyphrases