Login / Signup

Policy mirror descent for reinforcement learning: linear convergence, new sampling complexity, and generalized problem classes.

Guanghui Lan
Published in: Math. Program. (2023)
Keyphrases