Optimal Convergence Rate for Exact Policy Mirror Descent in Discounted Markov Decision Processes.

Emmeran Johnson, Ciara Pike-Burke, Patrick Rebeschini
Published in: CoRR (2023)