Login / Signup

Optimal Sample Complexity of Reinforcement Learning for Uniformly Ergodic Discounted Markov Decision Processes.

Shengbo WangJose BlanchetPeter W. Glynn
Published in: CoRR (2023)
Keyphrases