Sign in

Near Sample-Optimal Reduction-based Policy Learning for Average Reward MDP.

Jinghan WangMengdi WangLin F. Yang
Published in: CoRR (2022)
Keyphrases