Stochastic first-order methods for average-reward Markov decision processes.

Tianjiao Li, Feiyang Wu, Guanghui Lan
Published in: CoRR (2022)
Keyphrases