Publication: Finite-Memory Near-Optimal Learning for Markov Decision Processes with Long-Run Average Reward.