Publication: Achieving Tractable Minimax Optimal Regret in Average Reward MDPs.