Publication: Adaptive computation of optimal nonrandomized policies in constrained average-reward MDPs.