Publication: Bayesian Learning of Optimal Policies in Markov Decision Processes with Countably Infinite State-Space.