Publication: Posterior Sampling for Reinforcement Learning Without Episodes.