Publication: Contextual-MDPs for PAC-Reinforcement Learning with Rich Observations.