Publication: Online Reinforcement Learning for Real-Time Exploration in Continuous State and Action Markov Decision Processes.