Publication: Logarithmic regret bounds for continuous-time average-reward Markov decision processes.