Publication: Approximate stochastic annealing for online control of infinite horizon Markov decision processes.