Publication: A novel Q-learning algorithm with function approximation for constrained Markov decision processes.