Login / Signup

A novel Q-learning algorithm with function approximation for constrained Markov decision processes.

K. LakshmananShalabh Bhatnagar
Published in: Allerton Conference (2012)
Keyphrases