Nonparametric Stochastic Compositional Gradient Descent for Q-Learning in Continuous Markov Decision Problems.

Published in: ACC (2018)

Keyphrases