Learning Continuous Control Policies by Stochastic Value Gradients.

Published in: NIPS (2015)

Keyphrases