Login / Signup
Learning Continuous Control Policies by Stochastic Value Gradients.
Nicolas Heess
Gregory Wayne
David Silver
Timothy P. Lillicrap
Tom Erez
Yuval Tassa
Published in:
NIPS (2015)
Keyphrases
</>
control policies
reinforcement learning
learning algorithm
stochastic optimization problems
decision making
prior knowledge
continuous state