Login / Signup
Learning Continuous Control Policies by Stochastic Value Gradients.
Nicolas Heess
Greg Wayne
David Silver
Timothy P. Lillicrap
Yuval Tassa
Tom Erez
Published in:
CoRR (2015)
Keyphrases
</>
control policies
learning algorithm
reinforcement learning
robot control
decision making
multi agent systems
control system