Login / Signup
Controlling bicycle using deep deterministic policy gradient algorithm.
Le Pham Tuyen
TaeChoong Chung
Published in:
URAI (2017)
Keyphrases
</>
dynamic programming
learning algorithm
computational complexity
policy gradient
optimal solution
search algorithm
evolutionary algorithm
np hard
gradient ascent
search space
machine learning
worst case
mathematical model
monte carlo
path planning
convergence rate
gradient method