Login / Signup
A Convergent Online Single Time Scale Actor Critic Algorithm.
Dotan Di Castro
Ron Meir
Published in:
J. Mach. Learn. Res. (2010)
Keyphrases
</>
learning algorithm
actor critic
objective function
computational complexity
np hard
dynamic programming
optimization algorithm
machine learning
search space
neural network
optimal solution
linear programming
gradient method