Login / Signup
A Self-Tuning Actor-Critic Algorithm.
Tom Zahavy
Zhongwen Xu
Vivek Veeriah
Matteo Hessel
Junhyuk Oh
Hado van Hasselt
David Silver
Satinder Singh
Published in:
NeurIPS (2020)
Keyphrases
</>
optimal solution
learning algorithm
dynamic programming
cost function
particle swarm optimization
computational complexity
np hard
monte carlo
optimization algorithm
actor critic
machine learning
convergence rate
dynamic environments
linear programming
simulated annealing
state space
search space