Login / Signup
Towards Understanding Asynchronous Advantage Actor-Critic: Convergence and Linear Speedup.
Han Shen
Kaiqing Zhang
Mingyi Hong
Tianyi Chen
Published in:
IEEE Trans. Signal Process. (2023)
Keyphrases
</>
convergence proof
actor critic
lyapunov stability
reinforcement learning
convergence speed
policy gradient
optimal control
temporal difference
gradient method
approximate dynamic programming
machine learning
multi agent systems
neuro fuzzy
function approximation
recursive least squares
lyapunov function