Login / Signup
A Finite Time Analysis of Two Time-Scale Actor Critic Methods.
Yue Wu
Weitong Zhang
Pan Xu
Quanquan Gu
Published in:
CoRR (2020)
Keyphrases
</>
optimization methods
gradient method
learning algorithm
temporal difference