Login / Signup
Nonlinear Two-Time-Scale Stochastic Approximation: Convergence and Finite-Time Performance.
Thinh T. Doan
Published in:
L4DC (2021)
Keyphrases
</>
stochastic approximation
monte carlo
temporal difference learning
reinforcement learning
policy iteration
artificial neural networks