C
search
search
reviewers
reviewers
feeds
feeds
assignments
assignments
settings
logout
Nonlinear Two-Time-Scale Stochastic Approximation: Convergence and Finite-Time Performance.
Thinh T. Doan
Published in:
L4DC (2021)
Keyphrases
</>
stochastic approximation
monte carlo
temporal difference learning
reinforcement learning
policy iteration
artificial neural networks