Login / Signup
N-Timescale Stochastic Approximation: Stability and Convergence.
Rohan Deb
Shalabh Bhatnagar
Published in:
CoRR (2021)
Keyphrases
</>
stochastic approximation
monte carlo
reinforcement learning
machine learning
graphical models
policy iteration
markov decision processes
theoretical guarantees