Login / Signup
On the Stability of Random Matrix Product with Markovian Noise: Application to Linear Stochastic Approximation and TD Learning.
Alain Durmus
Eric Moulines
Alexey Naumov
Sergey Samsonov
Hoi-To Wai
Published in:
COLT (2021)
Keyphrases
</>
stochastic approximation
td learning
neural network
reinforcement learning
evaluation function
temporal difference