On the Stability of Random Matrix Product with Markovian Noise: Application to Linear Stochastic Approximation and TD Learning.
Alain Durmus
Eric Moulines
Alexey Naumov
Sergey Samsonov
Hoi-To Wai
Published in:
CoRR (2021)
Keyphrases
stochastic approximation
td learning
neural network
artificial neural networks
linear combination
monte carlo
temporal difference