Login / Signup
The ODE Method for Stochastic Approximation and Reinforcement Learning with Markovian Noise.
Shuze Liu
Shuhang Chen
Shangtong Zhang
Published in:
CoRR (2024)
Keyphrases
</>
stochastic approximation
dynamic programming
reinforcement learning
neural network
cost function
support vector machine svm
objective function
machine learning
training data
markov random field
markov chain
monte carlo