The ODE Method for Stochastic Approximation and Reinforcement Learning with Markovian Noise.

Shuze Liu Shuhang Chen Shangtong Zhang

Published in: CoRR (2024)

Keyphrases

stochastic approximation
dynamic programming
reinforcement learning
neural network
cost function
support vector machine svm
objective function
machine learning
training data
markov random field
markov chain
monte carlo