Login / Signup
Quasi-Stochastic Approximation and Off-Policy Reinforcement Learning.
Andrey Bernstein
Yue Chen
Marcello Colombino
Emiliano Dall'Anese
Prashant G. Mehta
Sean P. Meyn
Published in:
CDC (2019)
Keyphrases
</>
stochastic approximation
reinforcement learning
monte carlo
temporal difference learning
neural network
supervised learning
policy iteration
learning algorithm
markov chain
function approximation
finite state
model free