Simple and optimal methods for stochastic variational inequalities, II: Markovian noise and policy evaluation in reinforcement learning.
Georgios Kotsalis
Guanghui Lan
Tianjiao Li
Published in:
CoRR (2020)
Keyphrases
reinforcement learning
variational inequalities
model free
dynamic programming
computational complexity
Monte Carlo
temporal difference
policy evaluation
denoising
sensitivity analysis
function approximation