Reanalysis of Variance Reduced Temporal Difference Learning.

Tengyu Xu Zhe Wang Yi Zhou Yingbin Liang

Published in: ICLR (2020)

Keyphrases

temporal difference learning
function approximation
fixed point
game playing
evaluation function
reinforcement learning
approximate value iteration
temporal difference
reinforcement learning algorithms
markov decision process
neural network
support vector
feature selection
bayesian networks
state space
collaborative learning