Geometric Insights into the Convergence of Nonlinear TD Learning.

David Brandfonbrener, Joan Bruna

Published in: ICLR (2020)

Keyphrases

td learning
temporal difference
evaluation function
function approximation
reinforcement learning
multi step
reinforcement learning algorithms
convergence speed
data mining
step size
policy evaluation
training set
model selection
radial basis function
convergence rate