Geometric Insights into the Convergence of Nonlinear TD Learning.
David Brandfonbrener
Joan Bruna
Published in:
ICLR (2020)
Keyphrases
td learning
temporal difference
evaluation function
function approximation
reinforcement learning
multi step
reinforcement learning algorithms
convergence speed
data mining
step size
policy evaluation
training set
model selection
radial basis function
convergence rate