Login / Signup
Faster Rates, Adaptive Algorithms, and Finite-Time Bounds for Linear Composition Optimization and Gradient TD Learning.
Anant Raj
Pooria Joulani
András György
Csaba Szepesvári
Published in:
AISTATS (2022)
Keyphrases
</>
adaptive algorithms
td learning
non stationary
temporal difference
evaluation function
optimization algorithm
function approximation
lower bound
upper bound
linear model
constrained optimization
multiscale
multiresolution
edge detection
reinforcement learning
multi step