A Non-asymptotic Analysis of Non-parametric Temporal-Difference Learning.

Eloïse Berthier Ziad Kobeissi Francis R. Bach

Published in: NeurIPS (2022)

Keyphrases

asymptotic analysis
temporal difference learning
function approximation
fixed point
reinforcement learning
evaluation function
game playing
fluid model
gaussian process
temporal difference
monte carlo
reinforcement learning algorithms
markov decision process
machine learning
cost function
state space
neural network