The Statistical Benefits of Quantile Temporal-Difference Learning for Value Estimation.
Mark Rowland
Yunhao Tang
Clare Lyle
Rémi Munos
Marc G. Bellemare
Will Dabney
Published in:
ICML (2023)
Keyphrases
temporal difference learning
function approximation
fixed point
reinforcement learning
evaluation function
temporal difference
game playing
approximate value iteration
reinforcement learning algorithms
neural network
training data
learning environment