Login / Signup
The Statistical Benefits of Quantile Temporal-Difference Learning for Value Estimation.
Mark Rowland
Yunhao Tang
Clare Lyle
Rémi Munos
Marc G. Bellemare
Will Dabney
Published in:
CoRR (2023)
Keyphrases
</>
temporal difference learning
fixed point
evaluation function
function approximation
approximate value iteration
reinforcement learning
game playing
temporal difference
reinforcement learning algorithms
markov decision process
bayesian networks
graphical models