Login / Signup
Off-policy Distributional Q(λ): Distributional RL without Importance Sampling.
Yunhao Tang
Mark Rowland
Rémi Munos
Bernardo Ávila Pires
Will Dabney
Published in:
CoRR (2024)
Keyphrases
</>
importance sampling
monte carlo
co occurrence
approximate inference
markov chain
variance reduction
particle filter
least squares
kalman filter
particle filtering
reinforcement learning
rare events
graphical models
semi supervised
belief propagation
state space
markov chain monte carlo
video sequences