Off-policy Distributional Q(λ): Distributional RL without Importance Sampling.

Published in: CoRR (2024)

Keyphrases