Balanced Q-learning: Combining the Influence of Optimistic and Pessimistic Targets.

Published in: CoRR (2021)

Keyphrases