Q-learning with Logarithmic Regret.

Kunhe Yang Lin F. Yang Simon S. Du

Published in: AISTATS (2021)

Keyphrases