Q-learning with Logarithmic Regret.

Kunhe Yang, Lin F. Yang, Simon S. Du

Published in: CoRR (2020)

Keyphrases