The Best of Both Worlds: Reinforcement Learning with Logarithmic Regret and Policy Switches.
Grigoris Velegkas, Zhuoran Yang, Amin Karbasi
Published in: CoRR (2022)
Keyphrases
- reinforcement learning
- total reward
- reward function
- optimal policy
- policy search
- reinforcement learning algorithms
- Markov decision processes
- Markov decision process
- action selection
- worst case
- state space
- regret bounds
- partially observable
- function approximation
- average reward
- reinforcement learning problems
- expected reward
- policy iteration
- Markov decision problems
- inverse reinforcement learning
- state and action spaces
- partially observable environments
- function approximators
- model free
- actor critic
- temporal difference
- policy gradient
- action space
- infinite horizon
- control policies
- approximate dynamic programming
- learning algorithm
- agent learns
- control policy
- initial state
- multi agent
- machine learning
- state action
- state dependent
- online learning
- continuous state
- multi armed bandit
- decision problems
- continuous state spaces
- minimax regret
- partially observable Markov decision processes
- upper bound
- policy evaluation
- finite horizon
- multi armed bandit problems
- confidence bounds
- multiple agents
- policy gradient methods
- lower bound
- bandit problems
- long run
- temporal difference learning
- transition model
- expert advice
- RL algorithms