Publication: Provably Efficient Exploration in Quantum Reinforcement Learning with Logarithmic Worst-Case Regret.