Login / Signup

Optimistic Posterior Sampling for Reinforcement Learning: Worst-Case Regret Bounds.

Shipra AgrawalRandy Jia
Published in: Math. Oper. Res. (2023)
Keyphrases