From Dirichlet to Rubin: Optimistic Exploration in RL without Bonuses.

Daniil Tiapkin Denis Belomestny Eric Moulines Alexey Naumov Sergey Samsonov Yunhao Tang Michal Valko Pierre Ménard

Published in: CoRR (2022)

Keyphrases