A Provably Efficient Model-Free Posterior Sampling Method for Episodic Reinforcement Learning.

Published in: CoRR (2022)

Keyphrases