A Provably Efficient Model-Free Posterior Sampling Method for Episodic Reinforcement Learning.

Published in: NeurIPS (2021)

Keyphrases