Sign in

Non-Linear Reinforcement Learning in Large Action Spaces: Structural Conditions and Sample-efficiency of Posterior Sampling.

Alekh AgarwalTong Zhang
Published in: CoRR (2022)
Keyphrases