Estimating Optimal Policy Value in Linear Contextual Bandits Beyond Gaussianity.

Published in: Trans. Mach. Learn. Res. (2024)

Keyphrases