Double Doubly Robust Thompson Sampling for Generalized Linear Contextual Bandits.

Wonyoung KimKyungbok LeeMyunghee Cho Paik
Published in: CoRR (2022)