Login / Signup
Information-Theoretic Regret Bounds for Bandits with Fixed Expert Advice.
Khaled Eldowa
Nicolò Cesa-Bianchi
Alberto Maria Metelli
Marcello Restelli
Published in:
CoRR (2023)
Keyphrases
</>
information theoretic
regret bounds
expert advice
bregman divergences
mutual information
lower bound
information theory
online learning
linear regression
upper bound
multi armed bandit
log likelihood
kl divergence
least squares
similarity measure