Scalable Representation Learning in Linear Contextual Bandits with Constant Regret Guarantees.
Andrea Tirinzoni
Matteo Papini
Ahmed Touati
Alessandro Lazaric
Matteo Pirotta
Published in:
NeurIPS (2022)
Keyphrases
learning process
online learning
learning algorithm
learning systems
inductive inference
multiscale
knowledge acquisition
multi-armed bandits
prior knowledge
unsupervised learning
background knowledge
learning community