Login / Signup
Sharp bounds on the price of bandit feedback for several models of mistake-bounded online learning.
Raymond Feng
Jesse Geneson
Andrew Lee
Espen Slettnes
Published in:
Theor. Comput. Sci. (2023)
Keyphrases
</>
online learning
prior knowledge
probabilistic model
upper bound
graphical models
markov chain
complex systems
statistical models
e learning
active learning
experimental data
classification models
asymptotically optimal