Sharp bounds on the price of bandit feedback for several models of mistake-bounded online learning.
Raymond Feng
Jesse Geneson
Andrew Lee
Espen Slettnes
Published in:
CoRR (2022)
Keyphrases
online learning
statistical models
high quality
bayesian networks
multi agent
prior knowledge
probabilistic model
upper bound
least squares
complex systems
experimental data
computational models
computer mediated
parametric models