Login / Signup
Improved Regret Bounds of Bilinear Bandits using Action Space Analysis.
Kyoungseok Jang
Kwang-Sung Jun
Se-Young Yun
Wanmo Kang
Published in:
ICML (2021)
Keyphrases
</>
higher order
bayesian networks
objective function
active learning
language model
sufficient conditions
multi armed bandit