Regularized OFU: an Efficient UCB Estimator for Non-linear Contextual Bandit.
Yichi Zhou
Shihong Song
Huishuai Zhang
Jun Zhu
Wei Chen
Tie-Yan Liu
Published in:
CoRR (2021)
Keyphrases
contextual bandit
upper confidence bound
least squares
regularized least squares
maximum likelihood
machine learning
generative model
recursive least squares