Login / Signup
Second-order multi-armed bandit learning for online optimization in communication and networks.
Zhiyong Du
Bin Jiang
Kun Xu
Shengyun Wei
Shengqing Wang
Huatao Zhu
Published in:
ACM TUR-C (2019)
Keyphrases
</>
online learning
learning algorithm
learning process
multi armed bandits
decision trees
active learning
probability distribution
least squares