Second-order multi-armed bandit learning for online optimization in communication and networks.

Zhiyong Du, Bin Jiang, Kun Xu, Shengyun Wei, Shengqing Wang, Huatao Zhu
Published in: ACM TUR-C (2019)
Keyphrases
  • online learning
  • learning algorithm
  • learning process
  • multi-armed bandits
  • decision trees
  • active learning
  • probability distribution
  • least squares