Adaptive Algorithms for Multi-armed Bandit with Composite and Anonymous Feedback.

Siwei Wang Haoyun Wang Longbo Huang

Published in: AAAI (2021)

Keyphrases

adaptive algorithms
multi armed bandit
non stationary
multi armed bandits
noise cancellation
reinforcement learning
decentralized decision making
image processing
objective function
least squares
denoising
higher order