Login / Signup
Adaptive Algorithms for Multi-armed Bandit with Composite and Anonymous Feedback.
Siwei Wang
Haoyun Wang
Longbo Huang
Published in:
AAAI (2021)
Keyphrases
</>
adaptive algorithms
multi armed bandit
non stationary
multi armed bandits
noise cancellation
reinforcement learning
decentralized decision making
image processing
objective function
least squares
denoising
higher order