Combinatorial Multi-Armed Bandit with General Reward Functions.
Wei Chen
Wei Hu
Fu Li
Jian Li
Yu Liu
Pinyan Lu
Published in:
NIPS (2016)
Keyphrases
special case
reward function
multi-armed bandit
reinforcement learning
multi-armed bandits
decision making
image segmentation
probability distribution
Markov decision processes