Login / Signup
On the Combinatorial Multi-Armed Bandit Problem with Markovian Rewards.
Yi Gai
Bhaskar Krishnamachari
Mingyan Liu
Published in:
GLOBECOM (2011)
Keyphrases
</>
total reward
reinforcement learning
markov decision processes
multiarmed bandit
bandit problems
long term and short term
action selection
multi armed bandit
multi armed bandits
information systems
lower bound
computational complexity
machine learning
computational geometry
multi agent systems
bayesian networks
image segmentation
multi armed bandit problems