Login / Signup
PAC Identification of a Bandit Arm Relative to a Reward Quantile.
Arghya Roy Chaudhuri
Shivaram Kalyanakrishnan
Published in:
AAAI (2017)
Keyphrases
</>
bandit problems
multi armed bandit problems
reinforcement learning
automatic identification
information systems
decision trees
special case
upper bound
learning algorithm
decision problems
multi armed bandit