PAC Identification of a Bandit Arm Relative to a Reward Quantile.

Arghya Roy Chaudhuri Shivaram Kalyanakrishnan

Published in: AAAI (2017)

Keyphrases

bandit problems
multi armed bandit problems
reinforcement learning
automatic identification
information systems
decision trees
special case
upper bound
learning algorithm
decision problems
multi armed bandit