Adaptive KL-UCB Based Bandit Algorithms for Markovian and I.I.D. Settings.

Arghyadip Roy Sanjay Shakkottai R. Srikant

Published in: IEEE Trans. Autom. Control. (2024)

Keyphrases

computationally efficient
bandit problems
data structure
worst case
mutual information
neural network
multi armed bandit
upper bound
theoretical analysis
orders of magnitude
computationally expensive
times faster
upper confidence bound
adaptive algorithms
graph theory
recently developed
machine learning algorithms
particle swarm optimization
lower bound
image processing
computer vision
social networks