Adaptive KL-UCB Based Bandit Algorithms for Markovian and I.I.D. Settings.
Arghyadip Roy, Sanjay Shakkottai, R. Srikant
Published in: IEEE Trans. Autom. Control. (2024)
Keyphrases
- computationally efficient
- bandit problems
- data structure
- worst case
- mutual information
- neural network
- multi-armed bandit
- upper bound
- theoretical analysis
- orders of magnitude
- computationally expensive
- times faster
- upper confidence bound
- adaptive algorithms
- graph theory
- recently developed
- machine learning algorithms
- particle swarm optimization
- lower bound
- image processing
- computer vision
- social networks