Imitation Upper Confidence Bound for Bandits on a Graph.

Andrei Lupu Doina Precup

Published in: AAAI (2018)

Keyphrases

upper confidence bound
contextual bandit
graph theory
bipartite graph
graph model
random walk
structured data
directed graph
reinforcement learning
graph representation
graph structure
data analysis
k nearest neighbor