Imitation Upper Confidence Bound for Bandits on a Graph.
Andrei Lupu
Doina Precup
Published in:
AAAI (2018)
Keyphrases
upper confidence bound
contextual bandit
graph theory
bipartite graph
graph model
random walk
structured data
directed graph
reinforcement learning
graph representation
graph structure
data analysis
k nearest neighbor