Pseudo-reward Algorithms for Contextual Bandits with Linear Payoff Functions.

Ku-Chun Chou Hsuan-Tien Lin Chao-Kai Chiang Chi-Jen Lu

Published in: ACML (2014)

Keyphrases

recently developed
linear models
data structure
combinatorial optimization
multi armed bandit
learning algorithm
reinforcement learning
graph theory
neural network
least squares
context sensitive
orders of magnitude
machine learning algorithms
optimization problems
computational cost
computational complexity
search algorithm
bayesian networks
data mining