Pseudo-reward Algorithms for Contextual Bandits with Linear Payoff Functions.
Ku-Chun Chou, Hsuan-Tien Lin, Chao-Kai Chiang, Chi-Jen Lu
Published in: ACML (2014)
Keyphrases
- recently developed
- linear models
- data structure
- combinatorial optimization
- multi-armed bandit
- learning algorithm
- reinforcement learning
- graph theory
- neural network
- least squares
- context sensitive
- orders of magnitude
- machine learning algorithms
- optimization problems
- computational cost
- computational complexity
- search algorithm
- Bayesian networks
- data mining