Efficient Optimal Learning for Contextual Bandits.

Miroslav Dudík, Daniel J. Hsu, Satyen Kale, Nikos Karampatziakis, John Langford, Lev Reyzin, Tong Zhang

Published in: UAI (2011)

Keyphrases

learning process
unsupervised learning
learning algorithm
active learning
machine learning
incremental learning
contextual information
online learning
dynamic programming
multi-armed bandits
inductive inference
learning problems
learning tasks
computationally efficient
knowledge acquisition
supervised learning
neural network
mobile devices
reinforcement learning
information systems
genetic algorithm
information retrieval
data mining