Efficient Optimal Learning for Contextual Bandits.
Miroslav Dudík, Daniel J. Hsu, Satyen Kale, Nikos Karampatziakis, John Langford, Lev Reyzin, Tong Zhang
Published in: UAI (2011)
Keyphrases
- learning process
- unsupervised learning
- learning algorithm
- active learning
- machine learning
- incremental learning
- contextual information
- online learning
- dynamic programming
- multi-armed bandits
- inductive inference
- learning problems
- learning tasks
- computationally efficient
- knowledge acquisition
- supervised learning
- neural network
- mobile devices
- reinforcement learning
- information systems
- genetic algorithm
- information retrieval
- data mining