Parallelizing Exploration-Exploitation Tradeoffs with Gaussian Process Bandit Optimization.
Thomas Desautels, Andreas Krause, Joel W. Burdick
Published in: ICML (2012)
Keyphrases
- gaussian process
- exploration exploitation
- bandit problems
- gaussian processes
- active learning
- regression model
- bayesian framework
- gaussian process regression
- semi supervised
- hyperparameters
- random sampling
- model selection
- reinforcement learning
- latent variables
- gaussian process models
- maximum likelihood
- relevance feedback
- lower bound