Exponential Regret Bounds for Gaussian Process Bandits with Deterministic Observations.
Nando de FreitasAlexander J. SmolaMasrour ZoghiPublished in: ICML (2012)
Keyphrases
- regret bounds
- gaussian process
- lower bound
- online learning
- linear regression
- regression model
- bayesian framework
- hyperparameters
- multi armed bandit
- model selection
- upper bound
- latent variables
- semi supervised
- approximate inference
- bregman divergences
- least squares
- reinforcement learning
- e learning
- sample size
- prior information
- maximum a posteriori
- learning algorithm