Exponential Regret Bounds for Gaussian Process Bandits with Deterministic Observations.

Nando de Freitas, Alexander J. Smola, Masrour Zoghi

Published in: ICML (2012)

Keyphrases

regret bounds
Gaussian process
lower bound
online learning
linear regression
regression model
Bayesian framework
hyperparameters
multi-armed bandit
model selection
upper bound
latent variables
semi-supervised
approximate inference
Bregman divergences
least squares
reinforcement learning
e-learning
sample size
prior information
maximum a posteriori
learning algorithm