Leveraging Side Observations in Stochastic Bandits.
Stéphane CaronBranislav KvetonMarc LelargeSmriti BhagatPublished in: UAI (2012)
Keyphrases
- stochastic systems
- stochastic models
- regret bounds
- monte carlo
- stochastic nature
- case study
- stochastic optimization
- stochastic model
- multi armed bandits
- noisy observations
- stochastic context free grammars
- stochastic approximation
- real time
- dynamic programming
- special case
- decision trees
- decision making
- learning algorithm
- real world