Thompson Sampling in Dynamic Systems for Contextual Bandit Problems.
Tianbing XuYaming YuJohn TurnerAmelia ReganPublished in: CoRR (2013)
Keyphrases
- dynamic systems
- bandit problems
- exploration exploitation
- multi armed bandits
- complex systems
- decision problems
- qualitative reasoning
- discrete event
- consistency based diagnosis
- contextual information
- state variables
- dynamical systems
- qualitative models
- model based diagnosis
- artificial intelligence
- ordinary differential equations
- multi armed bandit
- linear time invariant
- particle filter