The Possibilistic Reward Method and a Dynamic Extension for the Multi-armed Bandit Problem: A Numerical Study.
Miguel MartínAntonio Jiménez-MartínAlfonso MateosPublished in: ICORES (2017)
Keyphrases
- high precision
- study proposes
- synthetic data
- similarity measure
- detection method
- support vector machine
- pairwise
- cost function
- experimental evaluation
- reinforcement learning
- computationally efficient
- support vector machine svm
- machine learning
- prior knowledge
- regression analysis
- simulation study
- classification method
- segmentation method
- objective function
- preprocessing
- dynamic environments
- edge detection
- high accuracy
- significant improvement