Exploration Control in Reinforcement Learning using Optimistic Model Selection.
Jeremy L. Wyatt
Published in: ICML (2001)
Keyphrases
- model selection
- reinforcement learning
- cross validation
- hyperparameters
- machine learning
- bayesian learning
- parameter estimation
- regression model
- action selection
- sample size
- statistical inference
- selection criterion
- gaussian process
- motion segmentation
- information criterion
- statistical learning
- parameter determination
- model selection criteria
- error estimation
- feature selection
- generalization error
- mixture model
- meta learning
- leave one out cross validation
- learning algorithm
- variable selection
- state space
- generalization bounds
- bayesian information criterion
- marginal likelihood
- automatic model selection
- posterior distribution
- statistical model
- semi supervised
- similarity measure
- decision trees