Pessimistic Model Selection for Offline Deep Reinforcement Learning.
Chao-Han Huck YangZhengling QiYifan CuiPin-Yu ChenPublished in: CoRR (2021)
Keyphrases
- model selection
- reinforcement learning
- cross validation
- machine learning
- sample size
- regression model
- hyperparameters
- parameter estimation
- mixture model
- statistical learning
- error estimation
- bayesian learning
- state space
- meta learning
- feature selection
- statistical inference
- generalization error
- variable selection
- unsupervised learning
- motion segmentation
- gaussian process
- bayesian methods
- selection criterion
- information criterion
- model selection criteria
- real world
- supervised learning
- image segmentation
- decision trees
- leave one out cross validation
- marginal likelihood
- bayesian model selection