Pessimistic Model Selection for Offline Deep Reinforcement Learning.
Chao-Han Huck YangZhengling QiYifan CuiPin-Yu ChenPublished in: UAI (2023)
Keyphrases
- model selection
- reinforcement learning
- cross validation
- machine learning
- hyperparameters
- sample size
- parameter estimation
- regression model
- mixture model
- state space
- statistical learning
- bayesian learning
- variable selection
- error estimation
- meta learning
- unsupervised learning
- feature selection
- model selection criteria
- generalization bounds
- selection criterion
- gaussian process
- statistical inference
- information criterion
- generalization error
- motion segmentation
- learning problems
- parameter determination
- probabilistic model
- leave one out cross validation
- automatic model selection
- marginal likelihood
- expectation maximization
- learning algorithm