Statistical Bootstrapping for Uncertainty Estimation in Off-Policy Evaluation.
Ilya Kostrikov, Ofir Nachum
Published in: CoRR (2020)
Keyphrases
- policy evaluation
- semi-parametric
- statistical inference
- least squares
- reinforcement learning
- temporal difference
- Monte Carlo
- confidence intervals
- statistical analysis
- model free
- variance reduction
- importance sampling
- function approximation
- state space
- evaluation function
- Markov decision processes
- parameter estimation
- upper bound