Accountable Off-Policy Evaluation With Kernel Bellman Statistics.
Yihao Feng, Tongzheng Ren, Ziyang Tang, Qiang Liu
Published in: ICML (2020)
Keyphrases
- policy evaluation
- least squares
- statistical inference
- temporal difference
- Monte Carlo
- reinforcement learning
- model-free
- Markov decision processes
- semi-parametric
- support vector
- variance reduction
- kernel function
- policy iteration
- linear program
- kernel methods
- function approximation
- machine learning
- input space
- confidence intervals