Understanding the Curse of Horizon in Off-Policy Evaluation via Conditional Importance Sampling.
Yao LiuPierre-Luc BaconEmma BrunskillPublished in: ICML (2020)
Keyphrases
- importance sampling
- policy evaluation
- monte carlo
- variance reduction
- markov chain
- kalman filter
- temporal difference
- least squares
- particle filter
- high dimensional
- approximate inference
- markov chain monte carlo
- high dimensional data
- optimal policy
- markov decision processes
- image segmentation
- appearance model
- visual tracking
- policy iteration