Counterfactual-Augmented Importance Sampling for Semi-Offline Policy Evaluation.
Shengpu Tang, Jenna Wiens
Published in: NeurIPS (2023)
Keyphrases
- importance sampling
- policy evaluation
- monte carlo
- variance reduction
- temporal difference
- markov chain
- kalman filter
- particle filter
- least squares
- markov decision processes
- function approximation
- approximate inference
- model free
- machine learning
- particle filtering
- posterior distribution
- markov chain monte carlo
- video sequences
- reinforcement learning
- feature selection
- computer vision