Externally Valid Policy Evaluation Combining Trial and Observational Data.
Sofia EkDave ZachariahPublished in: CoRR (2023)
Keyphrases
- policy evaluation
- observational data
- least squares
- temporal difference
- reinforcement learning
- experimental data
- monte carlo
- causal discovery
- variance reduction
- function approximation
- latent variables
- causal models
- causal bayesian networks
- semi parametric
- model free
- causal relationships
- directed acyclic graph
- computational complexity
- statistical inference
- policy iteration
- markov decision processes
- sample size
- text mining
- dynamic programming