Login / Signup

Double Clipping: Less-Biased Variance Reduction in Off-Policy Evaluation.

Jan Malte LichtenbergAlexander BuchholzGiuseppe Di BenedettoMatteo RuffiniBen London
Published in: CoRR (2023)
Keyphrases
  • variance reduction
  • policy evaluation
  • monte carlo
  • sample size
  • importance sampling
  • confidence intervals
  • policy gradient
  • computational complexity