• search
    search
  • reviewers
    reviewers
  • feeds
    feeds
  • assignments
    assignments
  • settings
  • logout

Offline Policy Comparison with Confidence: Benchmarks and Baselines.

Anurag KoulMariano PhielippAlan Fern
Published in: CoRR (2022)
Keyphrases
  • confidence level
  • optimal policy
  • real time
  • statistical analysis
  • machine learning
  • high confidence
  • database systems
  • markov decision processes
  • confidence measure
  • confidence values