Multiple-policy High-confidence Policy Evaluation.
Christoph Dann
Mohammad Ghavamzadeh
Teodor V. Marinov
Published in:
AISTATS (2023)
Keyphrases
high confidence
policy evaluation
association rules
least squares
temporal difference
reinforcement learning
policy iteration
data sets
high dimensional
optimal policy
Monte Carlo
data mining
cost function
model free
variance reduction