Login / Signup
Safe Exploration for Efficient Policy Evaluation and Comparison.
Runzhe Wan
Branislav Kveton
Rui Song
Published in:
CoRR (2022)
Keyphrases
</>
least squares
policy evaluation
support vector
neural network
artificial neural networks
optical flow
upper bound