Login / Signup
Safe Policy Improvement with Soft Baseline Bootstrapping.
Kimia Nadjahi
Romain Laroche
Rémi Tachet des Combes
Published in:
CoRR (2019)
Keyphrases
</>
optimal policy
error reduction
data sets
neural network
real world
information retrieval
website
video sequences
information extraction
infinite horizon
relation extraction