Login / Signup
Safe Policy Improvement with Soft Baseline Bootstrapping.
Kimia Nadjahi
Romain Laroche
Rémi Tachet des Combes
Published in:
ECML/PKDD (3) (2019)
Keyphrases
</>
error reduction
neural network
information extraction
optimal policy
relative improvement
artificial intelligence
management system
relation extraction
clustering algorithm
similarity measure
probabilistic model
asymptotically optimal
policy makers
policy making
policy search