Login / Signup
More for Less: Safe Policy Improvement With Stronger Performance Guarantees.
Patrick Wienhöft
Marnix Suilen
Thiago D. Simão
Clemens Dubslaff
Christel Baier
Nils Jansen
Published in:
CoRR (2023)
Keyphrases
</>
significant improvement
neural network
real world
information retrieval
artificial intelligence
computer vision
special case
np hard
infinite horizon
asymptotically optimal
policy search