Policy Search: Any Local Optimum Enjoys a Global Performance Guarantee.
Bruno Scherrer
Matthieu Geist
Published in:
CoRR (2013)
Keyphrases
policy search
reinforcement learning
continuous state
neural network
dynamic programming
continuous action
optimal solution
steady state
domain independent
utility function
finite state
partially observable Markov decision processes