Policy search in continuous action domains: An overview.
Olivier Sigaud
Freek Stulp
Published in:
Neural Networks (2019)
Keyphrases
policy search
continuous action
continuous state
reinforcement learning
dynamic programming
reinforcement learning algorithms
partially observable Markov decision processes
search space
finite state
policy gradient
reward function
Monte Carlo methods