Login / Signup
Policy Search in Continuous Action Domains: an Overview.
Olivier Sigaud
Freek Stulp
Published in:
CoRR (2018)
Keyphrases
</>
policy search
continuous action
reinforcement learning
continuous state
dynamic programming
partially observable markov decision processes
reward function
reinforcement learning algorithms
multi agent
steady state
markov decision processes
decision problems
function approximation
robot navigation
policy gradient