Login / Signup
Entropic Risk Measure in Policy Search.
David Nass
Boris Belousov
Jan Peters
Published in:
IROS (2019)
Keyphrases
</>
policy search
reinforcement learning
continuous state
reinforcement learning algorithms
dynamic programming
continuous action
policy gradient
finite state
markov decision problems
neural network
machine learning
search space
linear programming
random walk
partially observable markov decision processes