Learning Upper-Level Policy using Importance Sampling-based Policy Search Method.
Jose Pastor
Henry Díaz
Leopoldo Armesto
Alicia Esparza
Antonio Sala
Published in:
ICSC (2018)
Keyphrases
policy search
reinforcement learning
learning process
dynamic programming
continuous state
learning algorithm
neural network
artificial intelligence
higher level
dynamical systems
optimal control
action selection
policy gradient