Self-guided deep deterministic policy gradient with multi-actor.
Hongming Chen
Quan Liu
Shan Zhong
Published in:
Neural Comput. Appl. (2021)
Keyphrases
policy gradient
reinforcement learning
parametric optimization
actor-critic
model-free reinforcement learning
gradient method
cost function
Markov decision processes
optimal control
average reward