Self-guided deep deterministic policy gradient with multi-actor.

Hongming Chen, Quan Liu, Shan Zhong
Published in: Neural Comput. Appl. (2021)
Keyphrases
  • policy gradient
  • reinforcement learning
  • parametric optimization
  • actor critic
  • model free reinforcement learning
  • gradient method
  • cost function
  • Markov decision processes
  • optimal control
  • average reward