Login / Signup

Alleviating the estimation bias of deep deterministic policy gradient via co-regularization.

Yao LiYuhui WangYaozhong GanXiaoyang Tan
Published in: Pattern Recognit. (2022)
Keyphrases
  • policy gradient
  • variance reduction
  • actor critic
  • function approximation
  • approximation methods