Login / Signup
Alleviating the estimation bias of deep deterministic policy gradient via co-regularization.
Yao Li
Yuhui Wang
Yaozhong Gan
Xiaoyang Tan
Published in:
Pattern Recognit. (2022)
Keyphrases
</>
policy gradient
variance reduction
actor critic
function approximation
approximation methods