Login / Signup
Off-Policy Correction for Deep Deterministic Policy Gradient Algorithms via Batch Prioritized Experience Replay.
Dogan C. Cicek
Enes Duran
Baturay Saglam
Furkan B. Mutlu
Suleyman S. Kozat
Published in:
ICTAI (2021)
Keyphrases
</>
policy gradient
learning algorithm
computational complexity
machine learning algorithms
gradient ascent