Off-Policy Correction for Deep Deterministic Policy Gradient Algorithms via Batch Prioritized Experience Replay.

Dogan C. Cicek, Enes Duran, Baturay Saglam, Furkan B. Mutlu, Suleyman S. Kozat
Published in: ICTAI (2021)
Keyphrases
  • policy gradient
  • learning algorithm
  • computational complexity
  • machine learning algorithms
  • gradient ascent