Proximal Deterministic Policy Gradient.
Marco Maggipinto
Gian Antonio Susto
Pratik Chaudhari
Published in:
CoRR (2020)
Keyphrases
policy gradient
actor critic
reinforcement learning
parametric optimization
optimal control
gradient method
function approximation
variance reduction
model free reinforcement learning
approximation methods
reinforcement learning algorithms
dynamic programming
average reward
monte carlo