Adaptive Experience Selection for Policy Gradient.
Saad Mohamad
Giovanni Montana
Published in:
CoRR (2020)
Keyphrases
policy gradient
actor critic
reinforcement learning
optimal control
reinforcement learning algorithms
machine learning
heuristic search
neuro fuzzy
function approximation
adaptive control
approximation methods
variance reduction