Login / Signup
Clipped Action Policy Gradient.
Yasuhiro Fujita
Shin-ichi Maeda
Published in:
ICML (2018)
Keyphrases
</>
policy gradient
state action
parametric optimization
actor critic
reinforcement learning
model free reinforcement learning
gradient method
optimal control
reinforcement learning algorithms
function approximation
policy search
markov decision process
approximation methods
variance reduction