Login / Signup
A Hybrid Stochastic Policy Gradient Algorithm for Reinforcement Learning.
Nhan H. Pham
Lam M. Nguyen
Dzung T. Phan
Phuong Ha Nguyen
Marten van Dijk
Quoc Tran-Dinh
Published in:
CoRR (2020)
Keyphrases
</>
reinforcement learning
dynamic programming
policy gradient
learning algorithm
function approximation
actor critic
optimal solution
computational complexity
worst case
model free reinforcement learning
np hard
mathematical model
monte carlo
function approximators