A Hybrid Stochastic Policy Gradient Algorithm for Reinforcement Learning.

Published in: CoRR (2020)

Keyphrases