A Hybrid Stochastic Policy Gradient Algorithm for Reinforcement Learning.

Published in: AISTATS (2020)

Keyphrases