Variance Reduction for Policy Gradient with Action-Dependent Factorized Baselines.

Published in: CoRR (2018)

Keyphrases