Variance Reduction for Policy Gradient with Action-Dependent Factorized Baselines.

Published in: ICLR (2018)

Keyphrases