Derivatives of Logarithmic Stationary Distributions for Policy Gradient Reinforcement Learning.

Published in: Neural Comput. (2010)

Keyphrases