Sign in

Derivatives of Logarithmic Stationary Distributions for Policy Gradient Reinforcement Learning.

Tetsuro MorimuraEiji UchibeJunichiro YoshimotoJan PetersKenji Doya
Published in: Neural Comput. (2010)
Keyphrases