Analysis and improvement of policy gradient estimation.

Tingting Zhao, Hirotaka Hachiya, Gang Niu, Masashi Sugiyama
Published in: Neural Networks (2012)
Keyphrases
  • gradient estimation
  • support vector machine
  • long run