Analysis and improvement of policy gradient estimation.
Tingting Zhao
Hirotaka Hachiya
Gang Niu
Masashi Sugiyama
Published in:
Neural Networks (2012)
Keyphrases
gradient estimation
support vector machine
long run