Login / Signup
A K-fold Method for Baseline Estimation in Policy Gradient Algorithms.
Nithyanand Kota
Abhishek Mishra
Sunil Srinivasa
Xi Chen
Pieter Abbeel
Published in:
CoRR (2017)
Keyphrases
</>
computational complexity
gradient ascent
learning algorithm
dynamic programming
optimization problems
convergence rate
linear regression
natural gradient