Infinite-Horizon Policy-Gradient Estimation

Peter L. Bartlett, Jonathan Baxter

Published in: CoRR (2011)

Keyphrases