Login / Signup

Fast gradient-descent methods for temporal-difference learning with linear function approximation.

Richard S. SuttonHamid Reza MaeiDoina PrecupShalabh BhatnagarDavid SilverCsaba SzepesváriEric Wiewiora
Published in: ICML (2009)
Keyphrases