Login / Signup

Regularized Least Squares Temporal Difference Learning with Nested ℓ2 and ℓ1 Penalization.

Matthew W. HoffmanAlessandro LazaricMohammad GhavamzadehRémi Munos
Published in: EWRL (2011)
Keyphrases