Concentration bounds for temporal difference learning with linear function approximation: the case of batch data and uniform sampling.

Prashanth L. A., Nathaniel Korda, Rémi Munos
Published in: Mach. Learn. (2021)
Keyphrases