Login / Signup
Is Temporal Difference Learning Optimal? An Instance-Dependent Analysis.
Koulik Khamaru
Ashwin Pananjady
Feng Ruan
Martin J. Wainwright
Michael I. Jordan
Published in:
SIAM J. Math. Data Sci. (2021)
Keyphrases
</>
temporal difference learning
reinforcement learning
dynamic programming
machine learning
optimal solution
learning environment
learning process
fixed point