Login / Signup
Online Sparse Temporal Difference Learning Based on Nested Optimization and Regularized Dual Averaging.
Tianheng Song
Dazi Li
Xin Xu
Published in:
IEEE Trans. Syst. Man Cybern. Syst. (2022)
Keyphrases
</>
temporal difference learning
function approximation
evaluation function
game playing
fixed point
approximate value iteration
reinforcement learning
least squares
markov decision process
machine learning
belief propagation
temporal difference