Proximal Gradient Temporal Difference Learning Algorithms.
Bo Liu
Ji Liu
Mohammad Ghavamzadeh
Sridhar Mahadevan
Marek Petrik
Published in:
IJCAI (2016)
Keyphrases
temporal difference learning algorithms
function approximation
temporal difference learning
asymptotic properties
artificial neural networks
reinforcement learning
linear combination
approximation error