Login / Signup
An Imperfect Dopaminergic Error Signal Can Drive Temporal-Difference Learning.
Wiebke Potjans
Markus Diesmann
Abigail Morrison
Published in:
PLoS Comput. Biol. (2011)
Keyphrases
</>
temporal difference learning
function approximation
fixed point
game playing
reinforcement learning
evaluation function
approximate value iteration
temporal difference
reinforcement learning algorithms
least squares
linear combination
model free