Ventral striatum uses a temporal difference rule for prediction during neuroprosthetic control.

N. Vendrell-Llopis, Aaron C. Koralek, R. Costa, Jose M. Carmena

Published in: NER (2019)

Keyphrases

temporal difference
reinforcement learning
evaluation function
TD learning
function approximation
action selection
step size
model free
temporal difference learning
Monte Carlo
control system
control problems
temporal difference methods
policy evaluation
reinforcement learning algorithms
supervised learning
cost function
decision trees
optimization algorithm
feature vectors
actor critic
decision making