Ventral striatum uses a temporal difference rule for prediction during neuroprosthetic control.
N. Vendrell-LlopisAaron C. KoralekR. CostaJose M. CarmenaPublished in: NER (2019)
Keyphrases
- temporal difference
- reinforcement learning
- evaluation function
- td learning
- function approximation
- action selection
- step size
- model free
- temporal difference learning
- monte carlo
- control system
- control problems
- temporal difference methods
- policy evaluation
- reinforcement learning algorithms
- supervised learning
- cost function
- decision trees
- optimization algorithm
- feature vectors
- actor critic
- decision making