Reinforcement Learning Explains Conditional Cooperation and Its Moody Cousin.
Takahiro EzakiYutaka HoritaMasanori TakezawaNaoki MasudaPublished in: PLoS Comput. Biol. (2016)
Keyphrases
- reinforcement learning
- multi agent
- cooperative
- optimal policy
- state space
- function approximation
- robotic control
- reinforcement learning algorithms
- multi agent systems
- model free
- learning algorithm
- temporal difference
- action selection
- dynamic programming
- temporal difference learning
- distributed problem solving
- learning problems
- markov decision processes
- supervised learning
- conditional logic
- random field model
- data sets