Counterfactual Multi-Agent Policy Gradients.
Jakob N. Foerster, Gregory Farquhar, Triantafyllos Afouras, Nantas Nardelli, Shimon Whiteson
Published in: AAAI (2018)
Keyphrases
- multi agent
- optimal policy
- reinforcement learning
- multi agent systems
- cooperative
- multiagent systems
- partially observable Markov decision processes
- American football
- causal reasoning
- multiple agents
- image gradient
- decision problems
- autonomous agents
- gradient information
- policy making
- cooperative agents
- multi agent architecture
- oriented programming
- case study