Shapley Q-Value: A Local Reward Approach to Solve Global Reward Games.

Jianhong Wang Yuan Zhang Tae-Kyun Kim Yunjie Gu

Published in: AAAI (2020)

Keyphrases

reinforcement learning
game theory
neural network
decision making
computer games
global information
reward function
nash equilibria
bandit problems
data sets
machine learning
cooperative
long run
learning agent