Login / Signup
Shapley Q-Value: A Local Reward Approach to Solve Global Reward Games.
Jianhong Wang
Yuan Zhang
Tae-Kyun Kim
Yunjie Gu
Published in:
AAAI (2020)
Keyphrases
</>
reinforcement learning
game theory
neural network
decision making
computer games
global information
reward function
nash equilibria
bandit problems
data sets
machine learning
cooperative
long run
learning agent