Bandit Learning in Concave N-Person Games.

Mario Bravo David S. Leslie Panayotis Mertikopoulos

Published in: NeurIPS (2018)

Keyphrases

learning process
learning algorithm
active learning
learning systems
learning tasks
reinforcement learning
supervised learning
online learning
knowledge acquisition
unsupervised learning
learning agents
objective function
computer games
multiagent learning