Login / Signup
Bandit Learning in Concave N-Person Games.
Mario Bravo
David S. Leslie
Panayotis Mertikopoulos
Published in:
NeurIPS (2018)
Keyphrases
</>
learning process
learning algorithm
active learning
learning systems
learning tasks
reinforcement learning
supervised learning
online learning
knowledge acquisition
unsupervised learning
learning agents
objective function
computer games
multiagent learning