Player-optimal Stable Regret for Bandit Learning in Matching Markets.
Fang Kong
Shuai Li
Published in: CoRR (2023)
Keyphrases
online learning
learning algorithm
worst case
markov chain
matching algorithm
active learning
supervised learning
learning systems
learning tasks
neural network
optimal solution
lower bound
optimal policy
learning problems
graph matching