Login / Signup
Player-optimal Stable Regret for Bandit Learning in Matching Markets.
Fang Kong
Shuai Li
Published in:
SODA (2023)
Keyphrases
</>
online learning
learning process
learning tasks
learning algorithm
active learning
reinforcement learning
dynamic programming
supervised learning
learning systems
learning problems
inductive inference
action selection
bandit problems