Player-optimal Stable Regret for Bandit Learning in Matching Markets.

Fang Kong Shuai Li

Published in: SODA (2023)

Keyphrases

online learning
learning process
learning tasks
learning algorithm
active learning
reinforcement learning
dynamic programming
supervised learning
learning systems
learning problems
inductive inference
action selection
bandit problems