Online Matching with Stochastic Rewards: Optimal Competitive Ratio via Path-Based Formulation.

Vineet Goyal Rajan Udwani

Published in: Oper. Res. (2023)

Keyphrases

competitive ratio
online algorithms
online learning
single machine
monte carlo sampling
lower bound
average case
optimal strategy
reward function
worst case
learning algorithm
initially unknown
processing times
reinforcement learning
convergence rate
asymptotically optimal
markov decision processes
machine learning
decision makers
upper bound
scheduling problem
probabilistic model
search algorithm