Online Matching with Stochastic Rewards: Optimal Competitive Ratio via Path-Based Formulation.
Vineet GoyalRajan UdwaniPublished in: Oper. Res. (2023)
Keyphrases
- competitive ratio
- online algorithms
- online learning
- single machine
- monte carlo sampling
- lower bound
- average case
- optimal strategy
- reward function
- worst case
- learning algorithm
- initially unknown
- processing times
- reinforcement learning
- convergence rate
- asymptotically optimal
- markov decision processes
- machine learning
- decision makers
- upper bound
- scheduling problem
- probabilistic model
- search algorithm