Online Matching with Stochastic Rewards: Optimal Competitive Ratio via Path Based Formulation.
Vineet Goyal, Rajan Udwani
Published in: EC (2020)
Keyphrases
- competitive ratio
- online algorithms
- lower bound
- Monte Carlo sampling
- single machine
- online learning
- optimal strategy
- average case
- initially unknown
- reward function
- processing times
- worst case
- convergence rate
- learning algorithm
- scheduling problem
- dynamic programming
- Monte Carlo
- upper bound
- search algorithm
- optimal policy
- decision boundary
- uniform distribution
- Markov decision processes
- asymptotically optimal
- simulated annealing
- nearest neighbor
- state space