Login / Signup

When Stochastic Rewards Reduce to Deterministic Rewards in Online Bipartite Matching.

Rajan Udwani
Published in: SOSA (2024)
Keyphrases
  • bipartite matching
  • fully observable
  • reinforcement learning
  • markov decision processes
  • online learning
  • maximum weight
  • multiple objectives
  • reward function