Login / Signup

When Stochastic Rewards Reduce to Deterministic Rewards in Online Bipartite Matching.

Rajan Udwani
Published in: CoRR (2023)
Keyphrases
  • fully observable
  • bipartite matching
  • reinforcement learning
  • markov decision processes
  • state space
  • online learning
  • genetic algorithm
  • maximum weight