When Stochastic Rewards Reduce to Deterministic Rewards in Online Bipartite Matching.

Published in: SOSA (2024)

Keyphrases

bipartite matching
fully observable
reinforcement learning
markov decision processes
online learning
maximum weight
multiple objectives
reward function