When Stochastic Rewards Reduce to Deterministic Rewards in Online Bipartite Matching.
Rajan Udwani
Published in: SOSA (2024)
Keyphrases
bipartite matching
fully observable
reinforcement learning
Markov decision processes
online learning
maximum weight
multiple objectives
reward function