When Stochastic Rewards Reduce to Deterministic Rewards in Online Bipartite Matching.

Published in: CoRR (2023)

Keyphrases

fully observable
bipartite matching
reinforcement learning
markov decision processes
state space
online learning
genetic algorithm
maximum weight