A Better Resource Allocation Algorithm with Semi-Bandit Feedback.

Yuval Dagan Koby Crammer

Published in: CoRR (2018)

Keyphrases

resource allocation
dynamic programming
learning algorithm
objective function
optimal solution
linear programming
worst case
resource allocation problems
reinforcement learning
computational complexity
dynamic environments
path finding
optimal resource allocation