Login / Signup
A Better Resource Allocation Algorithm with Semi-Bandit Feedback.
Yuval Dagan
Koby Crammer
Published in:
CoRR (2018)
Keyphrases
</>
resource allocation
dynamic programming
learning algorithm
objective function
optimal solution
linear programming
worst case
resource allocation problems
reinforcement learning
computational complexity
dynamic environments
path finding
optimal resource allocation