A Q-decomposition and bounded RTDP approach to resource allocation.

Pierrick Plamondon Brahim Chaib-draa Abder Rezak Benaskeur

Published in: AAMAS (2007)

Keyphrases

resource allocation
heuristic search
markov decision processes
dynamic programming
upper bound
control theory
larger problems
initial state
reinforcement learning methods
real time dynamic programming
lower bound
game theory
resource allocation and scheduling
resource allocation problems
dynamical systems
state space
allocation strategies
allocation problems
scarce resources
optimal resource allocation
reinforcement learning
differential equations
beam search
optimal policy
machine learning
domain independent
dynamic environments
search algorithm