A Q-decomposition and bounded RTDP approach to resource allocation.
Pierrick Plamondon, Brahim Chaib-draa, Abder Rezak Benaskeur
Published in: AAMAS (2007)
Keyphrases
- resource allocation
- heuristic search
- Markov decision processes
- dynamic programming
- upper bound
- control theory
- larger problems
- initial state
- reinforcement learning methods
- real time dynamic programming
- lower bound
- game theory
- resource allocation and scheduling
- resource allocation problems
- dynamical systems
- state space
- allocation strategies
- allocation problems
- scarce resources
- optimal resource allocation
- reinforcement learning
- differential equations
- beam search
- optimal policy
- machine learning
- domain independent
- dynamic environments
- search algorithm