Reinforcement learning based monotonic policy for online resource allocation.
Pankaj Mishra, Ahmed Moustafa
Published in: Future Gener. Comput. Syst. (2023)
Keyphrases
- resource allocation
- reinforcement learning
- optimal policy
- optimal resource allocation
- policy search
- resource management
- allocation problems
- resource allocation problems
- Markov decision process
- online learning
- state space
- action selection
- allocate resources
- allocation strategies
- resource availability
- resource allocation and scheduling
- function approximation
- dynamic resource allocation
- social welfare
- scarce resources
- resource usage
- resource requirements
- control policy
- bidding strategies
- dynamic programming
- learning process
- resource consumption
- exploration exploitation tradeoff
- decision problems
- multi agent