Semi-Markov Reinforcement Learning for Stochastic Resource Collection.
Sebastian SchmollMatthias SchubertPublished in: IJCAI (2020)
Keyphrases
- semi markov
- reinforcement learning
- direct policy search
- stochastic approximation
- learning automata
- conditional random fields
- function approximation
- resource allocation
- continuous state
- document collections
- continuous state spaces
- state space
- learning algorithm
- machine learning
- control policies
- resource selection
- model free
- resource management
- database
- dynamic programming
- resource constraints
- web resources
- temporal difference
- learning process
- multi agent
- transfer learning
- approximate dynamic programming
- optimal policy
- higher order