TTL-Based Cache Utility Maximization Using Deep Reinforcement Learning.
Chunglae Cho, Seungjae Shin, Hongseok Jeon, Seunghyun Yoon
Published in: GLOBECOM (2021)
Keyphrases
- utility maximization
- reinforcement learning
- utility function
- stochastic gradient
- function approximation
- Markov decision processes
- state space
- machine learning
- dynamic programming
- sample path
- decision makers
- model free
- policy iteration
- learning algorithm
- supervised learning
- temporal difference
- Bayesian networks
- single period