Stochastic Game Based Cooperative Alternating Q-Learning Caching in Dynamic D2D Networks.
Tiankui ZhangXinyuan FangZiduan WangYuanwei LiuArumugam NallanathanPublished in: IEEE Trans. Veh. Technol. (2021)
Keyphrases
- cooperative
- multi agent
- dynamic content
- dynamic networks
- proxy cache
- reinforcement learning
- stochastic approximation
- query processing
- learning algorithm
- multi agent reinforcement learning
- dynamic environments
- social networks
- pose estimation
- monte carlo
- game theory
- learning systems
- temporal difference learning
- learning process
- neural network