UAV Assisted Cooperative Caching on Network Edge Using Multi-Agent Actor-Critic Reinforcement Learning.
Sadman ArafAdittya Soukarjya SahaSadia Hamid KaziNguyen Hoang TranMd. Golam Rabiul AlamPublished in: IEEE Trans. Veh. Technol. (2023)
Keyphrases
- reinforcement learning
- multi agent
- actor critic
- cooperative
- function approximation
- temporal difference
- reinforcement learning algorithms
- approximate dynamic programming
- policy gradient
- multi agent systems
- model free
- neuro fuzzy
- optimal control
- gradient method
- markov decision processes
- state space
- dynamic programming
- optimal policy
- rl algorithms
- supervised learning
- control algorithm
- linear program
- markov decision process
- average reward
- natural actor critic