Dynamic Spectrum Sharing Based on Federated Learning and Multi-Agent Actor-Critic Reinforcement Learning.
Tongtong YangWensheng ZhangYulian BoJian SunCheng-Xiang WangPublished in: IWCMC (2023)
Keyphrases
- reinforcement learning
- actor critic
- multi agent
- temporal difference
- learning algorithm
- policy gradient
- learning process
- function approximation
- supervised learning
- optimal control
- reinforcement learning algorithms
- approximate dynamic programming
- policy iteration
- learning problems
- state space
- function approximators
- temporal difference learning
- control system
- partially observable markov decision processes
- transfer learning
- rl algorithms
- dynamic environments
- fuzzy sets
- machine learning