Dynamic spectrum access and sharing through actor-critic deep reinforcement learning.
Liang DongYuchen QianYuan XingPublished in: EURASIP J. Wirel. Commun. Netw. (2022)
Keyphrases
- actor critic
- reinforcement learning
- function approximation
- temporal difference
- reinforcement learning algorithms
- policy gradient
- approximate dynamic programming
- neuro fuzzy
- optimal control
- gradient method
- markov decision processes
- policy iteration
- average reward
- learning algorithm
- model free
- multi agent
- linear program
- dynamic environments
- supervised learning
- control problems
- dynamic programming
- temporal difference learning
- reinforcement learning methods
- optimal solution
- decision making