Dynamic Spectrum Anti-Jamming With Reinforcement Learning Based on Value Function Approximation.
Xinyu ZhuYang HuangShaoyu WangQihui WuXiaohu GeYuan LiuZhen GaoPublished in: IEEE Wirel. Commun. Lett. (2023)
Keyphrases
- reinforcement learning
- temporal difference
- temporal difference learning
- state space
- multi agent
- dynamic environments
- machine learning
- approximate dynamic programming
- learning algorithm
- optimal policy
- basis functions
- markov games
- database
- function approximators
- control problems
- reinforcement learning algorithms
- function approximation
- genetic algorithm
- data sets