Joint power and hopping rate adaption against follower jammer based on deep reinforcement learning.
Ruidong WangShilian WangWei ZhangPublished in: Trans. Emerg. Telecommun. Technol. (2023)
Keyphrases
- energy dissipation
- reinforcement learning
- function approximation
- state space
- temporal difference
- optimal policy
- supervised learning
- robotic control
- multi agent reinforcement learning
- markov decision processes
- machine learning
- learning algorithm
- neural network
- data sets
- real time
- learning process
- multi agent systems
- power consumption
- learning problems
- decision making
- model free