Realtime Spectrum Monitoring via Reinforcement Learning - A Comparison Between Q-Learning and Heuristic Methods.
Tobias BraunTobias KorzyzkowskeLarissa PutzarJan MietznerPeter A. HoeherPublished in: CoRR (2023)
Keyphrases
- heuristic methods
- reinforcement learning
- real time
- function approximation
- reinforcement learning algorithms
- model free
- optimal solution
- tabu search
- temporal difference learning
- state space
- efficient solutions
- monitoring system
- reinforcement learning methods
- optimal policy
- action selection
- temporal difference
- multi agent
- stochastic approximation
- multi agent reinforcement learning
- eligibility traces
- exact algorithms
- learning algorithm
- dynamic programming
- function approximators
- markov decision processes
- activity monitoring
- state action
- optimal control
- rl algorithms
- continuous state and action spaces
- learning capabilities
- learning agent
- learning problems
- variable neighborhood search
- neighborhood search
- multiagent learning
- quality of service
- situation assessment