Reinforcement-Learning-Based UAV Utility Maximization for Offloading Cellular Communication Systems.
Abhishek Mondal, Deepak Mishra, Ganesh Prasad, George C. Alexandropoulos, Ashraf Hossain
Published in: ICC Workshops (2023)
Keyphrases
- communication systems
- utility maximization
- reinforcement learning
- utility function
- information processing systems
- computer systems
- stochastic gradient
- function approximation
- blind equalization
- sample path
- multiple access
- state space
- policy iteration
- machine learning
- model free
- Markov decision processes
- data processing
- temporal difference
- dynamic programming
- human computer interaction
- optimal control
- data management
- Bayesian networks