On-Policy vs. Off-Policy Deep Reinforcement Learning for Resource Allocation in Open Radio Access Network.
Nessrine HammamiKim Khoa NguyenPublished in: WCNC (2022)
Keyphrases
- resource allocation
- access network
- reinforcement learning
- bandwidth allocation
- optimal policy
- optimal resource allocation
- state space
- resource management
- reward function
- resource allocation problems
- wireless communication
- mobile terminals
- allocation problems
- markov decision processes
- learning algorithm
- allocation scheme
- cellular networks
- reinforcement learning algorithms
- scarce resources
- learning agents
- wireless networks
- dynamic programming
- resource allocation decisions
- multi agent
- real time