Scheduling for Mobile Edge Computing with Random User Arrivals: An Approximate MDP and Reinforcement Learning Approach.
Shanfeng Huang, Bojie Lv, Rui Wang, Kaibin Huang
Published in: CoRR (2020)
Keyphrases
- reinforcement learning
- markov decision processes
- optimal policy
- state space
- markov decision process
- end users
- policy evaluation
- dynamic programming
- user preferences
- action sets
- learning algorithm
- function approximation
- edge detection
- scheduling problem
- mobile devices
- mobile learning
- user interface
- mobile terminals
- user experience
- user interaction
- state dependent
- reinforcement learning algorithms
- optimal control
- state and action spaces
- policy iteration
- recommender systems
- edge information
- relevance feedback
- utility function
- resource allocation
- mobile phone