Optimal Policies for Status Update Generation in a Wireless System with Heterogeneous Traffic.
George StamatakisNikolaos PappasApostolos TraganitisPublished in: CoRR (2018)
Keyphrases
- optimal policy
- markov decision processes
- decision problems
- long run
- finite horizon
- state space
- dynamic programming
- reinforcement learning
- voice and data services
- wireless networks
- multistage
- infinite horizon
- finite state
- average reward
- average cost
- wireless communication
- dynamic programming algorithms
- average reward reinforcement learning
- serial inventory systems
- initial state
- partially observable markov decision processes
- sufficient conditions
- state dependent
- search space
- policy iteration
- markov decision process
- total reward
- discounted reward
- semi markov decision processes