A UoI-Optimal Policy for Timely Status Updates with Resource Constraint.
Lehan WangJingzhou SunYuxuan SunSheng ZhouZhisheng NiuPublished in: Entropy (2021)
Keyphrases
- optimal policy
- markov decision processes
- state space
- decision problems
- infinite horizon
- finite horizon
- finite state
- state dependent
- reinforcement learning
- dynamic programming
- long run
- multistage
- resource allocation
- sufficient conditions
- lost sales
- markov decision process
- average cost
- serial inventory systems
- bayesian reinforcement learning
- policy iteration
- inventory level
- average reward
- steady state
- learning algorithm
- partially observable
- stochastic demand
- optimal pricing
- markov chain
- stochastic inventory control