A survey of energy harvesting communications: models and offline optimal policies.
Yejun He, Xudong Cheng, Wei Peng, Gordon L. Stüber
Published in: IEEE Commun. Mag. (2015)
Keyphrases
- optimal policy
- Markov decision processes
- decision problems
- state space
- dynamic programming
- multistage
- finite horizon
- average reward reinforcement learning
- stochastic inventory control
- probabilistic model
- control policies
- serial inventory systems
- sufficient conditions
- machine learning
- long run
- infinite horizon
- dynamic programming algorithms