A survey of energy harvesting communications: models and offline optimal policies.

Yejun He Xudong Cheng Wei Peng Gordon L. Stüber

Published in: IEEE Commun. Mag. (2015)

Keyphrases

optimal policy
markov decision processes
decision problems
state space
dynamic programming
multistage
finite horizon
average reward reinforcement learning
stochastic inventory control
probabilistic model
control policies
serial inventory systems
sufficient conditions
machine learning
long run
infinite horizon
dynamic programming algorithms