Login / Signup
Optimal policies for two-user energy harvesting device networks with imperfect state-of-charge knowledge.
Davide Del Testa
Michele Zorzi
Published in:
ITA (2014)
Keyphrases
</>
optimal policy
markov decision processes
decision problems
reinforcement learning
dynamic programming
state space
multistage
finite horizon
knowledge base
finite state
long run
infinite horizon
serial inventory systems
machine learning
state dependent
bayesian reinforcement learning