An MDP-based application oriented optimal policy for wireless sensor networks.
Arslan MunirAnn Gordon-RossPublished in: CODES+ISSS (2009)
Keyphrases
- optimal policy
- application oriented
- wireless sensor networks
- markov decision processes
- markov decision process
- decision problems
- state space
- finite horizon
- long run
- finite state
- reinforcement learning
- dynamic programming
- infinite horizon
- average reward
- state dependent
- multistage
- bayesian reinforcement learning
- sufficient conditions
- dynamic programming algorithms
- control policies
- discount factor
- machine learning
- initial state
- markov decision problems
- probability distribution
- serial inventory systems