Joint Minimization of Monitoring Cost and Delay in Overlay Networks: Optimal Policies with a Markovian Approach.
Sandrine VatonOlivier BrunMaxime MouchetPablo BelzarenaIsabel AmigoBalakrishna J. PrabhuThierry ChonavelPublished in: J. Netw. Syst. Manag. (2019)
Keyphrases
- optimal policy
- bandwidth consumption
- overlay network
- average cost
- markov decision processes
- peer to peer
- finite horizon
- structured peer to peer
- decision problems
- state space
- long run
- lost sales
- finite state
- multistage
- reinforcement learning
- peer to peer file sharing
- state dependent
- average reward
- infinite horizon
- expected cost
- serial inventory systems
- dynamic programming
- average reward reinforcement learning
- markov decision process
- publish subscribe
- dynamic programming algorithms
- sufficient conditions
- network topology
- objective function
- initial state
- real time
- long run average cost
- cost function
- communication cost
- total cost
- demand distributions
- finite number
- lead time
- holding cost
- policy iteration
- partially observable markov decision processes