Optimal policies for control of peers in online multimedia services.
Young Myoung KoJean-François ChamberlandNatarajan GautamPublished in: CDC (2007)
Keyphrases
- optimal policy
- multimedia services
- markov decision processes
- control policies
- finite horizon
- decision problems
- reinforcement learning
- state space
- dynamic programming
- infinite horizon
- long run
- online learning
- wireless networks
- average reward reinforcement learning
- dynamic programming algorithms
- peer to peer
- control strategy
- serial inventory systems
- multimedia
- policy iteration
- control policy
- average reward
- quality of service
- search space
- service delivery
- average cost
- real time