Improved policy representation and policy search for proactive content caching in wireless networks.
Samuel O. SomuyiwaAndrás GyörgyDeniz GündüzPublished in: WiOpt (2017)
Keyphrases
- wireless networks
- policy search
- reinforcement learning
- wireless communication
- dynamic programming
- continuous state
- mobile computing
- ad hoc networks
- reinforcement learning algorithms
- multimedia content
- multimedia services
- policy gradient
- multimedia
- reward function
- markov decision problems
- optimal policy
- mobile devices
- robot navigation