STDPG: A Spatio-Temporal Deterministic Policy Gradient Agent for Dynamic Routing in SDN.
Juan Chen, Zhiwen Xiao, Huanlai Xing, Penglin Dai, Shouxi Luo, Muhammad Azhar Iqbal
Published in: ICC (2020)
Keyphrases
- dynamic routing
- policy gradient
- single agent
- multi agent systems
- multi agent
- state action
- load balancing
- multiple agents
- function approximation
- gradient method
- mobile agents
- optimal control
- intelligent agents
- reinforcement learning
- travel time
- heavy traffic
- reinforcement learning algorithms
- dynamic environments
- agent technology
- transportation networks
- moving objects
- image sequences
- action selection
- machine learning
- partially observable Markov decision processes
- learning agent
- average reward
- routing algorithm
- reinforcement learning methods