Login / Signup
Mitigation of Scheduling Violations in Time-Sensitive Networking using Deep Deterministic Policy Gradient.
Boyang Zhou
Liang Cheng
Published in:
FlexNets@SIGCOMM (2021)
Keyphrases
</>
policy gradient
parametric optimization
reinforcement learning
actor critic
function approximation
model free reinforcement learning
optimal control
gradient method
approximation methods
variance reduction
reinforcement learning algorithms
learning algorithm
multi agent
single agent