Login / Signup
A Task-Oriented Hybrid Routing Approach based on Deep Deterministic Policy Gradient.
Zongxuan Sha
Ru Huo
Chuang Sun
Shuo Wang
Tao Huang
Published in:
Comput. Commun. (2023)
Keyphrases
</>
policy gradient
actor critic
model free reinforcement learning
parametric optimization
reinforcement learning algorithms
gradient method
reinforcement learning
function approximation
machine learning
state space
optimal control