Login / Signup
Deep Deterministic Policy Gradient (DDPG)-Based Resource Allocation Scheme for NOMA Vehicular Communications.
Yi-Han Xu
Cheng-Cheng Yang
Min Hua
Wen Zhou
Published in:
IEEE Access (2020)
Keyphrases
</>
allocation scheme
policy gradient
parametric optimization
function approximation
reinforcement learning
evaluation method
gradient method
actor critic
random access
optimal control
single agent
approximation methods
variance reduction
reinforcement learning algorithms