Login / Signup
Average Reward Mode Selection in D2D Communication with Deadline Constraint.
Jun Xu
Chengcheng Guo
Xin Li
Published in:
ICCT (2021)
Keyphrases
</>
average reward
mode selection
markov decision processes
long run
optimal policy
communication systems
discounted reward
complexity reduction
resource constraints
reinforcement learning
motion estimation
policy iteration
rate distortion
communication networks
intra prediction