Qualitative Measurements of Policy Discrepancy for Return-Based Deep Q-Network.
Wenjia Meng
Qian Zheng
Long Yang
Pengfei Li
Gang Pan
Published in:
IEEE Trans. Neural Networks Learn. Syst. (2020)
Keyphrases
neural network
communication networks
machine learning
wireless sensor networks
peer-to-peer
computer networks
reinforcement learning
multi-agent
optimal policy
network structure
complex networks
network traffic
infinite horizon
qualitative models
network resources
Markov decision process