Login / Signup
Caching Transient Content for IoT Sensing: Multi-Agent Soft Actor-Critic.
Xiongwei Wu
Xiuhua Li
Jun Li
P. C. Ching
Victor C. M. Leung
H. Vincent Poor
Published in:
IEEE Trans. Commun. (2021)
Keyphrases
</>
multi agent
actor critic
reinforcement learning
cooperative
policy gradient
approximate dynamic programming
multi agent systems
temporal difference
neural network
machine learning
function approximation
neuro fuzzy
model free