Caching Transient Content for IoT Sensing: Multi-Agent Soft Actor-Critic.
Xiongwei Wu
Xiuhua Li
Jun Li
P. C. Ching
Victor C. M. Leung
H. Vincent Poor
Published in:
CoRR (2020)
Keyphrases
multi agent
actor critic
reinforcement learning
policy gradient
neuro fuzzy
function approximation
neural network
multiple agents
cooperative
multi agent systems
dynamic environments
temporal difference