Login / Signup
Sample and Communication Efficient Fully Decentralized MARL Policy Evaluation via a New Approach: Local TD update.
Hairi
Zifan Zhang
Jia Liu
Published in:
CoRR (2024)
Keyphrases
</>
policy evaluation
temporal difference
least squares
reinforcement learning
multi agent
multi agent reinforcement learning
cooperative
reinforcement learning algorithms
monte carlo
model free
function approximation
learning algorithm
multi agent systems
variance reduction