Login / Signup
Multi-Agent Off-Policy TD Learning: Finite-Time Analysis with Near-Optimal Sample Complexity and Communication Complexity.
Ziyi Chen
Yi Zhou
Rongrong Chen
Published in:
CoRR (2021)
Keyphrases
</>
sample complexity
multi agent
active learning
special case
lower bound
worst case
td learning
data sets
feature selection
reinforcement learning
feature space
upper bound
theoretical analysis