Multi-agent temporal-difference learning with linear function approximation: Weak convergence under time-varying network topologies.

Published in: ACC (2016)

Keyphrases