Zaiwei Chen
ORCID
Publication Activity (10 Years)
Years Active: 2019-2024
Publications (10 Years): 30
Top Topics
Global Convergence
Stochastic Approximation
Natural Actor Critic
Reinforcement Learning
Top Venues
CoRR
NeurIPS
Proc. ACM Meas. Anal. Comput. Syst.
SIGMETRICS (Abstracts)
Publications
Ruiyang Jin, Zaiwei Chen, Yiheng Lin, Jie Song, Adam Wierman
Approximate Global Convergence of Independent Learning in Multi-Agent Systems.
CoRR
(2024)
Zaiwei Chen, John-Paul Clarke, Siva Theja Maguluri
Target Network and Truncation Overcome the Deadly Triad in \(\boldsymbol{Q}\)-Learning.
SIAM J. Math. Data Sci.
5 (4) (2023)
Zaiwei Chen, Kaiqing Zhang, Eric Mazumdar, Asuman E. Ozdaglar, Adam Wierman
A Finite-Sample Analysis of Payoff-Based Independent Learning in Zero-Sum Stochastic Games.
NeurIPS
(2023)
Zhaoyi Zhou, Zaiwei Chen, Yiheng Lin, Adam Wierman
Convergence Rates for Localized Actor-Critic in Networked Markov Potential Games.
CoRR
(2023)
Zhaoyi Zhou, Zaiwei Chen, Yiheng Lin, Adam Wierman
Convergence rates for localized actor-critic in networked Markov potential games.
UAI
(2023)
Yizhou Zhang, Guannan Qu, Pan Xu, Yiheng Lin, Zaiwei Chen, Adam Wierman
Global Convergence of Localized Policy Iteration in Networked Multi-Agent Reinforcement Learning.
SIGMETRICS (Abstracts)
(2023)
Zaiwei Chen, Kaiqing Zhang, Eric Mazumdar, Asuman E. Ozdaglar, Adam Wierman
A Finite-Sample Analysis of Payoff-Based Independent Learning in Zero-Sum Stochastic Games.
CoRR
(2023)
Zaiwei Chen, Kaiqing Zhang, Eric Mazumdar, Asuman E. Ozdaglar, Adam Wierman
Two-Timescale Q-Learning with Function Approximation in Zero-Sum Stochastic Games.
CoRR
(2023)
Yizhou Zhang, Guannan Qu, Pan Xu, Yiheng Lin, Zaiwei Chen, Adam Wierman
Global Convergence of Localized Policy Iteration in Networked Multi-Agent Reinforcement Learning.
Proc. ACM Meas. Anal. Comput. Syst.
7 (1) (2023)
Zaiwei Chen, Siva Theja Maguluri, Martin Zubeldia
Concentration of Contractive Stochastic Approximation: Additive and Multiplicative Noise.
CoRR
(2023)
Zaiwei Chen
A Unified Lyapunov Framework for Finite-Sample Analysis of Reinforcement Learning Algorithms.
SIGMETRICS Perform. Evaluation Rev.
50 (3) (2022)
Zaiwei Chen, Shancong Mou, Siva Theja Maguluri
Stationary Behavior of Constant Stepsize SGD Type Algorithms: An Asymptotic Characterization.
SIGMETRICS (Abstracts)
(2022)
Zaiwei Chen, Siva Theja Maguluri
Sample Complexity of Policy-Based Methods under Off-Policy Sampling and Linear Function Approximation.
AISTATS
(2022)
Zaiwei Chen, John-Paul Clarke, Siva Theja Maguluri
Target Network and Truncation Overcome the Deadly Triad in Q-Learning.
CoRR
(2022)
Zaiwei Chen, Shancong Mou, Siva Theja Maguluri
Stationary Behavior of Constant Stepsize SGD Type Algorithms: An Asymptotic Characterization.
Proc. ACM Meas. Anal. Comput. Syst.
6 (1) (2022)
Yizhou Zhang, Guannan Qu, Pan Xu, Yiheng Lin, Zaiwei Chen, Adam Wierman
Global Convergence of Localized Policy Iteration in Networked Multi-Agent Reinforcement Learning.
CoRR
(2022)
Zaiwei Chen, Siva Theja Maguluri
Sample Complexity of Policy-Based Methods under Off-Policy Sampling and Linear Function Approximation.
CoRR
(2022)
Zaiwei Chen, Sheng Zhang, Thinh T. Doan, John-Paul Clarke, Siva Theja Maguluri
Finite-sample analysis of nonlinear stochastic approximation with applications in reinforcement learning.
Autom.
146 (2022)
Zaiwei Chen, Sajad Khodadadian, Siva Theja Maguluri
Finite-Sample Analysis of Off-Policy Natural Actor-Critic With Linear Function Approximation.
IEEE Control. Syst. Lett.
6 (2022)
Sajad Khodadadian, Zaiwei Chen, Siva Theja Maguluri
Finite-Sample Analysis of Off-Policy Natural Actor-Critic Algorithm.
CoRR
(2021)
Zaiwei Chen, Shancong Mou, Siva Theja Maguluri
Stationary Behavior of Constant Stepsize SGD Type Algorithms: An Asymptotic Characterization.
CoRR
(2021)
Zaiwei Chen, Siva Theja Maguluri, Sanjay Shakkottai, Karthikeyan Shanmugam
Finite-Sample Analysis of Off-Policy TD-Learning via Generalized Bellman Operators.
CoRR
(2021)
Zaiwei Chen, Siva Theja Maguluri, Sanjay Shakkottai, Karthikeyan Shanmugam
A Lyapunov Theory for Finite-Sample Guarantees of Asynchronous Q-Learning and TD-Learning Variants.
CoRR
(2021)
Fanruiqi Zeng, Zaiwei Chen, John-Paul Clarke, David Goldsman
Nested Vehicle Routing Problem: Optimizing Drone-Truck Surveillance Operations.
CoRR
(2021)
Sajad Khodadadian, Zaiwei Chen, Siva Theja Maguluri
Finite-Sample Analysis of Off-Policy Natural Actor-Critic Algorithm.
ICML
(2021)
Zaiwei Chen, Siva Theja Maguluri, Sanjay Shakkottai, Karthikeyan Shanmugam
Finite-Sample Analysis of Off-Policy TD-Learning via Generalized Bellman Operators.
NeurIPS
(2021)
Zaiwei Chen, Sajad Khodadadian, Siva Theja Maguluri
Finite-Sample Analysis of Off-Policy Natural Actor-Critic with Linear Function Approximation.
CoRR
(2021)
Zaiwei Chen, Siva Theja Maguluri, Sanjay Shakkottai, Karthikeyan Shanmugam
Finite-Sample Analysis of Stochastic Approximation Using Smooth Convex Envelopes.
CoRR
(2020)
Zaiwei Chen, Siva Theja Maguluri, Sanjay Shakkottai, Karthikeyan Shanmugam
Finite-Sample Analysis of Contractive Stochastic Approximation Using Smooth Convex Envelopes.
NeurIPS
(2020)
Zaiwei Chen, Sheng Zhang, Thinh T. Doan, Siva Theja Maguluri, John-Paul Clarke
Finite-Time Analysis of Q-Learning with Linear Function Approximation.
CoRR
(2019)