default search action

combined dblp search
author search
venue search
publication search

ask others

Zaiwei Chen

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[j7]
- view
  authority control:
- export record
  dblp key:
  - journals/ior/ChenMSS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ior/ChenMSS24
Zaiwei Chen, Siva Theja Maguluri, Sanjay Shakkottai, Karthikeyan Shanmugam:
A Lyapunov Theory for Finite-Sample Guarantees of Markovian Stochastic Approximation. Oper. Res. 72(4): 1352-1367 (2024)
[c10]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/ChenM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/ChenM24
Zaiwei Chen, Eric Mazumdar:
Last-Iterate Convergence for Generalized Frank-Wolfe in Monotone Variational Inequalities. NeurIPS 2024
[c9]
- view
  authority control:
- export record
  dblp key:
  - conf/sigecom/ChenZMOW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/sigecom/ChenZMOW24
Zaiwei Chen, Kaiqing Zhang, Eric Mazumdar, Asuman E. Ozdaglar, Adam Wierman:
Two-Timescale Q-Learning with Function Approximation in Zero-Sum Stochastic Games. EC 2024: 378
[i18]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-19811
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-19811
Ruiyang Jin, Zaiwei Chen, Yiheng Lin, Jie Song, Adam Wierman:
Approximate Global Convergence of Independent Learning in Multi-Agent Systems. CoRR abs/2405.19811 (2024)
[i17]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-01447
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-01447
Zaiwei Chen, Kaiqing Zhang, Eric Mazumdar, Asuman E. Ozdaglar, Adam Wierman:
Last-Iterate Convergence of Payoff-Based Independent Learning in Zero-Sum Stochastic Games. CoRR abs/2409.01447 (2024)
[i16]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2411-07591
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2411-07591
Chenbei Lu, Laixi Shi, Zaiwei Chen, Chenye Wu, Adam Wierman:
Overcoming the Curse of Dimensionality in Reinforcement Learning Through Approximate Factorization. CoRR abs/2411.07591 (2024)
2023
[j6]
- view
  authority control:
- export record
  dblp key:
  - journals/pomacs/ZhangQXLCW23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/pomacs/ZhangQXLCW23
Yizhou Zhang, Guannan Qu, Pan Xu, Yiheng Lin, Zaiwei Chen, Adam Wierman:
Global Convergence of Localized Policy Iteration in Networked Multi-Agent Reinforcement Learning. Proc. ACM Meas. Anal. Comput. Syst. 7(1): 13:1-13:51 (2023)
[j5]
- view
  authority control:
- export record
  dblp key:
  - journals/simods/ChenCM23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/simods/ChenCM23
Zaiwei Chen, John-Paul Clarke, Siva Theja Maguluri:
Target Network and Truncation Overcome the Deadly Triad in \(\boldsymbol{Q}\)-Learning. SIAM J. Math. Data Sci. 5(4): 1078-1101 (2023)
[c8]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/ChenZMOW23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/ChenZMOW23
Zaiwei Chen, Kaiqing Zhang, Eric Mazumdar, Asuman E. Ozdaglar, Adam Wierman:
A Finite-Sample Analysis of Payoff-Based Independent Learning in Zero-Sum Stochastic Games. NeurIPS 2023
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/sigmetrics/ZhangQ0LCW23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/sigmetrics/ZhangQ0LCW23
Yizhou Zhang, Guannan Qu, Pan Xu, Yiheng Lin, Zaiwei Chen, Adam Wierman:
Global Convergence of Localized Policy Iteration in Networked Multi-Agent Reinforcement Learning. SIGMETRICS (Abstracts) 2023: 83-84
[c6]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/uai/ZhouCLW23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/uai/ZhouCLW23
Zhaoyi Zhou, Zaiwei Chen, Yiheng Lin, Adam Wierman:
Convergence rates for localized actor-critic in networked Markov potential games. UAI 2023: 2563-2573
[i15]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2303-03100
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-03100
Zaiwei Chen, Kaiqing Zhang, Eric Mazumdar, Asuman E. Ozdaglar, Adam Wierman:
A Finite-Sample Analysis of Payoff-Based Independent Learning in Zero-Sum Stochastic Games. CoRR abs/2303.03100 (2023)
[i14]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2303-04865
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-04865
Zhaoyi Zhou, Zaiwei Chen, Yiheng Lin, Adam Wierman:
Convergence Rates for Localized Actor-Critic in Networked Markov Potential Games. CoRR abs/2303.04865 (2023)
[i13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2303-15740
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-15740
Zaiwei Chen, Siva Theja Maguluri, Martin Zubeldia:
Concentration of Contractive Stochastic Approximation: Additive and Multiplicative Noise. CoRR abs/2303.15740 (2023)
[i12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-04905
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-04905
Zaiwei Chen, Kaiqing Zhang, Eric Mazumdar, Asuman E. Ozdaglar, Adam Wierman:
Two-Timescale Q-Learning with Function Approximation in Zero-Sum Stochastic Games. CoRR abs/2312.04905 (2023)
2022
[j4]
- view
  authority control:
- export record
  dblp key:
  - journals/automatica/ChenZDCM22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/automatica/ChenZDCM22
Zaiwei Chen, Sheng Zhang, Thinh T. Doan, John-Paul Clarke, Siva Theja Maguluri:
Finite-sample analysis of nonlinear stochastic approximation with applications in reinforcement learning. Autom. 146: 110623 (2022)
[j3]
- view
  authority control:
- export record
  dblp key:
  - journals/csysl/ChenKM22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/csysl/ChenKM22
Zaiwei Chen, Sajad Khodadadian, Siva Theja Maguluri:
Finite-Sample Analysis of Off-Policy Natural Actor-Critic With Linear Function Approximation. IEEE Control. Syst. Lett. 6: 2611-2616 (2022)
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/pomacs/ChenMM22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/pomacs/ChenMM22
Zaiwei Chen, Shancong Mou, Siva Theja Maguluri:
Stationary Behavior of Constant Stepsize SGD Type Algorithms: An Asymptotic Characterization. Proc. ACM Meas. Anal. Comput. Syst. 6(1): 19:1-19:24 (2022)
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/sigmetrics/Chen22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/sigmetrics/Chen22
Zaiwei Chen:
A Unified Lyapunov Framework for Finite-Sample Analysis of Reinforcement Learning Algorithms. SIGMETRICS Perform. Evaluation Rev. 50(3): 12-15 (2022)
[c5]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/aistats/ChenM22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aistats/ChenM22
Zaiwei Chen, Siva Theja Maguluri:
Sample Complexity of Policy-Based Methods under Off-Policy Sampling and Linear Function Approximation. AISTATS 2022: 11195-11214
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/sigmetrics/ChenMM22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/sigmetrics/ChenMM22
Zaiwei Chen, Shancong Mou, Siva Theja Maguluri:
Stationary Behavior of Constant Stepsize SGD Type Algorithms: An Asymptotic Characterization. SIGMETRICS (Abstracts) 2022: 109-110
[i11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-02628
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-02628
Zaiwei Chen, John-Paul Clarke, Siva Theja Maguluri:
Target Network and Truncation Overcome The Deadly triad in Q-Learning. CoRR abs/2203.02628 (2022)
[i10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2208-03247
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2208-03247
Zaiwei Chen, Siva Theja Maguluri:
Sample Complexity of Policy-Based Methods under Off-Policy Sampling and Linear Function Approximation. CoRR abs/2208.03247 (2022)
[i9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-17116
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-17116
Yizhou Zhang, Guannan Qu, Pan Xu, Yiheng Lin, Zaiwei Chen, Adam Wierman:
Global Convergence of Localized Policy Iteration in Networked Multi-Agent Reinforcement Learning. CoRR abs/2211.17116 (2022)
2021
[c3]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/KhodadadianCM21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/KhodadadianCM21
Sajad Khodadadian, Zaiwei Chen, Siva Theja Maguluri:
Finite-Sample Analysis of Off-Policy Natural Actor-Critic Algorithm. ICML 2021: 5420-5431
[c2]
- view
  - electronic edition @ neurips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/ChenMSS21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/ChenMSS21
Zaiwei Chen, Siva Theja Maguluri, Sanjay Shakkottai, Karthikeyan Shanmugam:
Finite-Sample Analysis of Off-Policy TD-Learning via Generalized Bellman Operators. NeurIPS 2021: 21440-21452
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2102-01567
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2102-01567
Zaiwei Chen, Siva Theja Maguluri, Sanjay Shakkottai, Karthikeyan Shanmugam:
A Lyapunov Theory for Finite-Sample Guarantees of Asynchronous Q-Learning and TD-Learning Variants. CoRR abs/2102.01567 (2021)
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2102-09318
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2102-09318
Sajad Khodadadian, Zaiwei Chen, Siva Theja Maguluri:
Finite-Sample Analysis of Off-Policy Natural Actor-Critic Algorithm. CoRR abs/2102.09318 (2021)
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2103-01528
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2103-01528
Fanruiqi Zeng, Zaiwei Chen, John-Paul Clarke, David Goldsman:
Nested Vehicle Routing Problem: Optimizing Drone-Truck Surveillance Operations. CoRR abs/2103.01528 (2021)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2105-12540
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2105-12540
Zaiwei Chen, Sajad Khodadadian, Siva Theja Maguluri:
Finite-Sample Analysis of Off-Policy Natural Actor-Critic with Linear Function Approximation. CoRR abs/2105.12540 (2021)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2106-12729
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-12729
Zaiwei Chen, Siva Theja Maguluri, Sanjay Shakkottai, Karthikeyan Shanmugam:
Finite-Sample Analysis of Off-Policy TD-Learning via Generalized Bellman Operators. CoRR abs/2106.12729 (2021)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2111-06328
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2111-06328
Zaiwei Chen, Shancong Mou, Siva Theja Maguluri:
Stationary Behavior of Constant Stepsize SGD Type Algorithms: An Asymptotic Characterization. CoRR abs/2111.06328 (2021)
2020
[c1]
- view
  - electronic edition @ neurips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/ChenMSS20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/ChenMSS20
Zaiwei Chen, Siva Theja Maguluri, Sanjay Shakkottai, Karthikeyan Shanmugam:
Finite-Sample Analysis of Contractive Stochastic Approximation Using Smooth Convex Envelopes. NeurIPS 2020
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2002-00874
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2002-00874
Zaiwei Chen, Siva Theja Maguluri, Sanjay Shakkottai, Karthikeyan Shanmugam:
Finite-Sample Analysis of Stochastic Approximation Using Smooth Convex Envelopes. CoRR abs/2002.00874 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1905-11425
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1905-11425
Zaiwei Chen, Sheng Zhang, Thinh T. Doan, Siva Theja Maguluri, John-Paul Clarke:
Finite-Time Analysis of Q-Learning with Linear Function Approximation. CoRR abs/1905.11425 (2019)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.