Zhi Xu 0001
Person information
- affiliation (PhD 2021): Massachusetts Institute of Technology, Laboratory for Information and Decision Systems, Cambridge, MA, USA
Other persons with the same name
- Zhi Xu — disambiguation page
- Zhi Xu 0002 — University of Waterloo, School of Computer Science, Waterloo, Canada
- Zhi Xu 0003 — University of Western Ontario, Department of Computer Science, London, Canada
- Zhi Xu 0004 — Pennsylvania State University, Department of Computer Science and Engineering, University Park, PA, USA
- Zhi Xu 0005 — Guilin University of Electronic Technology, Guangxi Key Laboratory of Image and Graphic Intelligent Processing, Guilin, China (and 1 more)
- Zhi Xu 0006 — Anhui University of Science and Technology, College of Electrical and Information Engineering, Huainan, China
- Zhi Xu 0007 — East China University of Science and Technology, School of Chemical Engineering, Shanghai, China (and 1 more)
2020 – today
- 2022
- [j3] Devavrat Shah, Qiaomin Xie, Zhi Xu: Nonasymptotic Analysis of Monte Carlo Tree Search. Oper. Res. 70(6): 3234-3260 (2022)
- 2021
- [b1] Zhi Xu: Data Efficient Reinforcement Learning. Massachusetts Institute of Technology, USA, 2021
- [j2] John N. Tsitsiklis, Kuang Xu, Zhi Xu: Private Sequential Learning. Oper. Res. 69(5): 1575-1590 (2021)
- [c12] Anish Agarwal, Abdullah Alomar, Varkey Alumootil, Devavrat Shah, Dennis Shen, Zhi Xu, Cindy Yang: PerSim: Data-Efficient Offline Reinforcement Learning with Heterogeneous Agents via Personalized Simulators. NeurIPS 2021: 18564-18576
- [i10] Anish Agarwal, Abdullah Alomar, Varkey Alumootil, Devavrat Shah, Dennis Shen, Zhi Xu, Cindy Yang: PerSim: Data-Efficient Offline Reinforcement Learning with Heterogeneous Agents via Personalized Simulators. CoRR abs/2102.06961 (2021)
- 2020
- [c11] Devavrat Shah, Varun Somani, Qiaomin Xie, Zhi Xu: On Reinforcement Learning for Turn-based Zero-sum Markov Games. FODS 2020: 139-148
- [c10] Yuzhe Yang, Guo Zhang, Zhi Xu, Dina Katabi: Harnessing Structures for Value-Based Planning and Reinforcement Learning. ICLR 2020
- [c9] Devavrat Shah, Qiaomin Xie, Zhi Xu: Stable Reinforcement Learning with Unbounded State Space. L4DC 2020: 581
- [c8] Devavrat Shah, Dogyoon Song, Zhi Xu, Yuzhe Yang: Sample Efficient Reinforcement Learning via Low-Rank Matrix Estimation. NeurIPS 2020
- [c7] Yuzhe Yang, Zhi Xu: Rethinking the Value of Labels for Improving Class-Imbalanced Learning. NeurIPS 2020
- [c6] Devavrat Shah, Qiaomin Xie, Zhi Xu: Non-Asymptotic Analysis of Monte Carlo Tree Search. SIGMETRICS (Abstracts) 2020: 31-32
- [i9] Devavrat Shah, Varun Somani, Qiaomin Xie, Zhi Xu: On Reinforcement Learning for Turn-based Zero-sum Markov Games. CoRR abs/2002.10620 (2020)
- [i8] Devavrat Shah, Qiaomin Xie, Zhi Xu: Stable Reinforcement Learning with Unbounded State Space. CoRR abs/2006.04353 (2020)
- [i7] Devavrat Shah, Dogyoon Song, Zhi Xu, Yuzhe Yang: Sample Efficient Reinforcement Learning via Low-Rank Matrix Estimation. CoRR abs/2006.06135 (2020)
- [i6] Yuzhe Yang, Zhi Xu: Rethinking the Value of Labels for Improving Class-Imbalanced Learning. CoRR abs/2006.07529 (2020)
2010 – 2019
- 2019
- [c5] Yuzhe Yang, Guo Zhang, Zhi Xu, Dina Katabi: ME-Net: Towards Effective Adversarial Robustness with Matrix Estimation. ICML 2019: 7025-7034
- [i5] Devavrat Shah, Qiaomin Xie, Zhi Xu: On Reinforcement Learning Using Monte Carlo Tree Search with Supervised Learning: Non-Asymptotic Analysis. CoRR abs/1902.05213 (2019)
- [i4] Yuzhe Yang, Guo Zhang, Dina Katabi, Zhi Xu: ME-Net: Towards Effective Adversarial Robustness with Matrix Estimation. CoRR abs/1905.11971 (2019)
- [i3] Yuzhe Yang, Guo Zhang, Zhi Xu, Dina Katabi: Harnessing Structures for Value-Based Planning and Reinforcement Learning. CoRR abs/1909.12255 (2019)
- 2018
- [c4] John N. Tsitsiklis, Kuang Xu, Zhi Xu: Private Sequential Learning. COLT 2018: 721-727
- [i2] John N. Tsitsiklis, Kuang Xu, Zhi Xu: Private Sequential Learning. CoRR abs/1805.02136 (2018)
- 2017
- [j1] Xudong Chen, Ji Liu, Mohamed-Ali Belabbas, Zhi Xu, Tamer Basar: Distributed Evaluation and Convergence of Self-Appraisals in Social Networks. IEEE Trans. Autom. Control. 62(1): 291-304 (2017)
- 2015
- [c3] Zhi Xu, Ji Liu, Tamer Basar: On a Modified DeGroot-Friedkin model of opinion dynamics. ACC 2015: 1047-1052
- [c2] Zhi Xu, Ali Khanafer, Tamer Basar: Competition over epidemic networks: Nash and stackelberg games. ACC 2015: 2063-2068
- [c1] Xudong Chen, Ji Liu, Zhi Xu, Tamer Basar: Distributed evaluation and convergence of self-appraisals in social networks. CDC 2015: 2895-2900
- [i1] Xudong Chen, Ji Liu, Zhi Xu, Tamer Basar: Distributed Evaluation and Convergence of Self-Appraisals in Social Networks. CoRR abs/1503.08175 (2015)
last updated on 2025-01-21 00:03 CET by the dblp team
all metadata released as open data under CC0 1.0 license