Yufeng Zhang 0007
Person information
- affiliation: Northwestern University, Evanston, IL, USA
Other persons with the same name
- Yufeng Zhang — disambiguation page
- Yufeng Zhang 0001 — Hunan University, College of Computer Science and Electronic Engineering, China
- Yufeng Zhang 0002 — Yunnan University, Department of Electronic Engineering, Kunming, China
- Yufeng Zhang 0003 — Stevens Institute of Technology, Hoboken, NJ, USA
- Yufeng Zhang 0004 — South China University of Technology, Guangzhou, China
- Yufeng Zhang 0005 — University of Birmingham, Birmingham, UK
- Yufeng Zhang 0006 — Chinese Academy of Sciences, National Space Science Center, Beijing, China (and 2 more)
- Yufeng Zhang 0008 — China Telecom Dict Application Capability Center, China
2020 – today
- 2024
  - [i11] Hongyi Guo, Zhihan Liu, Yufeng Zhang, Zhaoran Wang: Can Large Language Models Play Games? A Case Study of A Self-Play Approach. CoRR abs/2403.05632 (2024)
  - [i10] Yuchen Zhu, Yufeng Zhang, Zhaoran Wang, Zhuoran Yang, Xiaohong Chen: A Mean-Field Analysis of Neural Gradient Descent-Ascent: Applications to Functional Conditional Moment Equations. CoRR abs/2404.12312 (2024)
  - [i9] Shenao Zhang, Zhihan Liu, Boyi Liu, Yufeng Zhang, Yingxiang Yang, Yongfei Liu, Liyu Chen, Tao Sun, Zhaoran Wang: Reward-Augmented Data Enhances Direct Preference Alignment of LLMs. CoRR abs/2410.08067 (2024)
- 2023
  - [i8] Yufeng Zhang, Fengzhuo Zhang, Zhuoran Yang, Zhaoran Wang: What and How does In-Context Learning Learn? Bayesian Model Averaging, Parameterization, and Generalization. CoRR abs/2305.19420 (2023)
- 2022
  - [c8] Hongyi Guo, Qi Cai, Yufeng Zhang, Zhuoran Yang, Zhaoran Wang: Provably Efficient Offline Reinforcement Learning for Partially Observable Markov Decision Processes. ICML 2022: 8016-8038
  - [c7] Zhihan Liu, Yufeng Zhang, Zuyue Fu, Zhuoran Yang, Zhaoran Wang: Learning from Demonstration: Provably Efficient Adversarial Policy Imitation with Linear Function Approximation. ICML 2022: 14094-14138
  - [i7] Doudou Zhou, Yufeng Zhang, Aaron Sonabend W., Zhaoran Wang, Junwei Lu, Tianxi Cai: Federated Offline Reinforcement Learning. CoRR abs/2206.05581 (2022)
  - [i6] Yufeng Zhang, Boyi Liu, Qi Cai, Lingxiao Wang, Zhaoran Wang: An Analysis of Attention via the Lens of Exchangeability and Latent Variable Models. CoRR abs/2212.14852 (2022)
- 2021
  - [c6] Yufeng Zhang, Zhuoran Yang, Zhaoran Wang: Provably Efficient Actor-Critic for Risk-Sensitive and Robust Adversarial RL: A Linear-Quadratic Case. AISTATS 2021: 2764-2772
  - [c5] Lewis Liu, Yufeng Zhang, Zhuoran Yang, Reza Babanezhad, Zhaoran Wang: Infinite-Dimensional Optimization for Zero-Sum Games via Variational Transport. ICML 2021: 7033-7044
  - [c4] Yufeng Zhang, Siyu Chen, Zhuoran Yang, Michael I. Jordan, Zhaoran Wang: Wasserstein Flow Meets Replicator Dynamics: A Mean-Field Analysis of Representation Learning in Actor-Critic. NeurIPS 2021: 15993-16006
  - [c3] Runzhe Wu, Yufeng Zhang, Zhuoran Yang, Zhaoran Wang: Offline Constrained Multi-Objective Reinforcement Learning via Pessimistic Dual Value Iteration. NeurIPS 2021: 25439-25451
  - [i5] Zhihan Liu, Yufeng Zhang, Zuyue Fu, Zhuoran Yang, Zhaoran Wang: Provably Efficient Generative Adversarial Imitation Learning for Online and Offline Setting with Linear Function Approximation. CoRR abs/2108.08765 (2021)
  - [i4] Yufeng Zhang, Siyu Chen, Zhuoran Yang, Michael I. Jordan, Zhaoran Wang: Wasserstein Flow Meets Replicator Dynamics: A Mean-Field Analysis of Representation Learning in Actor-Critic. CoRR abs/2112.13530 (2021)
- 2020
  - [c2] Yufeng Zhang, Qi Cai, Zhuoran Yang, Zhaoran Wang: Generative Adversarial Imitation Learning with Neural Network Parameterization: Global Optimality and Convergence Rate. ICML 2020: 11044-11054
  - [c1] Yufeng Zhang, Qi Cai, Zhuoran Yang, Yongxin Chen, Zhaoran Wang: Can Temporal-Difference and Q-Learning Learn Representation? A Mean-Field Theory. NeurIPS 2020
  - [i3] Yufeng Zhang, Qi Cai, Zhuoran Yang, Zhaoran Wang: Generative Adversarial Imitation Learning with Neural Networks: Global Optimality and Convergence Rate. CoRR abs/2003.03709 (2020)
  - [i2] Yufeng Zhang, Qi Cai, Zhuoran Yang, Yongxin Chen, Zhaoran Wang: Can Temporal-Difference and Q-Learning Learn Representation? A Mean-Field Theory. CoRR abs/2006.04761 (2020)
  - [i1] Zhuoran Yang, Yufeng Zhang, Yongxin Chen, Zhaoran Wang: Variational Transport: A Convergent Particle-Based Algorithm for Distributional Optimization. CoRR abs/2012.11554 (2020)
last updated on 2025-01-16 23:14 CET by the dblp team
all metadata released as open data under CC0 1.0 license