default search action

combined dblp search
author search
venue search
publication search

ask others

Yufeng Zhang 0007

> Home > Persons

Person information

affiliation: Northwestern University, Evanston, IL, USA

Other persons with the same name

see FAQ

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[i11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2403-05632
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-05632
Hongyi Guo, Zhihan Liu, Yufeng Zhang, Zhaoran Wang:
Can Large Language Models Play Games? A Case Study of A Self-Play Approach. CoRR abs/2403.05632 (2024)
[i10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-12312
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-12312
Yuchen Zhu, Yufeng Zhang, Zhaoran Wang, Zhuoran Yang, Xiaohong Chen:
A Mean-Field Analysis of Neural Gradient Descent-Ascent: Applications to Functional Conditional Moment Equations. CoRR abs/2404.12312 (2024)
[i9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-08067
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-08067
Shenao Zhang, Zhihan Liu, Boyi Liu, Yufeng Zhang, Yingxiang Yang, Yongfei Liu, Liyu Chen, Tao Sun, Zhaoran Wang:
Reward-Augmented Data Enhances Direct Preference Alignment of LLMs. CoRR abs/2410.08067 (2024)
2023
[i8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-19420
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-19420
Yufeng Zhang, Fengzhuo Zhang, Zhuoran Yang, Zhaoran Wang:
What and How does In-Context Learning Learn? Bayesian Model Averaging, Parameterization, and Generalization. CoRR abs/2305.19420 (2023)
2022
[c8]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/GuoCZYW22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/GuoCZYW22
Hongyi Guo, Qi Cai, Yufeng Zhang, Zhuoran Yang, Zhaoran Wang:
Provably Efficient Offline Reinforcement Learning for Partially Observable Markov Decision Processes. ICML 2022: 8016-8038
[c7]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/LiuZFYW22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/LiuZFYW22
Zhihan Liu, Yufeng Zhang, Zuyue Fu, Zhuoran Yang, Zhaoran Wang:
Learning from Demonstration: Provably Efficient Adversarial Policy Imitation with Linear Function Approximation. ICML 2022: 14094-14138
[i7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2206-05581
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-05581
Doudou Zhou, Yufeng Zhang, Aaron Sonabend W., Zhaoran Wang, Junwei Lu, Tianxi Cai:
Federated Offline Reinforcement Learning. CoRR abs/2206.05581 (2022)
[i6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2212-14852
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2212-14852
Yufeng Zhang, Boyi Liu, Qi Cai, Lingxiao Wang, Zhaoran Wang:
An Analysis of Attention via the Lens of Exchangeability and Latent Variable Models. CoRR abs/2212.14852 (2022)
2021
[c6]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/aistats/ZhangYW21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aistats/ZhangYW21
Yufeng Zhang, Zhuoran Yang, Zhaoran Wang:
Provably Efficient Actor-Critic for Risk-Sensitive and Robust Adversarial RL: A Linear-Quadratic Case. AISTATS 2021: 2764-2772
[c5]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/LiuZYBW21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/LiuZYBW21
Lewis Liu, Yufeng Zhang, Zhuoran Yang, Reza Babanezhad, Zhaoran Wang:
Infinite-Dimensional Optimization for Zero-Sum Games via Variational Transport. ICML 2021: 7033-7044
[c4]
- view
  - electronic edition @ neurips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/ZhangCYJW21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/ZhangCYJW21
Yufeng Zhang, Siyu Chen, Zhuoran Yang, Michael I. Jordan, Zhaoran Wang:
Wasserstein Flow Meets Replicator Dynamics: A Mean-Field Analysis of Representation Learning in Actor-Critic. NeurIPS 2021: 15993-16006
[c3]
- view
  - electronic edition @ neurips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/WuZYW21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/WuZYW21
Runzhe Wu, Yufeng Zhang, Zhuoran Yang, Zhaoran Wang:
Offline Constrained Multi-Objective Reinforcement Learning via Pessimistic Dual Value Iteration. NeurIPS 2021: 25439-25451
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2108-08765
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2108-08765
Zhihan Liu, Yufeng Zhang, Zuyue Fu, Zhuoran Yang, Zhaoran Wang:
Provably Efficient Generative Adversarial Imitation Learning for Online and Offline Setting with Linear Function Approximation. CoRR abs/2108.08765 (2021)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2112-13530
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2112-13530
Yufeng Zhang, Siyu Chen, Zhuoran Yang, Michael I. Jordan, Zhaoran Wang:
Wasserstein Flow Meets Replicator Dynamics: A Mean-Field Analysis of Representation Learning in Actor-Critic. CoRR abs/2112.13530 (2021)
2020
[c2]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/ZhangCYW20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/ZhangCYW20
Yufeng Zhang, Qi Cai, Zhuoran Yang, Zhaoran Wang:
Generative Adversarial Imitation Learning with Neural Network Parameterization: Global Optimality and Convergence Rate. ICML 2020: 11044-11054
[c1]
- view
  - electronic edition @ neurips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/ZhangCYCW20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/ZhangCYCW20
Yufeng Zhang, Qi Cai, Zhuoran Yang, Yongxin Chen, Zhaoran Wang:
Can Temporal-Difference and Q-Learning Learn Representation? A Mean-Field Theory. NeurIPS 2020
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2003-03709
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2003-03709
Yufeng Zhang, Qi Cai, Zhuoran Yang, Zhaoran Wang:
Generative Adversarial Imitation Learning with Neural Networks: Global Optimality and Convergence Rate. CoRR abs/2003.03709 (2020)
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2006-04761
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2006-04761
Yufeng Zhang, Qi Cai, Zhuoran Yang, Yongxin Chen, Zhaoran Wang:
Can Temporal-Difference and Q-Learning Learn Representation? A Mean-Field Theory. CoRR abs/2006.04761 (2020)
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2012-11554
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2012-11554
Zhuoran Yang, Yufeng Zhang, Yongxin Chen, Zhaoran Wang:
Variational Transport: A Convergent Particle-BasedAlgorithm for Distributional Optimization. CoRR abs/2012.11554 (2020)

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.