default search action

combined dblp search
author search
venue search
publication search

ask others

Yuanhao Wang 0001

> Home > Persons

Person information

affiliation: Princeton University, USA
affiliation (former): Tsinghua University, Institute for Interdisciplinary Information Sciences, China

Other persons with the same name

see FAQ

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[i16]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2403-04081
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-04081
Aaron Mishkin, Ahmed Khaled, Yuanhao Wang, Aaron Defazio, Robert M. Gower:
Directional Smoothness and Gradient Methods: Convergence and Adaptivity. CoRR abs/2403.04081 (2024)
[i15]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-04201
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-04201
Jiawei Ge, Yuanhao Wang, Wenzhe Li, Chi Jin:
Towards Principled Superhuman AI for Multiplayer Symmetric Games. CoRR abs/2406.04201 (2024)
2023
[c12]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/colt/WangL0023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/colt/WangL0023
Yuanhao Wang, Qinghua Liu, Yu Bai, Chi Jin:
Breaking the Curse of Multiagency: Provably Efficient Decentralized Multi-Agent RL with Function Approximation. COLT 2023: 2793-2848
[c11]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/0004K0023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/0004K0023
Yuanhao Wang, Dingwen Kong, Yu Bai, Chi Jin:
Learning Rationalizable Equilibria in Multiplayer Games. ICLR 2023
[c10]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/0001L023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/0001L023
Yuanhao Wang, Qinghua Liu, Chi Jin:
Is RLHF More Difficult than Standard RL? A Theoretical Perspective. NeurIPS 2023
[i14]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-06606
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-06606
Yuanhao Wang, Qinghua Liu, Yu Bai, Chi Jin:
Breaking the Curse of Multiagency: Provably Efficient Decentralized Multi-Agent RL with Function Approximation. CoRR abs/2302.06606 (2023)
[i13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-14111
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-14111
Yuanhao Wang, Qinghua Liu, Chi Jin:
Is RLHF More Difficult than Standard RL? CoRR abs/2306.14111 (2023)
2022
[c9]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/aistats/Zhang0LG22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aistats/Zhang0LG22
Guodong Zhang, Yuanhao Wang, Laurent Lessard, Roger B. Grosse:
Near-optimal Local Convergence of Alternating Gradient Descent-Ascent for Minimax Optimization. AISTATS 2022: 7659-7679
[c8]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/Liu0J22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/Liu0J22
Qinghua Liu, Yuanhao Wang, Chi Jin:
Learning Markov Games with Adversarial Opponents: Efficient Algorithms and Fundamental Limits. ICML 2022: 14036-14053
[i12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-06803
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-06803
Qinghua Liu, Yuanhao Wang, Chi Jin:
Learning Markov Games with Adversarial Opponents: Efficient Algorithms and Fundamental Limits. CoRR abs/2203.06803 (2022)
[i11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-11402
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-11402
Yuanhao Wang, Dingwen Kong, Yu Bai, Chi Jin:
Learning Rationalizable Equilibria in Multiplayer Games. CoRR abs/2210.11402 (2022)
2021
[c7]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/aistats/Zhang021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aistats/Zhang021
Guodong Zhang, Yuanhao Wang:
On the Suboptimality of Negative Momentum for Minimax Optimization. AISTATS 2021: 2098-2106
[c6]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/Tian0YS21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/Tian0YS21
Yi Tian, Yuanhao Wang, Tiancheng Yu, Suvrit Sra:
Online Learning in Unknown Markov Games. ICML 2021: 10279-10288
[c5]
- view
  - electronic edition @ neurips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/WangWK21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/WangWK21
Yuanhao Wang, Ruosong Wang, Sham M. Kakade:
An Exponential Lower Bound for Linearly Realizable MDP with Constant Suboptimality Gap. NeurIPS 2021: 9521-9533
[i10]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2102-09468
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2102-09468
Guodong Zhang, Yuanhao Wang, Laurent Lessard, Roger B. Grosse:
Don't Fix What ain't Broke: Near-optimal Local Convergence of Alternating Gradient Descent-Ascent for Minimax Optimization. CoRR abs/2102.09468 (2021)
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2103-12690
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2103-12690
Yuanhao Wang, Ruosong Wang, Sham M. Kakade:
An Exponential Lower Bound for Linearly-Realizable MDPs with Constant Suboptimality Gap. CoRR abs/2103.12690 (2021)
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-14555
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-14555
Chi Jin, Qinghua Liu, Yuanhao Wang, Tiancheng Yu:
V-Learning - A Simple, Efficient, Decentralized Algorithm for Multiagent RL. CoRR abs/2110.14555 (2021)
2020
[c4]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/WangDCW20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/WangDCW20
Yuanhao Wang, Kefan Dong, Xiaoyu Chen, Liwei Wang:
Q-learning with UCB Exploration is Sample Efficient for Infinite-Horizon MDP. ICLR 2020
[c3]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/WangHCW20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/WangHCW20
Yuanhao Wang, Jiachen Hu, Xiaoyu Chen, Liwei Wang:
Distributed Bandit Learning: Near-Optimal Regret with Efficient Communication. ICLR 2020
[c2]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/WangZB20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/WangZB20
Yuanhao Wang, Guodong Zhang, Jimmy Ba:
On Solving Minimax Optimization Locally: A Follow-the-Ridge Approach. ICLR 2020
[c1]
- view
  - electronic edition @ neurips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/WangL20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/WangL20
Yuanhao Wang, Jian Li:
Improved Algorithms for Convex-Concave Minimax Optimization. NeurIPS 2020
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2006-06359
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2006-06359
Yuanhao Wang, Jian Li:
Improved Algorithms for Convex-Concave Minimax Optimization. CoRR abs/2006.06359 (2020)
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2008-07459
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2008-07459
Guodong Zhang, Yuanhao Wang:
On the Suboptimality of Negative Momentum for Minimax Optimization. CoRR abs/2008.07459 (2020)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2008-09251
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2008-09251
Yuanhao Wang, Kefan Dong:
Refined Analysis of FPL for Adversarial Markov Decision Processes. CoRR abs/2008.09251 (2020)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-15020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-15020
Yi Tian, Yuanhao Wang, Tiancheng Yu, Suvrit Sra:
Provably Efficient Online Agnostic Learning in Markov Games. CoRR abs/2010.15020 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1901-09311
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1901-09311
Kefan Dong, Yuanhao Wang, Xiaoyu Chen, Liwei Wang:
Q-learning with UCB Exploration is Sample Efficient for Infinite-Horizon MDP. CoRR abs/1901.09311 (2019)
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1904-06309
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1904-06309
Yuanhao Wang, Jiachen Hu, Xiaoyu Chen, Liwei Wang:
Distributed Bandit Learning: How Much Communication is Needed to Achieve (Near) Optimal Regret. CoRR abs/1904.06309 (2019)
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1910-07512
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1910-07512
Yuanhao Wang, Guodong Zhang, Jimmy Ba:
On Solving Minimax Optimization Locally: A Follow-the-Ridge Approach. CoRR abs/1910.07512 (2019)

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.