Boyi Liu 0001
Person information
- affiliation: Northwestern University, IL, USA
Other persons with the same name
- Boyi Liu
- Boyi Liu 0002 (Beihang University, Beijing, China)
- Boyi Liu 0003 (Hong Kong University of Science and Technology, Hong Kong)
2020 – today
- 2024
- [c10] Chau Pham, Boyi Liu, Yingxiang Yang, Zhengyu Chen, Tianyi Liu, Jianbo Yuan, Bryan A. Plummer, Zhaoran Wang, Hongxia Yang:
Let Models Speak Ciphers: Multiagent Debate through Embeddings. ICLR 2024
- [c9] Zhihan Liu, Hao Hu, Shenao Zhang, Hongyi Guo, Shuqi Ke, Boyi Liu, Zhaoran Wang:
Reason for Future, Act for Now: A Principled Architecture for Autonomous LLM Agents. ICML 2024
- [i14] Zihao Li, Boyi Liu, Zhuoran Yang, Zhaoran Wang, Mengdi Wang:
Double Duality: Variational Primal-Dual Policy Optimization for Constrained Reinforcement Learning. CoRR abs/2402.10810 (2024)
- [i13] Zhihan Liu, Miao Lu, Shenao Zhang, Boyi Liu, Hongyi Guo, Yingxiang Yang, Jose H. Blanchet, Zhaoran Wang:
Provably Mitigating Overoptimization in RLHF: Your SFT Loss is Implicitly an Adversarial Regularizer. CoRR abs/2405.16436 (2024)
- [i12] Shenao Zhang, Zhihan Liu, Boyi Liu, Yufeng Zhang, Yingxiang Yang, Yongfei Liu, Liyu Chen, Tao Sun, Zhaoran Wang:
Reward-Augmented Data Enhances Direct Preference Alignment of LLMs. CoRR abs/2410.08067 (2024)
- 2023
- [j1] Zihao Li, Boyi Liu, Zhuoran Yang, Zhaoran Wang, Mengdi Wang:
Double Duality: Variational Primal-Dual Policy Optimization for Constrained Reinforcement Learning. J. Mach. Learn. Res. 24: 385:1-385:43 (2023)
- [c8] Jing Wang, Meichen Song, Feng Gao, Boyi Liu, Zhaoran Wang, Yi Wu:
Differentiable Arbitrating in Zero-sum Markov Games. AAMAS 2023: 1034-1043
- [c7] Jiayang Li, Jing Yu, Boyi Liu, Yu Marco Nie, Zhaoran Wang:
Achieving Hierarchy-Free Approximation for Bilevel Programs with Equilibrium Constraints. ICML 2023: 20312-20335
- [c6] Shenao Zhang, Boyi Liu, Zhaoran Wang, Tuo Zhao:
Model-Based Reparameterization Policy Gradient Methods: Theory and Practical Algorithms. NeurIPS 2023
- [i11] Jiayang Li, Jing Yu, Boyi Liu, Zhaoran Wang, Yu Marco Nie:
Achieving Hierarchy-Free Approximation for Bilevel Programs With Equilibrium Constraints. CoRR abs/2302.09734 (2023)
- [i10] Jing Wang, Meichen Song, Feng Gao, Boyi Liu, Zhaoran Wang, Yi Wu:
Differentiable Arbitrating in Zero-sum Markov Games. CoRR abs/2302.10058 (2023)
- [i9] Zhihan Liu, Hao Hu, Shenao Zhang, Hongyi Guo, Shuqi Ke, Boyi Liu, Zhaoran Wang:
Reason for Future, Act for Now: A Principled Framework for Autonomous LLM Agents with Provable Sample Efficiency. CoRR abs/2309.17382 (2023)
- [i8] Chau Pham, Boyi Liu, Yingxiang Yang, Zhengyu Chen, Tianyi Liu, Jianbo Yuan, Bryan A. Plummer, Zhaoran Wang, Hongxia Yang:
Let Models Speak Ciphers: Multiagent Debate through Embeddings. CoRR abs/2310.06272 (2023)
- [i7] Shenao Zhang, Boyi Liu, Zhaoran Wang, Tuo Zhao:
Model-Based Reparameterization Policy Gradient Methods: Theory and Practical Algorithms. CoRR abs/2310.19927 (2023)
- 2022
- [c5] Boyi Liu, Jiayang Li, Zhuoran Yang, Hoi-To Wai, Mingyi Hong, Yu Marco Nie, Zhaoran Wang:
Inducing Equilibria via Incentives: Simultaneous Design-and-Play Ensures Global Convergence. NeurIPS 2022
- [c4] Fengzhuo Zhang, Boyi Liu, Kaixin Wang, Vincent Y. F. Tan, Zhuoran Yang, Zhaoran Wang:
Relational Reasoning via Set Transformers: Provable Efficiency and Applications to MARL. NeurIPS 2022
- [i6] Jiayang Li, Jing Yu, Qianni Wang, Boyi Liu, Zhaoran Wang, Yu Marco Nie:
Differentiable Bilevel Programming for Stackelberg Congestion Games. CoRR abs/2209.07618 (2022)
- [i5] Fengzhuo Zhang, Boyi Liu, Kaixin Wang, Vincent Y. F. Tan, Zhuoran Yang, Zhaoran Wang:
Relational Reasoning via Set Transformers: Provable Efficiency and Applications to MARL. CoRR abs/2209.09845 (2022)
- [i4] Yufeng Zhang, Boyi Liu, Qi Cai, Lingxiao Wang, Zhaoran Wang:
An Analysis of Attention via the Lens of Exchangeability and Latent Variable Models. CoRR abs/2212.14852 (2022)
- 2021
- [c3] Boyi Liu, Qi Cai, Zhuoran Yang, Zhaoran Wang:
BooVI: Provably Efficient Bootstrapped Value Iteration. NeurIPS 2021: 7041-7053
- [i3] Boyi Liu, Jiayang Li, Zhuoran Yang, Hoi-To Wai, Mingyi Hong, Yu Marco Nie, Zhaoran Wang:
Inducing Equilibria via Incentives: Simultaneous Design-and-Play Finds Global Optima. CoRR abs/2110.01212 (2021)
2010 – 2019
- 2019
- [c2] Yuan Xie, Boyi Liu, Qiang Liu, Zhaoran Wang, Yuan Zhou, Jian Peng:
Off-Policy Evaluation and Learning from Logged Bandit Feedback: Error Reduction via Surrogate Policy. ICLR (Poster) 2019
- [c1] Boyi Liu, Qi Cai, Zhuoran Yang, Zhaoran Wang:
Neural Trust Region/Proximal Policy Optimization Attains Globally Optimal Policy. NeurIPS 2019: 10564-10575
- [i2] Boyi Liu, Qi Cai, Zhuoran Yang, Zhaoran Wang:
Neural Proximal/Trust Region Policy Optimization Attains Globally Optimal Policy. CoRR abs/1906.10306 (2019)
- 2018
- [i1] Yuan Xie, Boyi Liu, Qiang Liu, Zhaoran Wang, Yuan Zhou, Jian Peng:
Off-Policy Evaluation and Learning from Logged Bandit Feedback: Error Reduction via Surrogate Policy. CoRR abs/1808.00232 (2018)
last updated on 2025-02-09 15:57 CET by the dblp team
all metadata released as open data under CC0 1.0 license