Ziniu Li

Journal Articles

2024
[j3]
Youlin Fan, Bo Jiu, Wenqiang Pu, Ziniu Li, Kang Li, Hongwei Liu:
Sensing Jamming Strategy From Limited Observations: An Imitation Learning Perspective. IEEE Trans. Signal Process. 72: 4098-4114 (2024)
2022
[j2]
Tian Xu, Ziniu Li, Yang Yu:
Error Bounds of Imitating Policies and Environments for Reinforcement Learning. IEEE Trans. Pattern Anal. Mach. Intell. 44(10): 6968-6980 (2022)
2020
[j1]
Xinjian Huang, Ziniu Li, Zhiyuan Liu, Bin Xiang, Yingsan Geng, Jianhua Wang:
Solving the Inverse Design Problem of Electrical Fuse With Machine Learning. IEEE Access 8: 74137-74144 (2020)

Conference and Workshop Papers

2024
[c8]
Heshen Zhan, Congliang Chen, Tian Ding, Ziniu Li, Ruoyu Sun:
Unlocking Black-Box Prompt Tuning Efficiency via Zeroth-Order Optimization. EMNLP (Findings) 2024: 14825-14838
[c7]
Ziniu Li, Tian Xu, Yang Yu:
When is RL better than DPO in RLHF? A Representation and Optimization Perspective. Tiny Papers @ ICLR 2024
[c6]
Ziniu Li, Tian Xu, Yushun Zhang, Zhihang Lin, Yang Yu, Ruoyu Sun, Zhi-Quan Luo:
ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models. ICML 2024
2023
[c5]
Ziniu Li, Tian Xu, Zeyu Qin, Yang Yu, Zhi-Quan Luo:
Imitation Learning from Imperfection: Theoretical Justifications and Algorithms. NeurIPS 2023
[c4]
Tian Xu, Ziniu Li, Yang Yu, Zhi-Quan Luo:
Provably Efficient Adversarial Imitation Learning with Unknown Transitions. UAI 2023: 2367-2378
2022
[c3]
Ziniu Li, Yingru Li, Yushun Zhang, Tong Zhang, Zhi-Quan Luo:
HyperDQN: A Randomized Exploration Method for Deep Reinforcement Learning. ICLR 2022
2020
[c2]
Ziniu Li, Xiong-Hui Chen:
Efficient Exploration by Novelty-Pursuit. DAI 2020: 85-102
[c1]
Tian Xu, Ziniu Li, Yang Yu:
Error Bounds of Imitating Policies and Environments. NeurIPS 2020

Informal and Other Publications

2024
[i16]
Yushun Zhang, Congliang Chen, Tian Ding, Ziniu Li, Ruoyu Sun, Zhi-Quan Luo:
Why Transformers Need Adam: A Hessian Perspective. CoRR abs/2402.16788 (2024)
[i15]
Jiancong Xiao, Ziniu Li, Xingyu Xie, Emily J. Getzen, Cong Fang, Qi Long, Weijie J. Su:
On the Algorithmic Bias of Aligning Large Language Models with RLHF: Preference Collapse and Matching Regularization. CoRR abs/2405.16455 (2024)
[i14]
Chengxing Jia, Pengyuan Wang, Ziniu Li, Yi-Chen Li, Zhilong Zhang, Nan Tang, Yang Yu:
BWArea Model: Learning World Model, Inverse Dynamics, and Policy for Controllable Language Generation. CoRR abs/2405.17039 (2024)
[i13]
Yushun Zhang, Congliang Chen, Ziniu Li, Tian Ding, Chenwei Wu, Yinyu Ye, Zhi-Quan Luo, Ruoyu Sun:
Adam-mini: Use Fewer Learning Rates To Gain More. CoRR abs/2406.16793 (2024)
[i12]
Ziniu Li, Congliang Chen, Tian Xu, Zeyu Qin, Jiancong Xiao, Ruoyu Sun, Zhi-Quan Luo:
Entropic Distribution Matching in Supervised Fine-tuning of LLMs: Less Overfitting and Better Diversity. CoRR abs/2408.16673 (2024)
2023
[i11]
Ziniu Li, Tian Xu, Yang Yu, Zhi-Quan Luo:
Theoretical Analysis of Offline Imitation With Supplementary Dataset. CoRR abs/2301.11687 (2023)
[i10]
Ziniu Li, Ke Xu, Liu Liu, Lanqing Li, Deheng Ye, Peilin Zhao:
Deploying Offline Reinforcement Learning with Human Feedback. CoRR abs/2303.07046 (2023)
[i9]
Tian Xu, Ziniu Li, Yang Yu, Zhi-Quan Luo:
Provably Efficient Adversarial Imitation Learning with Unknown Transitions. CoRR abs/2306.06563 (2023)
[i8]
Ziniu Li, Tian Xu, Yushun Zhang, Yang Yu, Ruoyu Sun, Zhi-Quan Luo:
ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models. CoRR abs/2310.10505 (2023)
[i7]
Ziniu Li, Tian Xu, Yang Yu:
Policy Optimization in RLHF: The Impact of Out-of-preference Data. CoRR abs/2312.10584 (2023)
2022
[i6]
Ziniu Li, Tian Xu, Yang Yu, Zhi-Quan Luo:
Rethinking ValueDice: Does It Really Improve Performance? CoRR abs/2202.02468 (2022)
[i5]
Ziniu Li, Tian Xu, Yang Yu:
A Note on Target Q-learning For Solving Finite MDPs with A Generative Oracle. CoRR abs/2203.11489 (2022)
[i4]
Tian Xu, Ziniu Li, Yang Yu, Zhi-Quan Luo:
Understanding Adversarial Imitation Learning in Small Sample Regime: A Stage-coupled Analysis. CoRR abs/2208.01899 (2022)
2021
[i3]
Tian Xu, Ziniu Li, Yang Yu:
Nearly Minimax Optimal Adversarial Imitation Learning with Known and Unknown Transitions. CoRR abs/2106.10424 (2021)
2020
[i2]
Tian Xu, Ziniu Li, Yang Yu:
Error Bounds of Imitating Policies and Environments. CoRR abs/2010.11876 (2020)
2019
[i1]
Tian Xu, Ziniu Li, Yang Yu:
On Value Discrepancy of Imitation Learning. CoRR abs/1911.07027 (2019)
