default search action

combined dblp search
author search
venue search
publication search

ask others

Tengyu Xu

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[j3]
- view
  authority control:
- export record
  dblp key:
  - journals/orl/LiGZXLL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/orl/LiGZXLL24
Tianjiao Li, Ziwei Guan, Shaofeng Zou, Tengyu Xu, Yingbin Liang, Guanghui Lan:
Faster algorithm and sharper analysis for constrained Markov decision process. Oper. Res. Lett. 54: 107107 (2024)
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/tit/XuWZL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tit/XuWZL24
Tengyu Xu, Yue Wang, Shaofeng Zou, Yingbin Liang:
Provably Efficient Offline Reinforcement Learning With Trajectory-Wise Reward. IEEE Trans. Inf. Theory 70(9): 6481-6518 (2024)
[i20]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-20370
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-20370
Tengyu Xu, Eryk Helenowski, Karthik Abinav Sankararaman, Di Jin, Kaiyan Peng, Eric Han, Shaoliang Nie, Chen Zhu, Hejia Zhang, Wenxuan Zhou, Zhouhao Zeng, Yun He, Karishma Mandyam, Arya Talabzadeh, Madian Khabsa, Gabriel Cohen, Yuandong Tian, Hao Ma, Sinong Wang, Han Fang:
The Perfect Blend: Redefining RLHF with Mixture of Judges. CoRR abs/2409.20370 (2024)
[i19]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-15553
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-15553
Yun He, Di Jin, Chaoqi Wang, Chloe Bi, Karishma Mandyam, Hejia Zhang, Chen Zhu, Ning Li, Tengyu Xu, Hongjiang Lv, Shruti Bhosale, Chenguang Zhu, Karthik Abinav Sankararaman, Eryk Helenowski, Melanie Kambadur, Aditya Tayade, Hao Ma, Han Fang, Sinong Wang:
Multi-IF: Benchmarking LLMs on Multi-Turn and Multilingual Instructions Following. CoRR abs/2410.15553 (2024)
2023
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/jvca/ShangXKK23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jvca/ShangXKK23
Xiumin Shang, Tengyu Xu, Ioannis Karamouzas, Marcelo Kallmann:
Constraint-based multi-agent reinforcement learning for collaborative tasks. Comput. Animat. Virtual Worlds 34(3-4) (2023)
2022
[c14]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/GuanXL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/GuanXL22
Ziwei Guan, Tengyu Xu, Yingbin Liang:
PER-ETD: A Polynomially Efficient Emphatic Temporal Difference Learning Method. ICLR 2022
[c13]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/LinWXLZ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/LinWXLZ22
Sen Lin, Jialin Wan, Tengyu Xu, Yingbin Liang, Junshan Zhang:
Model-Based Offline Meta-Reinforcement Learning with Regularization. ICLR 2022
[c12]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/XuYWL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/XuYWL22
Tengyu Xu, Zhuoran Yang, Zhaoran Wang, Yingbin Liang:
A Unifying Framework of Off-Policy General Value Function Evaluation. NeurIPS 2022
[c11]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/uai/XiongXZL022
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/uai/XiongXZL022
Huaqing Xiong, Tengyu Xu, Lin Zhao, Yingbin Liang, Wei Zhang:
Deterministic policy gradient: Convergence analysis. UAI 2022: 2159-2169
[i18]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2202-02929
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-02929
Sen Lin, Jialin Wan, Tengyu Xu, Yingbin Liang, Junshan Zhang:
Model-Based Offline Meta-Reinforcement Learning with Regularization. CoRR abs/2202.02929 (2022)
[i17]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2206-06426
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-06426
Tengyu Xu, Yingbin Liang:
Provably Efficient Offline Reinforcement Learning with Trajectory-Wise Reward. CoRR abs/2206.06426 (2022)
2021
[c10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/XiongXLZ21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/XiongXLZ21
Huaqing Xiong, Tengyu Xu, Yingbin Liang, Wei Zhang:
Non-asymptotic Convergence of Adam-type Reinforcement Learning Algorithms under Markovian Sampling. AAAI 2021: 10460-10468
[c9]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/aistats/XuL21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aistats/XuL21
Tengyu Xu, Yingbin Liang:
Sample Complexity Bounds for Two Timescale Value-based Reinforcement Learning Algorithms. AISTATS 2021: 811-819
[c8]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/aistats/GuanXL21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aistats/GuanXL21
Ziwei Guan, Tengyu Xu, Yingbin Liang:
When Will Generative Adversarial Imitation Learning Algorithms Attain Global Convergence. AISTATS 2021: 1117-1125
[c7]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/0002ZXL21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/0002ZXL21
Ziyi Chen, Yi Zhou, Tengyu Xu, Yingbin Liang:
Proximal Gradient Descent-Ascent: Variable Convergence under KŁ Geometry. ICLR 2021
[c6]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/XuLL21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/XuLL21
Tengyu Xu, Yingbin Liang, Guanghui Lan:
CRPO: A New Approach for Safe Reinforcement Learning with Convergence Guarantee. ICML 2021: 11480-11491
[c5]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/XuYWL21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/XuYWL21
Tengyu Xu, Zhuoran Yang, Zhaoran Wang, Yingbin Liang:
Doubly Robust Off-Policy Actor-Critic: Convergence and Optimality. ICML 2021: 11581-11591
[i16]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2102-04653
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2102-04653
Ziyi Chen, Yi Zhou, Tengyu Xu, Yingbin Liang:
Proximal Gradient Descent-Ascent: Variable Convergence under KŁ Geometry. CoRR abs/2102.04653 (2021)
[i15]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2102-11866
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2102-11866
Tengyu Xu, Zhuoran Yang, Zhaoran Wang, Yingbin Liang:
Doubly Robust Off-Policy Actor-Critic: Convergence and Optimality. CoRR abs/2102.11866 (2021)
[i14]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2107-02711
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2107-02711
Tengyu Xu, Zhuoran Yang, Zhaoran Wang, Yingbin Liang:
A Unified Off-Policy Evaluation Approach for General Value Function. CoRR abs/2107.02711 (2021)
[i13]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-06906
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-06906
Ziwei Guan, Tengyu Xu, Yingbin Liang:
PER-ETD: A Polynomially Efficient Emphatic Temporal Difference Learning Method. CoRR abs/2110.06906 (2021)
[i12]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-10351
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-10351
Tianjiao Li, Ziwei Guan, Shaofeng Zou, Tengyu Xu, Yingbin Liang, Guanghui Lan:
Faster Algorithm and Sharper Analysis for Constrained Markov Decision Process. CoRR abs/2110.10351 (2021)
2020
[c4]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/XuWZL20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/XuWZL20
Tengyu Xu, Zhe Wang, Yi Zhou, Yingbin Liang:
Reanalysis of Variance Reduced Temporal Difference Learning. ICLR 2020
[c3]
- view
  - electronic edition @ neurips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/XuWL20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/XuWL20
Tengyu Xu, Zhe Wang, Yingbin Liang:
Improving Sample Complexity Bounds for (Natural) Actor-Critic Algorithms. NeurIPS 2020
[i11]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2001-01898
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2001-01898
Tengyu Xu, Zhe Wang, Yi Zhou, Yingbin Liang:
Reanalysis of Variance Reduced Temporal Difference Learning. CoRR abs/2001.01898 (2020)
[i10]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2002-06286
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2002-06286
Huaqing Xiong, Tengyu Xu, Yingbin Liang, Wei Zhang:
Non-asymptotic Convergence of Adam-type Reinforcement Learning Algorithms under Markovian Sampling. CoRR abs/2002.06286 (2020)
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2004-12956
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2004-12956
Tengyu Xu, Zhe Wang, Yingbin Liang:
Improving Sample Complexity Bounds for Actor-Critic Algorithms. CoRR abs/2004.12956 (2020)
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2005-03557
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2005-03557
Tengyu Xu, Zhe Wang, Yingbin Liang:
Non-asymptotic Convergence Analysis of Two Time-scale (Natural) Actor-Critic Algorithms. CoRR abs/2005.03557 (2020)
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2006-09361
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2006-09361
Tengyu Xu, Zhe Wang, Yingbin Liang, H. Vincent Poor:
Enhanced First and Zeroth Order Variance Reduced Algorithms for Min-Max Optimization. CoRR abs/2006.09361 (2020)
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2006-13506
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2006-13506
Ziwei Guan, Tengyu Xu, Yingbin Liang:
When Will Generative Adversarial Imitation Learning Algorithms Attain Global Convergence. CoRR abs/2006.13506 (2020)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2011-05053
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2011-05053
Tengyu Xu, Yingbin Liang:
Sample Complexity Bounds for Two Timescale Value-based Reinforcement Learning Algorithms. CoRR abs/2011.05053 (2020)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2011-05869
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2011-05869
Tengyu Xu, Yingbin Liang, Guanghui Lan:
A Primal Approach to Constrained Policy Optimization: Global Optimality and Finite-Time Analysis. CoRR abs/2011.05869 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[c2]
- view
- export record
  dblp key:
  - conf/nips/ZouXL19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/ZouXL19
Shaofeng Zou, Tengyu Xu, Yingbin Liang:
Finite-Sample Analysis for SARSA with Linear Function Approximation. NeurIPS 2019: 8665-8675
[c1]
- view
- export record
  dblp key:
  - conf/nips/XuZL19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/XuZL19
Tengyu Xu, Shaofeng Zou, Yingbin Liang:
Two Time-scale Off-Policy TD Learning: Non-asymptotic Analysis over Markovian Samples. NeurIPS 2019: 10633-10643
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1902-02234
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1902-02234
Shaofeng Zou, Tengyu Xu, Yingbin Liang:
Finite-Sample Analysis for SARSA and Q-Learning with Linear Function Approximation. CoRR abs/1902.02234 (2019)
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1909-11907
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1909-11907
Tengyu Xu, Shaofeng Zou, Yingbin Liang:
Two Time-scale Off-Policy TD Learning: Non-asymptotic Analysis over Markovian Samples. CoRR abs/1909.11907 (2019)
2018
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1806-04339
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1806-04339
Tengyu Xu, Yi Zhou, Kaiyi Ji, Yingbin Liang:
Convergence of SGD in Learning ReLU Models with Separable Data. CoRR abs/1806.04339 (2018)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.