default search action

combined dblp search
author search
venue search
publication search

ask others

Xiaoyu Chen 0008

> Home > Persons

Person information

affiliation: Peking University, Key Laboratory of Machine Perception, School of Intelligence Science and Technology, China

Other persons with the same name

see FAQ

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

Conference and Workshop Papers

see FAQ

What is the meaning of the colors in the publication lists?

2023
[c9]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/YeC0D23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/YeC0D23
Haotian Ye, Xiaoyu Chen, Liwei Wang, Simon Shaolei Du:
On the Power of Pre-training for Generalization in RL: Provable Benefits and Hardness. ICML 2023: 39770-39800
2022
[c8]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/ChenH0W22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/ChenH0W22
Xiaoyu Chen, Jiachen Hu, Lin Yang, Liwei Wang:
Near-Optimal Reward-Free Exploration for Linear Mixture MDPs with Plug-in Solver. ICLR 2022
[c7]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/ChenHJLW22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/ChenHJLW22
Xiaoyu Chen, Jiachen Hu, Chi Jin, Lihong Li, Liwei Wang:
Understanding Domain Randomization for Sim-to-real Transfer. ICLR 2022
[c6]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/ChenZYWW22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/ChenZYWW22
Xiaoyu Chen, Han Zhong, Zhuoran Yang, Zhaoran Wang, Liwei Wang:
Human-in-the-loop: Provably Efficient Preference-based Reinforcement Learning with General Function Approximation. ICML 2022: 3773-3793
2021
[c5]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/ChenHL021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/ChenHL021
Xiaoyu Chen, Jiachen Hu, Lihong Li, Liwei Wang:
Efficient Reinforcement Learning in Factored MDPs with Application to Constrained RL. ICLR 2021
[c4]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/HuCJL021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/HuCJL021
Jiachen Hu, Xiaoyu Chen, Chi Jin, Lihong Li, Liwei Wang:
Near-Optimal Representation Learning for Linear Bandits and Linear RL. ICML 2021: 4349-4358
2020
[c3]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/WangDCW20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/WangDCW20
Yuanhao Wang, Kefan Dong, Xiaoyu Chen, Liwei Wang:
Q-learning with UCB Exploration is Sample Efficient for Infinite-Horizon MDP. ICLR 2020
[c2]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/WangHCW20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/WangHCW20
Yuanhao Wang, Jiachen Hu, Xiaoyu Chen, Liwei Wang:
Distributed Bandit Learning: Near-Optimal Regret with Efficient Communication. ICLR 2020
[c1]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/ChenZZYCW20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/ChenZZYCW20
Xiaoyu Chen, Kai Zheng, Zixin Zhou, Yunchang Yang, Wei Chen, Liwei Wang:
(Locally) Differentially Private Combinatorial Semi-Bandits. ICML 2020: 1757-1767

Informal and Other Publications

see FAQ

What is the meaning of the colors in the publication lists?

2022
[i9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2205-11140
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2205-11140
Xiaoyu Chen, Han Zhong, Zhuoran Yang, Zhaoran Wang, Liwei Wang:
Human-in-the-loop: Provably Efficient Preference-based Reinforcement Learning with General Function Approximation. CoRR abs/2205.11140 (2022)
[i8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-10464
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-10464
Haotian Ye, Xiaoyu Chen, Liwei Wang, Simon S. Du:
On the Power of Pre-training for Generalization in RL: Provable Benefits and Hardness. CoRR abs/2210.10464 (2022)
2021
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2102-04132
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2102-04132
Jiachen Hu, Xiaoyu Chen, Chi Jin, Lihong Li, Liwei Wang:
Near-optimal Representation Learning for Linear Bandits and Linear RL. CoRR abs/2102.04132 (2021)
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-03239
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-03239
Xiaoyu Chen, Jiachen Hu, Chi Jin, Lihong Li, Liwei Wang:
Understanding Domain Randomization for Sim-to-real Transfer. CoRR abs/2110.03239 (2021)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-03244
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-03244
Xiaoyu Chen, Jiachen Hu, Lin F. Yang, Liwei Wang:
Near-Optimal Reward-Free Exploration for Linear Mixture MDPs with Plug-in Solver. CoRR abs/2110.03244 (2021)
2020
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2006-00706
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2006-00706
Xiaoyu Chen, Kai Zheng, Zixin Zhou, Yunchang Yang, Wei Chen, Liwei Wang:
(Locally) Differentially Private Combinatorial Semi-Bandits. CoRR abs/2006.00706 (2020)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2008-13319
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2008-13319
Xiaoyu Chen, Jiachen Hu, Lihong Li, Liwei Wang:
Efficient Reinforcement Learning in Factored MDPs with Application to Constrained RL. CoRR abs/2008.13319 (2020)
2019
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1901-09311
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1901-09311
Kefan Dong, Yuanhao Wang, Xiaoyu Chen, Liwei Wang:
Q-learning with UCB Exploration is Sample Efficient for Infinite-Horizon MDP. CoRR abs/1901.09311 (2019)
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1904-06309
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1904-06309
Yuanhao Wang, Jiachen Hu, Xiaoyu Chen, Liwei Wang:
Distributed Bandit Learning: How Much Communication is Needed to Achieve (Near) Optimal Regret. CoRR abs/1904.06309 (2019)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.