default search action

combined dblp search
author search
venue search
publication search

ask others

Siwei Wang 0002

> Home > Persons

Person information

affiliation: Tsinghua University, Beijing, China

Other persons with the same name

see FAQ

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2025
[i20]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2501-19300
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2501-19300
Xutong Liu, Xiangxiang Dai, Jinhang Zuo, Siwei Wang, Carlee Joe-Wong, John C. S. Lui, Wei Chen:
Offline Learning for Combinatorial Multi-armed Bandits. CoRR abs/2501.19300 (2025)
2024
[c18]
- view
  authority control:
- export record
  dblp key:
  - conf/atal/Shao0F24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/atal/Shao0F24
Junning Shao, Siwei Wang, Zhixuan Fang:
Balanced and Incentivized Learning with Limited Shared Information in Multi-agent Multi-armed Bandit. AAMAS 2024: 2459-2461
[c17]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/ChenDH0WH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/ChenDH0WH24
Yu Chen, Yihan Du, Pihe Hu, Siwei Wang, Desheng Wu, Longbo Huang:
Provably Efficient Iterated CVaR Reinforcement Learning with Function Approximation and Human Feedback. ICLR 2024
[c16]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/00020ZZWW0HL024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/00020ZZWW0HL024
Xutong Liu, Siwei Wang, Jinhang Zuo, Han Zhong, Xuchuang Wang, Zhiyong Wang, Shuai Li, Mohammad Hajiesmaili, John C. S. Lui, Wei Chen:
Combinatorial Multivariant Multi-Armed Bandits with Applications to Episodic Reinforcement Learning and Beyond. ICML 2024
[c15]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/ChenZ0H24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/ChenZ0H24
Yu Chen, Xiangcheng Zhang, Siwei Wang, Longbo Huang:
Provable Risk-Sensitive Distributional Reinforcement Learning with General Function Approximation. ICML 2024
[c14]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/0002SFST024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/0002SFST024
Siwei Wang, Yifei Shen, Shi Feng, Haoran Sun, Shang-Hua Teng, Wei Chen:
ALPINE: Unveiling The Planning Capability of Autoregressive Learning in Language Models. NeurIPS 2024
[i19]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-18159
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-18159
Yu Chen, Xiangcheng Zhang, Siwei Wang, Longbo Huang:
Provable Risk-Sensitive Distributional Reinforcement Learning with General Function Approximation. CoRR abs/2402.18159 (2024)
[i18]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-09220
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-09220
Siwei Wang, Yifei Shen, Shi Feng, Haoran Sun, Shang-Hua Teng, Wei Chen:
ALPINE: Unveiling the Planning Capability of Autoregressive Learning in Language Models. CoRR abs/2405.09220 (2024)
[i17]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-16276
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-16276
Haoran Sun, Yurong Chen, Siwei Wang, Wei Chen, Xiaotie Deng:
Mechanism Design for LLM Fine-tuning with Multiple Reward Models. CoRR abs/2405.16276 (2024)
[i16]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-01386
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-01386
Xutong Liu, Siwei Wang, Jinhang Zuo, Han Zhong, Xuchuang Wang, Zhiyong Wang, Shuai Li, Mohammad Hajiesmaili, John C. S. Lui, Wei Chen:
Combinatorial Multivariant Multi-Armed Bandits with Applications to Episodic Reinforcement Learning and Beyond. CoRR abs/2406.01386 (2024)
[i15]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2412-00798
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2412-00798
Seockbean Song, Youngsik Yoon, Siwei Wang, Wei Chen, Jungseul Ok:
Combinatorial Rising Bandit. CoRR abs/2412.00798 (2024)
2023
[c13]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/DuWH23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/DuWH23
Yihan Du, Siwei Wang, Longbo Huang:
Provably Efficient Risk-Sensitive Reinforcement Learning: Iterated CVaR and Worst Path. ICLR 2023
[c12]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/0002ZWLHW023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/0002ZWLHW023
Xutong Liu, Jinhang Zuo, Siwei Wang, John C. S. Lui, Mohammad Hajiesmaili, Adam Wierman, Wei Chen:
Contextual Combinatorial Bandits with Probabilistically Triggered Arms. ICML 2023: 22559-22593
[i14]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2303-17110
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-17110
Xutong Liu, Jinhang Zuo, Siwei Wang, John C. S. Lui, Mohammad H. Hajiesmaili, Adam Wierman, Wei Chen:
Contextual Combinatorial Bandits with Probabilistically Triggered Arms. CoRR abs/2303.17110 (2023)
[i13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-13673
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-13673
Jing Dong, Jingyu Wu, Siwei Wang, Baoxiang Wang, Wei Chen:
Taming the Exponential Action Set: Sublinear Regret and Fast Convergence to Nash Equilibrium in Online Congestion Games. CoRR abs/2306.13673 (2023)
2022
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/ml/WangC22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ml/WangC22
Siwei Wang, Wei Chen:
The pure exploration problem with general reward functions depending on full distributions. Mach. Learn. 111(9): 3279-3306 (2022)
[c11]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/WangZ22b
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/WangZ22b
Siwei Wang, Jun Zhu:
Thompson Sampling for (Combinatorial) Pure Exploration. ICML 2022: 23470-23483
[c10]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/LiuXWF22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/LiuXWF22
Qingsong Liu, Weihang Xu, Siwei Wang, Zhixuan Fang:
Combinatorial Bandits with Linear Constraints: Beyond Knapsacks and Fairness. NeurIPS 2022
[c9]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/LiuZWJLC22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/LiuZWJLC22
Xutong Liu, Jinhang Zuo, Siwei Wang, Carlee Joe-Wong, John C. S. Lui, Wei Chen:
Batch-Size Independent Regret Bounds for Combinatorial Semi-Bandits with Probabilistically Triggered Arms or Independent Arms. NeurIPS 2022
[c8]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/ZhangWF22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/ZhangWF22
Yirui Zhang, Siwei Wang, Zhixuan Fang:
Matching in Multi-arm Bandit with Collision. NeurIPS 2022
[i12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2206-02678
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-02678
Yihan Du, Siwei Wang, Longbo Huang:
Risk-Sensitive Reinforcement Learning: Iterated CVaR and the Worst Path. CoRR abs/2206.02678 (2022)
[i11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2206-09150
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-09150
Siwei Wang, Jun Zhu:
Thompson Sampling for (Combinatorial) Pure Exploration. CoRR abs/2206.09150 (2022)
[i10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2208-05622
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2208-05622
Qihan Guo, Siwei Wang, Jun Zhu:
Regret Analysis for Hierarchical Experts Bandit Problem. CoRR abs/2208.05622 (2022)
[i9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2208-14837
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2208-14837
Xutong Liu, Jinhang Zuo, Siwei Wang, Carlee Joe-Wong, John C. S. Lui, Wei Chen:
Batch-Size Independent Regret Bounds for Combinatorial Semi-Bandits with Probabilistically Triggered Arms or Independent Arms. CoRR abs/2208.14837 (2022)
[i8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-10293
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-10293
Yihan Du, Siwei Wang, Longbo Huang:
Dueling Bandits: From Two-dueling to Multi-dueling. CoRR abs/2211.10293 (2022)
2021
[c7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/DuWH21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/DuWH21
Yihan Du, Siwei Wang, Longbo Huang:
A One-Size-Fits-All Solution to Conservative Bandit Problems. AAAI 2021: 7254-7261
[c6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/WangWH21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/WangWH21
Siwei Wang, Haoyun Wang, Longbo Huang:
Adaptive Algorithms for Multi-armed Bandit with Composite and Anonymous Feedback. AAAI 2021: 10210-10217
[c5]
- view
  - electronic edition @ neurips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/DuWFH21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/DuWFH21
Yihan Du, Siwei Wang, Zhixuan Fang, Longbo Huang:
Continuous Mean-Covariance Bandits. NeurIPS 2021: 875-886
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2102-12090
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2102-12090
Yihan Du, Siwei Wang, Zhixuan Fang, Longbo Huang:
Continuous Mean-Covariance Bandits. CoRR abs/2102.12090 (2021)
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2105-03598
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2105-03598
Siwei Wang, Wei Chen:
Pure Exploration Bandit Problem with General Reward Functions Depending on Full Distributions. CoRR abs/2105.03598 (2021)
2020
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/atal/DuWH20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/atal/DuWH20
Yihan Du, Siwei Wang, Longbo Huang:
Dueling Bandits: From Two-dueling to Multi-dueling. AAMAS 2020: 348-356
[c3]
- view
  - electronic edition @ neurips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/WangHL20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/WangHL20
Siwei Wang, Longbo Huang, John C. S. Lui:
Restless-UCB, an Efficient and Low-complexity Algorithm for Online Restless Bandits. NeurIPS 2020
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2011-02664
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2011-02664
Siwei Wang, Longbo Huang, John C. S. Lui:
Restless-UCB, an Efficient and Low-complexity Algorithm for Online Restless Bandits. CoRR abs/2011.02664 (2020)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2012-07048
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2012-07048
Siwei Wang, Haoyun Wang, Longbo Huang:
Adaptive Algorithms for Multi-armed Bandit with Composite and Anonymous Feedback. CoRR abs/2012.07048 (2020)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2012-07341
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2012-07341
Yihan Du, Siwei Wang, Longbo Huang:
A One-Size-Fits-All Solution to Conservative Bandit Problems. CoRR abs/2012.07341 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2018
[c2]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/WangC18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/WangC18
Siwei Wang, Wei Chen:
Thompson Sampling for Combinatorial Semi-Bandits. ICML 2018: 5101-5109
[c1]
- view
- export record
  dblp key:
  - conf/nips/WangH18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/WangH18
Siwei Wang, Longbo Huang:
Multi-armed Bandits with Compensation. NeurIPS 2018: 5119-5128
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1803-04623
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1803-04623
Siwei Wang, Wei Chen:
Thompson Sampling for Combinatorial Semi-Bandits. CoRR abs/1803.04623 (2018)
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1811-01715
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1811-01715
Siwei Wang, Longbo Huang:
Multi-armed Bandits with Compensation. CoRR abs/1811.01715 (2018)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.