


Runzhe Wan
2020 – today
- 2024
- [c10] Yu Liu, Runzhe Wan, James McQueen, Doug Hains, Jinxiang Gu, Rui Song:
  Effect Size Estimation for Duration Recommendation in Online Experiments: Leveraging Hierarchical Models and Objective Utility Approaches. AAAI 2024: 14044-14051
- [c9] Jin Zhu, Runzhe Wan, Zhengling Qi, Shikai Luo, Chengchun Shi:
  Robust Offline Reinforcement Learning with Heavy-Tailed Rewards. AISTATS 2024: 541-549
- [i17] Yahui Bai, Yuhe Gao, Runzhe Wan, Sheng Zhang, Rui Song:
  A Review of Reinforcement Learning in Financial Applications. CoRR abs/2411.12746 (2024)
- 2023
- [c8] Runzhe Wan, Lin Ge, Rui Song:
  Towards Scalable and Robust Structured Bandits: A Meta-Learning Framework. AISTATS 2023: 1144-1173
- [c7] Runzhe Wan, Haoyu Wei, Branislav Kveton, Rui Song:
  Multiplier Bootstrap-based Exploration. ICML 2023: 35444-35490
- [c6] Runzhe Wan, Yu Liu, James McQueen, Doug Hains, Rui Song:
  Experimentation Platforms Meet Reinforcement Learning: Bayesian Sequential Decision-Making for Continuous Monitoring. KDD 2023: 5016-5027
- [i16] Xiaohong Chen, Zhengling Qi, Runzhe Wan:
  STEEL: Singularity-aware Reinforcement Learning. CoRR abs/2301.13152 (2023)
- [i15] Runzhe Wan, Haoyu Wei, Branislav Kveton, Rui Song:
  Multiplier Bootstrap-based Exploration. CoRR abs/2302.01543 (2023)
- [i14] Runzhe Wan, Yu Liu, James McQueen, Doug Hains, Rui Song:
  Experimentation Platforms Meet Reinforcement Learning: Bayesian Sequential Decision-Making for Continuous Monitoring. CoRR abs/2304.00420 (2023)
- [i13] Jin Zhu, Runzhe Wan, Zhengling Qi, Shikai Luo, Chengchun Shi:
  Robust Offline Policy Evaluation and Optimization with Heavy-Tailed Rewards. CoRR abs/2310.18715 (2023)
- [i12] Yu Liu, Runzhe Wan, James McQueen, Doug Hains, Jinxiang Gu, Rui Song:
  Effect Size Estimation for Duration Recommendation in Online Experiments: Leveraging Hierarchical Models and Objective Utility Approaches. CoRR abs/2312.12871 (2023)
- [i11] Haoyu Wei, Runzhe Wan, Lei Shi, Rui Song:
  Zero-Inflated Bandits. CoRR abs/2312.15595 (2023)
- 2022
- [c5] Runzhe Wan, Branislav Kveton, Rui Song:
  Safe Exploration for Efficient Policy Evaluation and Comparison. ICML 2022: 22491-22511
- [i10] Chengchun Shi, Runzhe Wan, Ge Song, Shikai Luo, Rui Song, Hongtu Zhu:
  A Multi-Agent Reinforcement Learning Framework for Off-Policy Evaluation in Two-sided Markets. CoRR abs/2202.10574 (2022)
- [i9] Runzhe Wan, Lin Ge, Rui Song:
  Towards Scalable and Robust Structured Bandits: A Meta-Learning Framework. CoRR abs/2202.13227 (2022)
- [i8] Runzhe Wan, Branislav Kveton, Rui Song:
  Safe Exploration for Efficient Policy Evaluation and Comparison. CoRR abs/2202.13234 (2022)
- [i7] Runzhe Wan, Yingying Li, Wenbin Lu, Rui Song:
  Mining the Factor Zoo: Estimation of Latent Factor Models with Sufficient Proxies. CoRR abs/2212.12845 (2022)
- [i6] Ye Shen, Runzhe Wan, Hengrui Cai, Rui Song:
  Heterogeneous Synthetic Learner for Panel Data. CoRR abs/2212.14580 (2022)
- 2021
- [c4] Chengchun Shi, Runzhe Wan, Victor Chernozhukov, Rui Song:
  Deeply-Debiased Off-Policy Interval Estimation. ICML 2021: 9580-9591
- [c3] Runzhe Wan, Xinyu Zhang, Rui Song:
  Multi-Objective Model-based Reinforcement Learning for Infectious Disease Control. KDD 2021: 1634-1644
- [c2] Runzhe Wan, Lin Ge, Rui Song:
  Metadata-based Multi-Task Bandits with Bayesian Hierarchical Models. NeurIPS 2021: 29655-29668
- [i5] Chengchun Shi, Runzhe Wan, Victor Chernozhukov, Rui Song:
  Deeply-Debiased Off-Policy Interval Estimation. CoRR abs/2105.04646 (2021)
- [i4] Runzhe Wan, Sheng Zhang, Chengchun Shi, Shikai Luo, Rui Song:
  Pattern Transfer Learning for Reinforcement Learning in Order Dispatching. CoRR abs/2105.13218 (2021)
- [i3] Runzhe Wan, Lin Ge, Rui Song:
  Metadata-based Multi-Task Bandits with Bayesian Hierarchical Models. CoRR abs/2108.06422 (2021)
- 2020
- [c1] Chengchun Shi, Runzhe Wan, Rui Song, Wenbin Lu, Ling Leng:
  Does the Markov Decision Process Fit the Data: Testing for the Markov Property in Sequential Decision Making. ICML 2020: 8807-8817
- [i2] Chengchun Shi, Runzhe Wan, Rui Song, Wenbin Lu, Ling Leng:
  Does the Markov Decision Process Fit the Data: Testing for the Markov Property in Sequential Decision Making. CoRR abs/2002.01751 (2020)
- [i1] Runzhe Wan, Xinyu Zhang, Rui Song:
  Multi-Objective Reinforcement Learning for Infectious Disease Control with Application to COVID-19 Spread. CoRR abs/2009.04607 (2020)
last updated on 2025-01-21 00:17 CET by the dblp team
all metadata released as open data under CC0 1.0 license