default search action

combined dblp search
author search
venue search
publication search

ask others

Shentao Yang

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2025
[i9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2501-02790
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2501-02790
Yueqin Yin, Shentao Yang, Yujia Xie, Ziyi Yang, Yuting Sun, Hany Hassan Awadalla, Weizhu Chen, Mingyuan Zhou:
Segmenting Text and Learning Their Rewards for Improved RLHF in Language Model. CoRR abs/2501.02790 (2025)
2024
[j1]
- no documents available
  - details & citations
- export record
  dblp key:
  - conf/rlc/ChitnisYG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/rlc/ChitnisYG24
Rohan Chitnis, Shentao Yang, Alborz Geramifard:
Sequential Decision-Making for Inline Text Autocomplete. RLJ 2: 946-960 (2024)
[c5]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/YangCZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/YangCZ24
Shentao Yang, Tianqi Chen, Mingyuan Zhou:
A Dense Reward View on Aligning Text-to-Image Diffusion with Preference. ICML 2024
[i8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-08265
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-08265
Shentao Yang, Tianqi Chen, Mingyuan Zhou:
A Dense Reward View on Aligning Text-to-Image Diffusion with Preference. CoRR abs/2402.08265 (2024)
[i7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2403-15502
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-15502
Rohan Chitnis, Shentao Yang, Alborz Geramifard:
Sequential Decision-Making for Inline Text Autocomplete. CoRR abs/2403.15502 (2024)
[i6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2408-07759
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2408-07759
Shentao Yang, Haichuan Yang, Linna Du, Adithya Ganesh, Bo Peng, Boying Liu, Serena Li, Ji Liu:
SWaT: Statistical Modeling of Video Watch Time through User Behavior Analysis. CoRR abs/2408.07759 (2024)
2023
[c4]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/FengYZ0XZW23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/FengYZ0XZW23
Yihao Feng, Shentao Yang, Shujian Zhang, Jianguo Zhang, Caiming Xiong, Mingyuan Zhou, Huan Wang:
Fantastic Rewards and How to Tame Them: A Case Study on Reward Learning for Task-oriented Dialogue Systems. ICLR 2023
[c3]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/YangZXFXZ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/YangZXFXZ23
Shentao Yang, Shujian Zhang, Congying Xia, Yihao Feng, Caiming Xiong, Mingyuan Zhou:
Preference-grounded Token-level Guidance for Language Model Fine-tuning. NeurIPS 2023
[i5]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-10342
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-10342
Yihao Feng, Shentao Yang, Shujian Zhang, Jianguo Zhang, Caiming Xiong, Mingyuan Zhou, Huan Wang:
Fantastic Rewards and How to Tame Them: A Case Study on Reward Learning for Task-oriented Dialogue Systems. CoRR abs/2302.10342 (2023)
[i4]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-00398
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-00398
Shentao Yang, Shujian Zhang, Congying Xia, Yihao Feng, Caiming Xiong, Mingyuan Zhou:
Preference-grounded Token-level Guidance for Language Model Fine-tuning. CoRR abs/2306.00398 (2023)
2022
[c2]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/YangFZZ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/YangFZZ22
Shentao Yang, Yihao Feng, Shujian Zhang, Mingyuan Zhou:
Regularizing a Model-based Policy Stationary Distribution to Stabilize Offline Reinforcement Learning. ICML 2022: 24980-25006
[c1]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/YangZFZ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/YangZFZ22
Shentao Yang, Shujian Zhang, Yihao Feng, Mingyuan Zhou:
A Unified Framework for Alternating Offline Model Training and Policy Learning. NeurIPS 2022
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2202-09673
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-09673
Shentao Yang, Zhendong Wang, Huangjie Zheng, Yihao Feng, Mingyuan Zhou:
A Regularized Implicit Policy for Offline Reinforcement Learning. CoRR abs/2202.09673 (2022)
[i2]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2206-07166
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-07166
Shentao Yang, Yihao Feng, Shujian Zhang, Mingyuan Zhou:
Regularizing a Model-based Policy Stationary Distribution to Stabilize Offline Reinforcement Learning. CoRR abs/2206.07166 (2022)
[i1]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-05922
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-05922
Shentao Yang, Shujian Zhang, Yihao Feng, Mingyuan Zhou:
A Unified Framework for Alternating Offline Model Training and Policy Learning. CoRR abs/2210.05922 (2022)

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.