Chuheng Zhang

2020 – today

2024
[c20]
persistent URL: https://dblp.org/rec/conf/acl/ChoiBBJZSZ0K24
Yunseon Choi, Sangmin Bae, Seonghyun Ban, Minchan Jeong, Chuheng Zhang, Lei Song, Li Zhao, Jiang Bian, Kee-Eung Kim:
Hard Prompts Made Interpretable: Sparse Entropy Regularization for Prompt Tuning with RL. ACL (1) 2024: 8252-8271
[c19]
persistent URL: https://dblp.org/rec/conf/iclr/ZhangW0YWS024
Chuheng Zhang, Xiangsen Wang, Wei Jiang, Xianliang Yang, Siwei Wang, Lei Song, Jiang Bian:
Whittle Index with Multiple Actions and State Constraint for Inventory Management. ICLR 2024
[c18]
persistent URL: https://dblp.org/rec/conf/ijcai/Choi0ZS0K24
Yunseon Choi, Li Zhao, Chuheng Zhang, Lei Song, Jiang Bian, Kee-Eung Kim:
Diversification of Adaptive Policy for Effective Offline Reinforcement Learning. IJCAI 2024: 3863-3871
[i21]
persistent URL: https://dblp.org/rec/journals/corr/abs-2403-15834
Yiwen Chen, Yuyao Ye, Ziyi Chen, Chuheng Zhang, Marcelo H. Ang:
ARO: Large Language Model Supervised Robotics Text2Skill Autonomous Learning. CoRR abs/2403.15834 (2024)
[i20]
persistent URL: https://dblp.org/rec/journals/corr/abs-2404-11027
Guangran Cheng, Chuheng Zhang, Wenzhe Cai, Li Zhao, Changyin Sun, Jiang Bian:
Empowering Large Language Models on Robotic Manipulation with Affordance Prompting. CoRR abs/2404.11027 (2024)
[i19]
persistent URL: https://dblp.org/rec/journals/corr/abs-2407-14733
Yunseon Choi, Sangmin Bae, Seonghyun Ban, Minchan Jeong, Chuheng Zhang, Lei Song, Li Zhao, Jiang Bian, Kee-Eung Kim:
Hard Prompts Made Interpretable: Sparse Entropy Regularization for Prompt Tuning with RL. CoRR abs/2407.14733 (2024)
[i18]
persistent URL: https://dblp.org/rec/journals/corr/abs-2409-06957
Wei Shen, Chuheng Zhang:
Policy Filtration in RLHF to Fine-Tune LLM for Code Generation. CoRR abs/2409.06957 (2024)
2023
[c17]
persistent URL: https://dblp.org/rec/conf/aaai/CaiZSZRH23
Yuanying Cai, Chuheng Zhang, Wei Shen, Xuyun Zhang, Wenjie Ruan, Longbo Huang:
RePreM: Representation Pre-training with Masked Model for Reinforcement Learning. AAAI 2023: 6879-6887
[c16]
persistent URL: https://dblp.org/rec/conf/atal/CaiZZ0023
Yuanying Cai, Chuheng Zhang, Hanye Zhao, Li Zhao, Jiang Bian:
Curriculum Offline Reinforcement Learning. AAMAS 2023: 1221-1229
[c15]
persistent URL: https://dblp.org/rec/conf/icml/ZhangZZ0SZ023
Jinpeng Zhang, Yufeng Zheng, Chuheng Zhang, Li Zhao, Lei Song, Yuan Zhou, Jiang Bian:
Robust Situational Reinforcement Learning in Face of Context Disturbances. ICML 2023: 41973-41989
[c14]
persistent URL: https://dblp.org/rec/conf/ijcai/ZhangDCCLZ23
Chuheng Zhang, Yitong Duan, Xiaoyu Chen, Jianyu Chen, Jian Li, Li Zhao:
Towards Generalizable Reinforcement Learning for Trade Execution. IJCAI 2023: 4975-4983
[i17]
persistent URL: https://dblp.org/rec/journals/corr/abs-2303-01668
Yuanying Cai, Chuheng Zhang, Wei Shen, Xuyun Zhang, Wenjie Ruan, Longbo Huang:
RePreM: Representation Pre-training with Masked Model for Reinforcement Learning. CoRR abs/2303.01668 (2023)
[i16]
persistent URL: https://dblp.org/rec/journals/corr/abs-2306-07542
Xianliang Yang, Zhihao Liu, Wei Jiang, Chuheng Zhang, Li Zhao, Lei Song, Jiang Bian:
A Versatile Multi-Agent Reinforcement Learning Benchmark for Inventory Management. CoRR abs/2306.07542 (2023)
[i15]
persistent URL: https://dblp.org/rec/journals/corr/abs-2307-11685
Chuheng Zhang, Yitong Duan, Xiaoyu Chen, Jianyu Chen, Jian Li, Li Zhao:
Towards Generalizable Reinforcement Learning for Trade Execution. CoRR abs/2307.11685 (2023)
[i14]
persistent URL: https://dblp.org/rec/journals/corr/abs-2308-03028
Lei Song, Chuheng Zhang, Li Zhao, Jiang Bian:
Pre-Trained Large Language Models for Industrial Control. CoRR abs/2308.03028 (2023)
2022
[c13]
persistent URL: https://dblp.org/rec/conf/cikm/CaiZSHZH22
Yuanying Cai, Chuheng Zhang, Wei Shen, Xiaonan He, Xuyun Zhang, Longbo Huang:
Imitation Learning to Outperform Demonstrators by Directly Extrapolating Demonstrations. CIKM 2022: 128-137
[c12]
persistent URL: https://dblp.org/rec/conf/cikm/ShenHZZX22
Wei Shen, Xiaonan He, Chuheng Zhang, Xuyun Zhang, Jian Xie:
A Transformer-Based User Satisfaction Prediction for Proactive Interaction Mechanism in DuerOS. CIKM 2022: 1777-1786
[c11]
persistent URL: https://dblp.org/rec/conf/cikm/WangLSWZ0WW22
Ze Wang, Guogang Liao, Xiaowen Shi, Xiaoxu Wu, Chuheng Zhang, Yongkang Wang, Xingxing Wang, Dong Wang:
Learning List-wise Representation in Reinforcement Learning for Ads Allocation with Multiple Auxiliary Tasks. CIKM 2022: 3555-3564
[c10]
persistent URL: https://dblp.org/rec/conf/cikm/WangLSWZZWWW22
Ze Wang, Guogang Liao, Xiaowen Shi, Xiaoxu Wu, Chuheng Zhang, Bingqi Zhu, Yongkang Wang, Xingxing Wang, Dong Wang:
Hybrid Transfer in Deep Reinforcement Learning for Ads Allocation. CIKM 2022: 4560-4564
[c9]
persistent URL: https://dblp.org/rec/conf/icdm/CaiZZSZS0QL22
Yuanying Cai, Chuheng Zhang, Li Zhao, Wei Shen, Xuyun Zhang, Lei Song, Jiang Bian, Tao Qin, Tieyan Liu:
TD3 with Reverse KL Regularizer for Offline Reinforcement Learning from Mixed Datasets. ICDM 2022: 21-30
[c8]
persistent URL: https://dblp.org/rec/conf/sigir/LiaoSWWZ0WW22
Guogang Liao, Xiaowen Shi, Ze Wang, Xiaoxu Wu, Chuheng Zhang, Yongkang Wang, Xingxing Wang, Dong Wang:
Deep Page-Level Interest Network in Reinforcement Learning for Ads Allocation. SIGIR 2022: 2292-2296
[c7]
persistent URL: https://dblp.org/rec/conf/www/LiaoWWSZ0WW22
Guogang Liao, Ze Wang, Xiaoxu Wu, Xiaowen Shi, Chuheng Zhang, Yongkang Wang, Xingxing Wang, Dong Wang:
Cross DQN: Cross Deep Q Network for Ads Allocation in Feed. WWW 2022: 401-409
[i13]
persistent URL: https://dblp.org/rec/journals/corr/abs-2204-00377
Guogang Liao, Xiaowen Shi, Ze Wang, Xiaoxu Wu, Chuheng Zhang, Yongkang Wang, Xingxing Wang, Dong Wang:
Deep Page-Level Interest Network in Reinforcement Learning for Ads Allocation. CoRR abs/2204.00377 (2022)
[i12]
persistent URL: https://dblp.org/rec/journals/corr/abs-2204-00888
Guogang Liao, Ze Wang, Xiaowen Shi, Xiaoxu Wu, Chuheng Zhang, Yongkang Wang, Xingxing Wang, Dong Wang:
Learning List-wise Representation in Reinforcement Learning for Ads Allocation with Multiple Auxiliary Tasks. CoRR abs/2204.00888 (2022)
[i11]
persistent URL: https://dblp.org/rec/journals/corr/abs-2204-11589
Guogang Liao, Ze Wang, Xiaowen Shi, Xiaoxu Wu, Chuheng Zhang, Bingqi Zhu, Yongkang Wang, Xingxing Wang, Dong Wang:
Hybrid Transfer in Deep Reinforcement Learning for Ads Allocation. CoRR abs/2204.11589 (2022)
[i10]
persistent URL: https://dblp.org/rec/journals/corr/abs-2212-02125
Yuanying Cai, Chuheng Zhang, Li Zhao, Wei Shen, Xuyun Zhang, Lei Song, Jiang Bian, Tao Qin, Tieyan Liu:
TD3 with Reverse KL Regularizer for Offline Reinforcement Learning from Mixed Datasets. CoRR abs/2212.02125 (2022)
[i9]
persistent URL: https://dblp.org/rec/journals/corr/abs-2212-03817
Wei Shen, Xiaonan He, Chuheng Zhang, Xuyun Zhang, Jian Xie:
A Transformer-Based User Satisfaction Prediction for Proactive Interaction Mechanism in DuerOS. CoRR abs/2212.03817 (2022)
[i8]
persistent URL: https://dblp.org/rec/journals/corr/abs-2212-07684
Yuandong Ding, Mingxiao Feng, Guozi Liu, Wei Jiang, Chuheng Zhang, Li Zhao, Lei Song, Houqiang Li, Yan Jin, Jiang Bian:
Multi-Agent Reinforcement Learning with Shared Resources for Inventory Management. CoRR abs/2212.07684 (2022)
2021
[c6]
persistent URL: https://dblp.org/rec/conf/aaai/ZhangCHL21
Chuheng Zhang, Yuanying Cai, Longbo Huang, Jian Li:
Exploration by Maximizing Renyi Entropy for Reward-Free RL Framework. AAAI 2021: 10859-10867
[c5]
persistent URL: https://dblp.org/rec/conf/cikm/0005ZTZHD021
Wei Shen, Chuheng Zhang, Yun Tian, Liang Zeng, Xiaonan He, Wanchun Dou, Xiaolong Xu:
Inductive Matrix Completion Using Graph Autoencoder. CIKM 2021: 1609-1618
[c4]
persistent URL: https://dblp.org/rec/conf/iclr/LiuZZQZLYL21
Guoqing Liu, Chuheng Zhang, Li Zhao, Tao Qin, Jinhua Zhu, Jian Li, Nenghai Yu, Tie-Yan Liu:
Return-Based Contrastive Representation Learning for Reinforcement Learning. ICLR 2021
[i7]
persistent URL: https://dblp.org/rec/journals/corr/abs-2102-10960
Guoqing Liu, Chuheng Zhang, Li Zhao, Tao Qin, Jinhua Zhu, Jian Li, Nenghai Yu, Tie-Yan Liu:
Return-Based Contrastive Representation Learning for Reinforcement Learning. CoRR abs/2102.10960 (2021)
[i6]
persistent URL: https://dblp.org/rec/journals/corr/abs-2108-11124
Wei Shen, Chuheng Zhang, Yun Tian, Liang Zeng, Xiaonan He, Wanchun Dou, Xiaolong Xu:
Inductive Matrix Completion Using Graph Autoencoder. CoRR abs/2108.11124 (2021)
[i5]
persistent URL: https://dblp.org/rec/journals/corr/abs-2109-04353
Guogang Liao, Ze Wang, Xiaoxu Wu, Xiaowen Shi, Chuheng Zhang, Yongkang Wang, Xingxing Wang, Dong Wang:
Cross DQN: Cross Deep Q Network for Ads Allocation in Feed. CoRR abs/2109.04353 (2021)
2020
[c3]
persistent URL: https://dblp.org/rec/conf/aaai/ZhangL020
Chuheng Zhang, Yuanqi Li, Jian Li:
Policy Search by Target Distribution Learning for Continuous Control. AAAI 2020: 6770-6777
[c2]
persistent URL: https://dblp.org/rec/conf/cikm/ShenHZNDW20
Wei Shen, Xiaonan He, Chuheng Zhang, Qiang Ni, Wanchun Dou, Yan Wang:
Auxiliary-task Based Deep Reinforcement Learning for Participant Selection Problem in Mobile Crowdsourcing. CIKM 2020: 1355-1364
[c1]
persistent URL: https://dblp.org/rec/conf/icdm/ZhangLCJTL20
Chuheng Zhang, Yuanqi Li, Xi Chen, Yifei Jin, Pingzhong Tang, Jian Li:
DoubleEnsemble: A New Ensemble Method Based on Sample Reweighting and Feature Selection for Financial Data Analysis. ICDM 2020: 781-790
[i4]
persistent URL: https://dblp.org/rec/journals/corr/abs-2006-06193
Chuheng Zhang, Yuanying Cai, Longbo Huang, Jian Li:
Exploration by Maximizing Rényi Entropy for Zero-Shot Meta RL. CoRR abs/2006.06193 (2020)
[i3]
persistent URL: https://dblp.org/rec/journals/corr/abs-2008-11087
Wei Shen, Xiaonan He, Chuheng Zhang, Qiang Ni, Wanchun Dou, Yan Wang:
Auxiliary-task Based Deep Reinforcement Learning for Participant Selection Problem in Mobile Crowdsourcing. CoRR abs/2008.11087 (2020)
[i2]
persistent URL: https://dblp.org/rec/journals/corr/abs-2010-01265
Chuheng Zhang, Yuanqi Li, Xi Chen, Yifei Jin, Pingzhong Tang, Jian Li:
DoubleEnsemble: A New Ensemble Method Based on Sample Reweighting and Feature Selection for Financial Data Analysis. CoRR abs/2010.01265 (2020)

2010 – 2019

2019
[i1]
persistent URL: https://dblp.org/rec/journals/corr/abs-1905-11041
Chuheng Zhang, Yuanqi Li, Jian Li:
Policy Search by Target Distribution Learning for Continuous Control. CoRR abs/1905.11041 (2019)
