Kaitao Song
2020 – today
- 2024
- [j4] Cairong Zhao, Yubin Wang, Xinyang Jiang, Yifei Shen, Kaitao Song, Dongsheng Li, Duoqian Miao: Learning Domain Invariant Prompt for Vision-Language Models. IEEE Trans. Image Process. 33: 1348-1360 (2024)
- [c28] Meiqi Chen, Yubo Ma, Kaitao Song, Yixin Cao, Yan Zhang, Dongsheng Li: Improving Large Language Models in Event Relation Logical Prediction. ACL (1) 2024: 9451-9478
- [c27] Qingyan Guo, Rui Wang, Junliang Guo, Bei Li, Kaitao Song, Xu Tan, Guoqing Liu, Jiang Bian, Yujiu Yang: Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers. ICLR 2024
- [c26] Yichong Leng, Zhifang Guo, Kai Shen, Zeqian Ju, Xu Tan, Eric Liu, Yufei Liu, Dongchao Yang, Leying Zhang, Kaitao Song, Lei He, Xiangyang Li, Sheng Zhao, Tao Qin, Jiang Bian: PromptTTS 2: Describing and Generating Voices with Text Prompt. ICLR 2024
- [c25] Zeqian Ju, Yuancheng Wang, Kai Shen, Xu Tan, Detai Xin, Dongchao Yang, Eric Liu, Yichong Leng, Kaitao Song, Siliang Tang, Zhizheng Wu, Tao Qin, Xiangyang Li, Wei Ye, Shikun Zhang, Jiang Bian, Lei He, Jinyu Li, Sheng Zhao: NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models. ICML 2024
- [i37] Siyu Yuan, Kaitao Song, Jiangjie Chen, Xu Tan, Yongliang Shen, Kan Ren, Dongsheng Li, Deqing Yang: EASYTOOL: Enhancing LLM-based Agents with Concise Tool Instruction. CoRR abs/2401.06201 (2024)
- [i36] Yuqi Chen, Kan Ren, Kaitao Song, Yansen Wang, Yifan Wang, Dongsheng Li, Lili Qiu: EEGFormer: Towards Transferable and Interpretable Large-Scale EEG Foundation Model. CoRR abs/2401.10278 (2024)
- [i35] Zeqian Ju, Yuancheng Wang, Kai Shen, Xu Tan, Detai Xin, Dongchao Yang, Yanqing Liu, Yichong Leng, Kaitao Song, Siliang Tang, Zhizheng Wu, Tao Qin, Xiang-Yang Li, Wei Ye, Shikun Zhang, Jiang Bian, Lei He, Jinyu Li, Sheng Zhao: NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models. CoRR abs/2403.03100 (2024)
- [i34] Xixi Wu, Yifei Shen, Caihua Shan, Kaitao Song, Siwei Wang, Bohang Zhang, Jiarui Feng, Hong Cheng, Wei Chen, Yun Xiong, Dongsheng Li: Can Graph Learning Improve Task Planning? CoRR abs/2405.19119 (2024)
- [i33] Ping Yu, Kaitao Song, Fengchen He, Ming Chen, Jianfeng Lu: TCMD: A Traditional Chinese Medicine QA Dataset for Evaluating Large Language Models. CoRR abs/2406.04941 (2024)
- [i32] Siyu Yuan, Kaitao Song, Jiangjie Chen, Xu Tan, Dongsheng Li, Deqing Yang: EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms. CoRR abs/2406.14228 (2024)
- 2023
- [c24] Yichong Leng, Xu Tan, Wenjie Liu, Kaitao Song, Rui Wang, Xiang-Yang Li, Tao Qin, Edward Lin, Tie-Yan Liu: SoftCorrect: Error Correction with Soft Detection for Automatic Speech Recognition. AAAI 2023: 13034-13042
- [c23] Yongliang Shen, Kaitao Song, Xu Tan, Dongsheng Li, Weiming Lu, Yueting Zhuang: DiffusionNER: Boundary Diffusion for Named Entity Recognition. ACL (1) 2023: 3875-3890
- [c22] Yicheng Zou, Kaitao Song, Xu Tan, Zhongkai Fu, Qi Zhang, Dongsheng Li, Tao Gui: Towards Understanding Omission in Dialogue Summarization. ACL (1) 2023: 14268-14286
- [c21] Dingyao Yu, Kaitao Song, Peiling Lu, Tianyu He, Xu Tan, Wei Ye, Shikun Zhang, Jiang Bian: MusicAgent: An AI Agent for Music Understanding and Generation with Large Language Models. EMNLP (Demos) 2023: 246-255
- [c20] Jinchao Li, Kaitao Song, Junan Li, Bo Zheng, Dongsheng Li, Xixin Wu, Xunying Liu, Helen Meng: Leveraging Pretrained Representations With Task-Related Keywords for Alzheimer's Disease Detection. ICASSP 2023: 1-5
- [c19] Jinchao Li, Xixin Wu, Kaitao Song, Dongsheng Li, Xunying Liu, Helen Meng: A Hierarchical Regression Chain Framework for Affective Vocal Burst Recognition. ICASSP 2023: 1-5
- [c18] Yansen Wang, Xinyang Jiang, Kan Ren, Caihua Shan, Xufang Luo, Dongqi Han, Kaitao Song, Yifei Shen, Dongsheng Li: CircuitNet: A Generic Neural Network to Realize Universal Circuit Motif Modeling. ICML 2023: 35817-35835
- [c17] Yukang Liang, Kaitao Song, Shaoguang Mao, Huiqiang Jiang, Luna Qiu, Yuqing Yang, Dongsheng Li, Linli Xu, Lili Qiu: End-to-End Word-Level Pronunciation Assessment with MASK Pre-training. INTERSPEECH 2023: 969-973
- [c16] Yongliang Shen, Kaitao Song, Xu Tan, Dongsheng Li, Weiming Lu, Yueting Zhuang: HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in Hugging Face. NeurIPS 2023
- [i31] Jinchao Li, Kaitao Song, Junan Li, Bo Zheng, Dongsheng Li, Xixin Wu, Xunying Liu, Helen Meng: Leveraging Pretrained Representations with Task-related Keywords for Alzheimer's Disease Detection. CoRR abs/2303.08019 (2023)
- [i30] Jinchao Li, Xixin Wu, Kaitao Song, Dongsheng Li, Xunying Liu, Helen Meng: A Hierarchical Regression Chain Framework for Affective Vocal Burst Recognition. CoRR abs/2303.08027 (2023)
- [i29] Yongliang Shen, Kaitao Song, Xu Tan, Dongsheng Li, Weiming Lu, Yueting Zhuang: HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in HuggingFace. CoRR abs/2303.17580 (2023)
- [i28] Yongliang Shen, Kaitao Song, Xu Tan, Dongsheng Li, Weiming Lu, Yueting Zhuang: DiffusionNER: Boundary Diffusion for Named Entity Recognition. CoRR abs/2305.13298 (2023)
- [i27] Bei Li, Rui Wang, Junliang Guo, Kaitao Song, Xu Tan, Hany Hassan, Arul Menezes, Tong Xiao, Jiang Bian, JingBo Zhu: Deliberate then Generate: Enhanced Prompting Framework for Text Generation. CoRR abs/2305.19835 (2023)
- [i26] Yukang Liang, Kaitao Song, Shaoguang Mao, Huiqiang Jiang, Luna Qiu, Yuqing Yang, Dongsheng Li, Linli Xu, Lili Qiu: End-to-End Word-Level Pronunciation Assessment with MASK Pre-training. CoRR abs/2306.02682 (2023)
- [i25] Yichong Leng, Zhifang Guo, Kai Shen, Xu Tan, Zeqian Ju, Yanqing Liu, Yufei Liu, Dongchao Yang, Leying Zhang, Kaitao Song, Lei He, Xiang-Yang Li, Sheng Zhao, Tao Qin, Jiang Bian: PromptTTS 2: Describing and Generating Voices with Text Prompt. CoRR abs/2309.02285 (2023)
- [i24] Qingyan Guo, Rui Wang, Junliang Guo, Bei Li, Kaitao Song, Xu Tan, Guoqing Liu, Jiang Bian, Yujiu Yang: Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers. CoRR abs/2309.08532 (2023)
- [i23] Meiqi Chen, Yubo Ma, Kaitao Song, Yixin Cao, Yan Zhang, Dongsheng Li: Learning To Teach Large Language Models Logical Reasoning. CoRR abs/2310.09158 (2023)
- [i22] Dingyao Yu, Kaitao Song, Peiling Lu, Tianyu He, Xu Tan, Wei Ye, Shikun Zhang, Jiang Bian: MusicAgent: An AI Agent for Music Understanding and Generation with Large Language Models. CoRR abs/2310.11954 (2023)
- [i21] Yongliang Shen, Kaitao Song, Xu Tan, Wenqi Zhang, Kan Ren, Siyu Yuan, Weiming Lu, Dongsheng Li, Yueting Zhuang: TaskBench: Benchmarking Large Language Models for Task Automation. CoRR abs/2311.18760 (2023)
- 2022
- [j3] Wenhai Wang, Enze Xie, Xiang Li, Deng-Ping Fan, Kaitao Song, Ding Liang, Tong Lu, Ping Luo, Ling Shao: PVT v2: Improved baselines with Pyramid Vision Transformer. Comput. Vis. Media 8(3): 415-424 (2022)
- [c15] Guangyan Zhang, Yichong Leng, Daxin Tan, Ying Qin, Kaitao Song, Xu Tan, Sheng Zhao, Tan Lee: A Study on the Efficacy of Model Pre-Training In Developing Neural Text-to-Speech System. ICASSP 2022: 6087-6091
- [c14] Jin Xu, Xu Tan, Kaitao Song, Renqian Luo, Yichong Leng, Tao Qin, Tie-Yan Liu, Jian Li: Analyzing and Mitigating Interference in Neural Architecture Search. ICML 2022: 24646-24662
- [c13] Guangyan Zhang, Kaitao Song, Xu Tan, Daxin Tan, Yuzi Yan, Yanqing Liu, Gang Wang, Wei Zhou, Tao Qin, Tan Lee, Sheng Zhao: Mixed-Phoneme BERT: Improving BERT with Mixed Phoneme and Sup-Phoneme Representations for Text to Speech. INTERSPEECH 2022: 456-460
- [c12] Kaitao Song, Teng Wan, Bixia Wang, Huiqiang Jiang, Luna Qiu, Jiahang Xu, Liping Jiang, Qun Lou, Yuqing Yang, Dongsheng Li, Xudong Wang, Lili Qiu: Improving Hypernasality Estimation with Automatic Speech Recognition in Cleft Palate Speech. INTERSPEECH 2022: 4820-4824
- [c11] Kaitao Song, Yichong Leng, Xu Tan, Yicheng Zou, Tao Qin, Dongsheng Li: Transcormer: Transformer for Sentence Scoring with Sliding Language Modeling. NeurIPS 2022
- [i20] Guangyan Zhang, Kaitao Song, Xu Tan, Daxin Tan, Yuzi Yan, Yanqing Liu, Gang Wang, Wei Zhou, Tao Qin, Tan Lee, Sheng Zhao: Mixed-Phoneme BERT: Improving BERT with Mixed Phoneme and Sup-Phoneme Representations for Text to Speech. CoRR abs/2203.17190 (2022)
- [i19] Kaitao Song, Yichong Leng, Xu Tan, Yicheng Zou, Tao Qin, Dongsheng Li: Transcormer: Transformer for Sentence Scoring with Sliding Language Modeling. CoRR abs/2205.12986 (2022)
- [i18] Yezhen Wang, Tong Che, Bo Li, Kaitao Song, Hengzhi Pei, Yoshua Bengio, Dongsheng Li: Your Autoregressive Generative Model Can be Better If You Treat It as an Energy-Based One. CoRR abs/2206.12840 (2022)
- [i17] Yicheng Zou, Kaitao Song, Xu Tan, Zhongkai Fu, Tao Gui, Qi Zhang, Dongsheng Li: Towards Understanding Omission in Dialogue Summarization. CoRR abs/2211.07145 (2022)
- [i16] Yichong Leng, Xu Tan, Wenjie Liu, Kaitao Song, Rui Wang, Xiang-Yang Li, Tao Qin, Edward Lin, Tie-Yan Liu: SoftCorrect: Error Correction with Soft Detection for Automatic Speech Recognition. CoRR abs/2212.01039 (2022)
- [i15] Cairong Zhao, Yubin Wang, Xinyang Jiang, Yifei Shen, Kaitao Song, Dongsheng Li, Duoqian Miao: Learning Domain Invariant Prompt for Vision-Language Models. CoRR abs/2212.04196 (2022)
- 2021
- [j2] Kaitao Song, Qingkang Huang, Faen Zhang, Jianfeng Lu: Coarse-to-fine: A dual-view attention network for click-through rate prediction. Knowl. Based Syst. 216: 106767 (2021)
- [c10] Zhonghao Sheng, Kaitao Song, Xu Tan, Yi Ren, Wei Ye, Shikun Zhang, Tao Qin: SongMASS: Automatic Song Writing with Pre-training and Alignment Constraint. AAAI 2021: 13798-13805
- [c9] Lanqing Xue, Kaitao Song, Duocai Wu, Xu Tan, Nevin L. Zhang, Tao Qin, Wei-Qiang Zhang, Tie-Yan Liu: DeepRapper: Neural Rap Generation with Rhyme and Rhythm Modeling. ACL/IJCNLP (1) 2021: 69-81
- [c8] Wenhai Wang, Enze Xie, Xiang Li, Deng-Ping Fan, Kaitao Song, Ding Liang, Tong Lu, Ping Luo, Ling Shao: Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions. ICCV 2021: 548-558
- [c7] Jin Xu, Xu Tan, Renqian Luo, Kaitao Song, Jian Li, Tao Qin, Tie-Yan Liu: NAS-BERT: Task-Agnostic and Adaptive-Size BERT Compression with Neural Architecture Search. KDD 2021: 1933-1943
- [c6] Xuefei Liu, Kaitao Song, Jianfeng Lu: MPN: Multi-scale Progressive Restoration Network for Unsupervised Defect Detection. PRCV (2) 2021: 349-359
- [i14] Wenhai Wang, Enze Xie, Xiang Li, Deng-Ping Fan, Kaitao Song, Ding Liang, Tong Lu, Ping Luo, Ling Shao: Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions. CoRR abs/2102.12122 (2021)
- [i13] Jin Xu, Xu Tan, Renqian Luo, Kaitao Song, Jian Li, Tao Qin, Tie-Yan Liu: NAS-BERT: Task-Agnostic and Adaptive-Size BERT Compression with Neural Architecture Search. CoRR abs/2105.14444 (2021)
- [i12] Wenhai Wang, Enze Xie, Xiang Li, Deng-Ping Fan, Kaitao Song, Ding Liang, Tong Lu, Ping Luo, Ling Shao: PVTv2: Improved Baselines with Pyramid Vision Transformer. CoRR abs/2106.13797 (2021)
- [i11] Lanqing Xue, Kaitao Song, Duocai Wu, Xu Tan, Nevin L. Zhang, Tao Qin, Wei-Qiang Zhang, Tie-Yan Liu: DeepRapper: Neural Rap Generation with Rhyme and Rhythm Modeling. CoRR abs/2107.01875 (2021)
- [i10] Jin Xu, Xu Tan, Kaitao Song, Renqian Luo, Yichong Leng, Tao Qin, Tie-Yan Liu, Jian Li: Analyzing and Mitigating Interference in Neural Architecture Search. CoRR abs/2108.12821 (2021)
- [i9] Guangyan Zhang, Yichong Leng, Daxin Tan, Ying Qin, Kaitao Song, Xu Tan, Sheng Zhao, Tan Lee: A study on the efficacy of model pre-training in developing neural text-to-speech system. CoRR abs/2110.03857 (2021)
- 2020
- [j1] Kaitao Song, Xiu-Shen Wei, Xiangbo Shu, Ren-Jie Song, Jianfeng Lu: Bi-Modal Progressive Mask Attention for Fine-Grained Recognition. IEEE Trans. Image Process. 29: 7006-7018 (2020)
- [c5] Kaitao Song, Xu Tan, Jianfeng Lu: Neural Machine Translation with Error Correction. IJCAI 2020: 3891-3897
- [c4] Kaitao Song, Xu Tan, Tao Qin, Jianfeng Lu, Tie-Yan Liu: MPNet: Masked and Permuted Pre-training for Language Understanding. NeurIPS 2020
- [i8] Kaitao Song, Xu Tan, Tao Qin, Jianfeng Lu, Tie-Yan Liu: MPNet: Masked and Permuted Pre-training for Language Understanding. CoRR abs/2004.09297 (2020)
- [i7] Kaitao Song, Hao Sun, Xu Tan, Tao Qin, Jianfeng Lu, Hongzhi Liu, Tie-Yan Liu: LightPAFF: A Two-Stage Distillation Framework for Pre-training and Fine-tuning. CoRR abs/2004.12817 (2020)
- [i6] Kaitao Song, Xu Tan, Jianfeng Lu: Neural Machine Translation with Error Correction. CoRR abs/2007.10681 (2020)
- [i5] Zhonghao Sheng, Kaitao Song, Xu Tan, Yi Ren, Wei Ye, Shikun Zhang, Tao Qin: SongMASS: Automatic Song Writing with Pre-training and Alignment Constraint. CoRR abs/2012.05168 (2020)
2010 – 2019
- 2019
- [c3] Kaitao Song, Xu Tan, Tao Qin, Jianfeng Lu, Tie-Yan Liu: MASS: Masked Sequence to Sequence Pre-training for Language Generation. ICML 2019: 5926-5936
- [i4] Ping Yu, Kaitao Song, Jianfeng Lu: Generating Adversarial Examples With Conditional Generative Adversarial Net. CoRR abs/1903.07282 (2019)
- [i3] Kaitao Song, Xu Tan, Tao Qin, Jianfeng Lu, Tie-Yan Liu: MASS: Masked Sequence to Sequence Pre-training for Language Generation. CoRR abs/1905.02450 (2019)
- 2018
- [c2] Kaitao Song, Xu Tan, Di He, Jianfeng Lu, Tao Qin, Tie-Yan Liu: Double Path Networks for Sequence to Sequence Learning. COLING 2018: 3064-3074
- [c1] Ping Yu, Kaitao Song, Jianfeng Lu: Generating Adversarial Examples With Conditional Generative Adversarial Net. ICPR 2018: 676-681
- [i2] Kaitao Song, Xu Tan, Di He, Jianfeng Lu, Tao Qin, Tie-Yan Liu: Double Path Networks for Sequence to Sequence Learning. CoRR abs/1806.04856 (2018)
- [i1] Kaitao Song, Tan Xu, Furong Peng, Jianfeng Lu: Hybrid Self-Attention Network for Machine Translation. CoRR abs/1811.00253 (2018)
last updated on 2024-09-26 01:58 CEST by the dblp team
all metadata released as open data under CC0 1.0 license