


default search action
Daxin Tan
Person information
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [i13]Dehua Tao, Daxin Tan, Yu Ting Yeung, Xiao Chen, Tan Lee:
ToneUnit: A Speech Discretization Approach for Tonal Language Speech Synthesis. CoRR abs/2406.08989 (2024) - [i12]Mingyu Cui, Daxin Tan, Yifan Yang, Dingdong Wang, Huimeng Wang, Xiao Chen, Xie Chen, Xunying Liu:
Exploring SSL Discrete Tokens for Multilingual ASR. CoRR abs/2409.08805 (2024) - [i11]Jing Xu, Daxin Tan, Jiaqi Wang, Xiao Chen:
Enhancing Multilingual Speech Generation and Recognition Abilities in LLMs with Constructed Code-switched Data. CoRR abs/2409.10969 (2024) - [i10]Kai Chen, Yunhao Gou, Runhui Huang, Zhili Liu, Daxin Tan, Jing Xu, Chunwei Wang, Yi Zhu, Yihan Zeng, Kuo Yang, Dingdong Wang, Kun Xiang, Haoyuan Li, Haoli Bai, Jianhua Han, Xiaohui Li, Weike Jin, Nian Xie, Yu Zhang, James T. Kwok, Hengshuang Zhao, Xiaodan Liang, Dit-Yan Yeung, Xiao Chen, Zhenguo Li, Wei Zhang, Qun Liu, Jun Yao, Lanqing Hong, Lu Hou, Hang Xu:
EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions. CoRR abs/2409.18042 (2024) - 2022
- [c7]Guangyan Zhang, Yichong Leng, Daxin Tan, Ying Qin, Kaitao Song, Xu Tan, Sheng Zhao, Tan Lee
:
A Study on the Efficacy of Model Pre-Training In Developing Neural Text-to-Speech System. ICASSP 2022: 6087-6091 - [c6]Guangyan Zhang, Kaitao Song, Xu Tan, Daxin Tan, Yuzi Yan, Yanqing Liu, Gang Wang, Wei Zhou, Tao Qin, Tan Lee
, Sheng Zhao:
Mixed-Phoneme BERT: Improving BERT with Mixed Phoneme and Sup-Phoneme Representations for Text to Speech. INTERSPEECH 2022: 456-460 - [c5]Daxin Tan, Guangyan Zhang, Tan Lee
:
Environment Aware Text-to-Speech Synthesis. INTERSPEECH 2022: 481-485 - [c4]Daxin Tan, Liqun Deng, Nianzu Zheng, Yu Ting Yeung, Xin Jiang, Xiao Chen, Tan Lee
:
CorrectSpeech: A Fully Automated System for Speech Correction and Accent Reduction. ISCSLP 2022: 81-85 - [i9]Guangyan Zhang, Kaitao Song, Xu Tan, Daxin Tan, Yuzi Yan, Yanqing Liu, Gang Wang, Wei Zhou, Tao Qin, Tan Lee
, Sheng Zhao:
Mixed-Phoneme BERT: Improving BERT with Mixed Phoneme and Sup-Phoneme Representations for Text to Speech. CoRR abs/2203.17190 (2022) - [i8]Daxin Tan, Liqun Deng, Nianzu Zheng, Yu Ting Yeung, Xin Jiang, Xiao Chen, Tan Lee
:
CorrectSpeech: A Fully Automated System for Speech Correction and Accent Reduction. CoRR abs/2204.05460 (2022) - [i7]Daxin Tan, Nikos Kargas, David McHardy, Constantinos Papayiannis, Antonio Bonafonte, Marek Strelec, Jonas Rohnke, Agis Oikonomou-Filandras, Trevor Wood:
Analysis and Utilization of Entrainment on Acoustic and Emotion Features in User-agent Dialogue. CoRR abs/2212.03398 (2022) - 2021
- [c3]Daxin Tan, Liqun Deng, Yu Ting Yeung, Xin Jiang, Xiao Chen, Tan Lee
:
EditSpeech: A Text Based Speech Editing System Using Partial Inference and Bidirectional Fusion. ASRU 2021: 626-633 - [c2]Guangyan Zhang, Ying Qin, Daxin Tan, Tan Lee
:
Applying the Information Bottleneck Principle to Prosodic Representation Learning. Interspeech 2021: 3156-3160 - [c1]Daxin Tan, Tan Lee
:
Fine-Grained Style Modeling, Transfer and Prediction in Text-to-Speech Synthesis via Phone-Level Content-Style Disentanglement. Interspeech 2021: 4683-4687 - [i6]Daxin Tan, Hing-Pang Huang, Guangyan Zhang, Tan Lee:
CUHK-EE voice cloning system for ICASSP 2021 M2VoC challenge. CoRR abs/2103.04699 (2021) - [i5]Daxin Tan, Liqun Deng, Yu Ting Yeung, Xin Jiang, Xiao Chen, Tan Lee:
EditSpeech: A Text Based Speech Editing System Using Partial Inference and Bidirectional Fusion. CoRR abs/2107.01554 (2021) - [i4]Guangyan Zhang, Ying Qin, Daxin Tan, Tan Lee:
Applying the Information Bottleneck Principle to Prosodic Representation Learning. CoRR abs/2108.02821 (2021) - [i3]Guangyan Zhang, Yichong Leng, Daxin Tan, Ying Qin, Kaitao Song, Xu Tan, Sheng Zhao, Tan Lee:
A study on the efficacy of model pre-training in developing neural text-to-speech system. CoRR abs/2110.03857 (2021) - [i2]Daxin Tan, Guangyan Zhang, Tan Lee:
Environment Aware Text-to-Speech Synthesis. CoRR abs/2110.03887 (2021) - 2020
- [i1]Daxin Tan, Tan Lee:
Fine-grained style modelling and transfer in text-to-speech synthesis via content-style disentanglement. CoRR abs/2011.03943 (2020)
Coauthor Index

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from ,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2025-02-02 23:22 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint