Zhengkun Tian
2020 – today
- 2024
- [j7] Jiangyan Yi, Chenglong Wang, Jianhua Tao, Chuyuan Zhang, Cunhang Fan, Zhengkun Tian, Haoxin Ma, Ruibo Fu:
SceneFake: An initial dataset and benchmarks for scene fake audio detection. Pattern Recognit. 152: 110468 (2024)
- [i24] Song Li, Yongbin You, Xuezhi Wang, Zhengkun Tian, Ke Ding, Guanglu Wan:
MSR-86K: An Evolving, Multilingual Corpus with 86,300 Hours of Transcribed Audio for Speech Recognition Research. CoRR abs/2406.18301 (2024)
- 2023
- [j6] Jiangyan Yi, Jianhua Tao, Ye Bai, Zhengkun Tian, Cunhang Fan:
Transfer knowledge for punctuation prediction via adversarial training. Speech Commun. 149: 1-10 (2023)
- [c22] Xiaohui Zhang, Mangui Liang, Zhengkun Tian, Jiangyan Yi, Jianhua Tao:
TST: Time-Sparse Transducer for Automatic Speech Recognition. CICAI (2) 2023: 68-80
- [c21] Zhengkun Tian, Hongyu Xiang, Min Li, Feifei Lin, Ke Ding, Guanglu Wan:
Peak-First CTC: Reducing the Peak Latency of CTC Models by Applying Peak-First Regularization. ICASSP 2023: 1-5
- [i23] Xiaohui Zhang, Mangui Liang, Zhengkun Tian, Jiangyan Yi, Jianhua Tao:
TST: Time-Sparse Transducer for Automatic Speech Recognition. CoRR abs/2307.08323 (2023)
- [i22] Lei Zhang, Zhengkun Tian, Xiang Chen, Jiaming Sun, Hongyu Xiang, Ke Ding, Guanglu Wan:
CPPF: A contextual and post-processing-free model for automatic speech recognition. CoRR abs/2309.07413 (2023)
- 2022
- [j5] Zhengkun Tian, Jiangyan Yi, Jianhua Tao, Shuai Zhang, Zhengqi Wen:
Hybrid Autoregressive and Non-Autoregressive Transformer Models for Speech Recognition. IEEE Signal Process. Lett. 29: 762-766 (2022)
- [c20] Cong Cai, Bin Liu, Jianhua Tao, Zhengkun Tian, Jiahao Lu, Kexin Wang:
End-to-End Network Based on Transformer for Automatic Detection of Covid-19. ICASSP 2022: 9082-9086
- [c19] Jiangyan Yi, Ruibo Fu, Jianhua Tao, Shuai Nie, Haoxin Ma, Chenglong Wang, Tao Wang, Zhengkun Tian, Ye Bai, Cunhang Fan, Shan Liang, Shiming Wang, Shuai Zhang, Xinrui Yan, Le Xu, Zhengqi Wen, Haizhou Li:
ADD 2022: the first Audio Deep Synthesis Detection Challenge. ICASSP 2022: 9216-9220
- [c18] Shuai Zhang, Jiangyan Yi, Zhengkun Tian, Jianhua Tao, Yu Ting Yeung, Liqun Deng:
Reducing multilingual context confusion for end-to-end code-switching automatic speech recognition. INTERSPEECH 2022: 3894-3898
- [c17] Chenglong Wang, Jiangyan Yi, Jianhua Tao, Haiyang Sun, Xun Chen, Zhengkun Tian, Haoxin Ma, Cunhang Fan, Ruibo Fu:
Fully Automated End-to-End Fake Audio Detection. DDAM@MM 2022: 27-33
- [i21] Shuai Zhang, Jiangyan Yi, Zhengkun Tian, Jianhua Tao, Yu Ting Yeung, Liqun Deng:
Reducing language context confusion for end-to-end code-switching automatic speech recognition. CoRR abs/2201.12155 (2022)
- [i20] Jiangyan Yi, Ruibo Fu, Jianhua Tao, Shuai Nie, Haoxin Ma, Chenglong Wang, Tao Wang, Zhengkun Tian, Ye Bai, Cunhang Fan, Shan Liang, Shiming Wang, Shuai Zhang, Xinrui Yan, Le Xu, Zhengqi Wen, Haizhou Li, Zheng Lian, Bin Liu:
ADD 2022: the First Audio Deep Synthesis Detection Challenge. CoRR abs/2202.08433 (2022)
- [i19] Chenglong Wang, Jiangyan Yi, Jianhua Tao, Haiyang Sun, Xun Chen, Zhengkun Tian, Haoxin Ma, Cunhang Fan, Ruibo Fu:
Fully Automated End-to-End Fake Audio Detection. CoRR abs/2208.09618 (2022)
- [i18] Xinrui Yan, Jiangyan Yi, Jianhua Tao, Chenglong Wang, Haoxin Ma, Zhengkun Tian, Ruibo Fu:
System Fingerprints Detection for DeepFake Audio: An Initial Dataset and Investigation. CoRR abs/2208.10489 (2022)
- [i17] Zhengkun Tian, Hongyu Xiang, Min Li, Feifei Lin, Ke Ding, Guanglu Wan:
Peak-First CTC: Reducing the Peak Latency of CTC Models by Applying Peak-First Regularization. CoRR abs/2211.03284 (2022)
- [i16] Jiangyan Yi, Chenglong Wang, Jianhua Tao, Zhengkun Tian, Cunhang Fan, Haoxin Ma, Ruibo Fu:
SceneFake: An Initial Dataset and Benchmarks for Scene Fake Audio Detection. CoRR abs/2211.06073 (2022)
- 2021
- [j4] Cunhang Fan, Jiangyan Yi, Jianhua Tao, Zhengkun Tian, Bin Liu, Zhengqi Wen:
Gated Recurrent Fusion With Joint Training Framework for Robust End-to-End Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 29: 198-209 (2021)
- [j3] Ye Bai, Jiangyan Yi, Jianhua Tao, Zhengqi Wen, Zhengkun Tian, Shuai Zhang:
Integrating Knowledge Into End-to-End Speech Recognition From External Text-Only Data. IEEE ACM Trans. Audio Speech Lang. Process. 29: 1340-1351 (2021)
- [j2] Ye Bai, Jiangyan Yi, Jianhua Tao, Zhengkun Tian, Zhengqi Wen, Shuai Zhang:
Fast End-to-End Speech Recognition Via Non-Autoregressive Models and Cross-Modal Knowledge Transferring From BERT. IEEE ACM Trans. Audio Speech Lang. Process. 29: 1897-1911 (2021)
- [c16] Dianbo Sui, Zhengkun Tian, Yubo Chen, Kang Liu, Jun Zhao:
A Large-Scale Chinese Multimodal NER Dataset with Speech Clues. ACL/IJCNLP (1) 2021: 2807-2818
- [c15] Zhengkun Tian, Jiangyan Yi, Ye Bai, Jianhua Tao, Shuai Zhang, Zhengqi Wen:
One In A Hundred: Selecting the Best Predicted Sequence from Numerous Candidates for Speech Recognition. APSIPA ASC 2021: 454-459
- [c14] Shuai Zhang, Jiangyan Yi, Zhengkun Tian, Ye Bai, Jianhua Tao, Zhengqi Wen:
Decoupling Pronunciation and Language for End-to-End Code-Switching Automatic Speech Recognition. ICASSP 2021: 6249-6253
- [c13] Shuai Zhang, Jiangyan Yi, Zhengkun Tian, Ye Bai, Jianhua Tao, Xuefei Liu, Zhengqi Wen:
End-to-End Spelling Correction Conditioned on Acoustic Feature for Code-Switching Speech Recognition. Interspeech 2021: 266-270
- [c12] Haoxin Ma, Jiangyan Yi, Jianhua Tao, Ye Bai, Zhengkun Tian, Chenglong Wang:
Continual Learning for Fake Audio Detection. Interspeech 2021: 886-890
- [c11] Jiangyan Yi, Ye Bai, Jianhua Tao, Haoxin Ma, Zhengkun Tian, Chenglong Wang, Tao Wang, Ruibo Fu:
Half-Truth: A Partially Fake Audio Detection Dataset. Interspeech 2021: 1654-1658
- [c10] Zhengkun Tian, Jiangyan Yi, Ye Bai, Jianhua Tao, Shuai Zhang, Zhengqi Wen:
FSR: Accelerating the Inference Process of Transducer-Based Models by Applying Fast-Skip Regularization. Interspeech 2021: 4034-4038
- [c9] Chenglong Wang, Jiangyan Yi, Jianhua Tao, Ye Bai, Zhengkun Tian:
Hierarchically Attending Time-Frequency and Channel Features for Improving Speaker Verification. ISCSLP 2021: 1-5
- [c8] Shuai Zhang, Jiangyan Yi, Zhengkun Tian, Jianhua Tao, Ye Bai:
Rnn-transducer With Language Bias For End-to-end Mandarin-English Code-switching Speech Recognition. ISCSLP 2021: 1-5
- [i15] Ye Bai, Jiangyan Yi, Jianhua Tao, Zhengkun Tian, Zhengqi Wen, Shuai Zhang:
Fast End-to-End Speech Recognition via a Non-Autoregressive Model and Cross-Modal Knowledge Transferring from BERT. CoRR abs/2102.07594 (2021)
- [i14] Zhengkun Tian, Jiangyan Yi, Jianhua Tao, Ye Bai, Shuai Zhang, Zhengqi Wen, Xuefei Liu:
TSNAT: Two-Step Non-Autoregressvie Transformer Models for Speech Recognition. CoRR abs/2104.01522 (2021)
- [i13] Zhengkun Tian, Jiangyan Yi, Ye Bai, Jianhua Tao, Shuai Zhang, Zhengqi Wen:
FSR: Accelerating the Inference Process of Transducer-Based Models by Applying Fast-Skip Regularization. CoRR abs/2104.02882 (2021)
- [i12] Jiangyan Yi, Ye Bai, Jianhua Tao, Zhengkun Tian, Chenglong Wang, Tao Wang, Ruibo Fu:
Half-Truth: A Partially Fake Audio Detection Dataset. CoRR abs/2104.03617 (2021)
- [i11] Haoxin Ma, Jiangyan Yi, Jianhua Tao, Ye Bai, Zhengkun Tian, Chenglong Wang:
Continual Learning for Fake Audio Detection. CoRR abs/2104.07286 (2021)
- 2020
- [j1] Bocheng Zhao, Jianhua Tao, Minghao Yang, Zhengkun Tian, Cunhang Fan, Ye Bai:
Deep imitator: Handwriting calligraphy imitation via deep attention networks. Pattern Recognit. 104: 107080 (2020)
- [c7] Zhengkun Tian, Jiangyan Yi, Ye Bai, Jianhua Tao, Shuai Zhang, Zhengqi Wen:
Synchronous Transformers for end-to-end Speech Recognition. ICASSP 2020: 7884-7888
- [c6] Jiangyan Yi, Jianhua Tao, Zhengkun Tian, Ye Bai, Cunhang Fan:
Focal Loss for Punctuation Prediction. INTERSPEECH 2020: 721-725
- [c5] Ye Bai, Jiangyan Yi, Jianhua Tao, Zhengkun Tian, Zhengqi Wen, Shuai Zhang:
Listen Attentively, and Spell Once: Whole Sentence Generation via a Non-Autoregressive Architecture for Low-Latency Speech Recognition. INTERSPEECH 2020: 3381-3385
- [c4] Zhengkun Tian, Jiangyan Yi, Jianhua Tao, Ye Bai, Shuai Zhang, Zhengqi Wen:
Spike-Triggered Non-Autoregressive Transformer for End-to-End Speech Recognition. INTERSPEECH 2020: 5026-5030
- [i10] Shuai Zhang, Jiangyan Yi, Zhengkun Tian, Jianhua Tao, Ye Bai:
Rnn-transducer with language bias for end-to-end Mandarin-English code-switching speech recognition. CoRR abs/2002.08126 (2020)
- [i9] Jiangyan Yi, Jianhua Tao, Ye Bai, Zhengkun Tian, Cunhang Fan:
Adversarial Transfer Learning for Punctuation Restoration. CoRR abs/2004.00248 (2020)
- [i8] Ye Bai, Jiangyan Yi, Jianhua Tao, Zhengkun Tian, Zhengqi Wen, Shuai Zhang:
Listen Attentively, and Spell Once: Whole Sentence Generation via a Non-Autoregressive Architecture for Low-Latency Speech Recognition. CoRR abs/2005.04862 (2020)
- [i7] Zhengkun Tian, Jiangyan Yi, Jianhua Tao, Ye Bai, Shuai Zhang, Zhengqi Wen:
Spike-Triggered Non-Autoregressive Transformer for End-to-End Speech Recognition. CoRR abs/2005.07903 (2020)
- [i6] Shuai Zhang, Jiangyan Yi, Zhengkun Tian, Ye Bai, Jianhua Tao, Zhengqi Wen:
Decoupling Pronunciation and Language for End-to-end Code-switching Automatic Speech Recognition. CoRR abs/2010.14798 (2020)
- [i5] Cunhang Fan, Jiangyan Yi, Jianhua Tao, Zhengkun Tian, Bin Liu, Zhengqi Wen:
Gated Recurrent Fusion with Joint Training Framework for Robust End-to-End Speech Recognition. CoRR abs/2011.04249 (2020)
2010 – 2019
- 2019
- [c3] Ye Bai, Jiangyan Yi, Jianhua Tao, Zhengqi Wen, Zhengkun Tian, Chenghao Zhao, Cunhang Fan:
A Time Delay Neural Network with Shared Weight Self-Attention for Small-Footprint Keyword Spotting. INTERSPEECH 2019: 2190-2194
- [c2] Ye Bai, Jiangyan Yi, Jianhua Tao, Zhengkun Tian, Zhengqi Wen:
Learn Spelling from Teachers: Transferring Knowledge from Language Models to Sequence-to-Sequence Speech Recognition. INTERSPEECH 2019: 3795-3799
- [c1] Zhengkun Tian, Jiangyan Yi, Jianhua Tao, Ye Bai, Zhengqi Wen:
Self-Attention Transducers for End-to-End Speech Recognition. INTERSPEECH 2019: 4395-4399
- [i4] Ye Bai, Jiangyan Yi, Jianhua Tao, Zhengkun Tian, Zhengqi Wen:
Learn Spelling from Teachers: Transferring Knowledge from Language Models to Sequence-to-Sequence Speech Recognition. CoRR abs/1907.06017 (2019)
- [i3] Zhengkun Tian, Jiangyan Yi, Jianhua Tao, Ye Bai, Zhengqi Wen:
Self-Attention Transducers for End-to-End Speech Recognition. CoRR abs/1909.13037 (2019)
- [i2] Ye Bai, Jiangyan Yi, Jianhua Tao, Zhengkun Tian, Zhengqi Wen, Shuai Zhang:
Integrating Whole Context to Sequence-to-sequence Speech Recognition. CoRR abs/1912.01777 (2019)
- [i1] Zhengkun Tian, Jiangyan Yi, Ye Bai, Jianhua Tao, Shuai Zhang, Zhengqi Wen:
Synchronous Transformers for End-to-End Speech Recognition. CoRR abs/1912.02958 (2019)
last updated on 2024-08-05 21:10 CEST by the dblp team
all metadata released as open data under CC0 1.0 license