default search action
Tan Lee 0001
Person information
- affiliation: Chinese University of Hong Kong, Department of Electronic Engineering, Hong Kong
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
Books and Theses
- 1996
- [b1]Tan Lee:
Automatic recognition of isolated Cantonese syllables using neural networks =: 利用神經網絡識別粤語單音節. Chinese University of Hong Kong, Hong Kong, 1996
Journal Articles
- 2024
- [j40]Si Ioi Ng, Cymie Wing-Yee Ng, Jiarui Wang, Tan Lee:
Automatic Detection of Speech Sound Disorder in Cantonese-Speaking Pre-School Children. IEEE ACM Trans. Audio Speech Lang. Process. 32: 4355-4368 (2024) - 2023
- [j39]Guangyan Zhang, Ying Qin, Wenjie Zhang, Jialun Wu, Mei Li, Yutao Gai, Feijun Jiang, Tan Lee:
iEmoTTS: Toward Robust Cross-Speaker Emotion Transfer and Control for Speech Synthesis Based on Disentanglement Between Prosody and Timbre. IEEE ACM Trans. Audio Speech Lang. Process. 31: 1693-1705 (2023) - 2022
- [j38]Shuiyang Mao, P. C. Ching, Tan Lee:
Enhancing Segment-Based Speech Emotion Recognition by Iterative Self-Learning. IEEE ACM Trans. Audio Speech Lang. Process. 30: 123-134 (2022) - 2021
- [j37]Xurong Xie, Xunying Liu, Tan Lee, Lan Wang:
Bayesian Learning for Deep Neural Network Adaptation. IEEE ACM Trans. Audio Speech Lang. Process. 29: 2096-2110 (2021) - 2020
- [j36]Juan Ignacio Godino-Llorente, Douglas D. O'Shaughnessy, Tan Lee, Najim Dehak, Claudia Manfredi:
Introduction to the Issue on Automatic Assessment of Health Disorders Based on Voice, Speech, and Language Processing. IEEE J. Sel. Top. Signal Process. 14(2): 234-239 (2020) - [j35]Ying Qin, Tan Lee, Anthony Pak-Hin Kong:
Automatic Assessment of Speech Impairment in Cantonese-Speaking People with Aphasia. IEEE J. Sel. Top. Signal Process. 14(2): 331-345 (2020) - [j34]Ying Qin, Yuzhong Wu, Tan Lee, Anthony Pak-Hin Kong:
An End-to-End Approach to Automatic Speech Assessment for Cantonese-speaking People with Aphasia. J. Signal Process. Syst. 92(8): 819-830 (2020) - 2019
- [j33]Yuanyuan Liu, Tan Lee, Thomas K. T. Law, Kathy Yuet-Sheung Lee:
Acoustical Assessment of Voice Disorder With Continuous Speech Using ASR Posterior Features. IEEE ACM Trans. Audio Speech Lang. Process. 27(6): 1047-1059 (2019) - [j32]Siyuan Feng, Tan Lee:
Exploiting Cross-Lingual Speaker and Phonetic Diversity for Unsupervised Subword Modeling. IEEE ACM Trans. Audio Speech Lang. Process. 27(12): 2000-2011 (2019) - 2018
- [j31]Lei Xie, Tan Lee, Man-Wai Mak:
Guest Editorial: Advances in Deep Learning for Speech Processing. J. Signal Process. Syst. 90(7): 959-961 (2018) - 2017
- [j30]Hansjörg Mixdorff, Angelika Hönemann, Albert Rilliard, Tan Lee, Matthew K. H. Ma:
Audio-visual expressions of attitude: How many different attitudes can perceivers decode? Speech Commun. 95: 114-126 (2017) - 2016
- [j29]Shing Yu, Tan Lee, Manwa L. Ng:
Surface Electromyographic Activity of Extrinsic Laryngeal Muscles in Cantonese Tone Production. J. Signal Process. Syst. 82(2): 287-294 (2016) - 2015
- [j28]Feng Huang, Tan Lee, W. Bastiaan Kleijn, Ying-Yee Kong:
A method of speech periodicity enhancement using transform-domain signal decomposition. Speech Commun. 67: 102-112 (2015) - [j27]Huijun Ding, Tan Lee, Ing Yann Soon, Chai Kiat Yeo, Peng Dai, Guo Dan:
Objective measures for quality assessment of noise-suppressed speech. Speech Commun. 71: 62-73 (2015) - [j26]Haipeng Wang, Tan Lee, Cheung-Chi Leung, Bin Ma, Haizhou Li:
Acoustic Segment Modeling with Spectral Clustering Methods. IEEE ACM Trans. Audio Speech Lang. Process. 23(2): 264-277 (2015) - [j25]Yu Ting Yeung, Tan Lee, Cheung-Chi Leung:
Supervised Single-Microphone Multi-Talker Speech Separation with Conditional Random Fields. IEEE ACM Trans. Audio Speech Lang. Process. 23(12): 2334-2342 (2015) - 2013
- [j24]Haipeng Wang, Cheung-Chi Leung, Tan Lee, Bin Ma, Haizhou Li:
Shifted-Delta MLP Features for Spoken Language Recognition. IEEE Signal Process. Lett. 20(1): 15-18 (2013) - [j23]Feng Huang, Tan Lee:
Pitch Estimation in Noisy Speech Using Accumulated Peak Spectrum and Sparse Estimation Technique. IEEE Trans. Speech Audio Process. 21(1): 97-107 (2013) - [j22]Raymond W. M. Ng, Tan Lee, Cheung-Chi Leung, Bin Ma, Haizhou Li:
Spoken Language Recognition With Prosodic Features. IEEE Trans. Speech Audio Process. 21(9): 1841-1853 (2013) - 2011
- [j21]Ning Wang, P. C. Ching, Nengheng Zheng, Tan Lee:
Robust Speaker Recognition Using Denoised Vocal Source and Vocal Tract Features. IEEE Trans. Speech Audio Process. 19(1): 196-205 (2011) - 2009
- [j20]Juan Ignacio Godino-Llorente, Pedro Gómez Vilda, Tan Lee:
Analysis and Signal Processing of Oesophageal and Pathological Voices. EURASIP J. Adv. Signal Process. 2009 (2009) - [j19]Joyce Y. C. Chan, Houwei Cao, P. C. Ching, Tan Lee:
Automatic Recognition of Cantonese-English Code-Mixing Speech. Int. J. Comput. Linguistics Chin. Lang. Process. 14(3) (2009) - [j18]Raymond W. M. Ng, Tan Lee, Cheung-Chi Leung, Bin Ma, Haizhou Li:
Analysis and Selection of Prosodic Features for Asian Language Recognition. Int. J. Asian Lang. Process. 19(4): 139-152 (2009) - 2008
- [j17]Yao Qian, Frank K. Soong, Tan Lee:
Tone-enhanced generalized character posterior probability (GCPP) for Cantonese LVCSR. Comput. Speech Lang. 22(4): 360-373 (2008) - 2007
- [j16]Shan Ouyang, Tan Lee, P. C. Ching:
A power-based adaptive method for eigenanalysis without square-root operations. Digit. Signal Process. 17(1): 209-224 (2007) - [j15]Nengheng Zheng, Tan Lee, Ning Wang, P. C. Ching:
Integrating Complementary Features from Vocal Source and Vocal Tract for Speaker Identification. Int. J. Comput. Linguistics Chin. Lang. Process. 12(3) (2007) - [j14]Nengheng Zheng, Tan Lee, Pak-Chung Ching:
Integration of Complementary Acoustic Features for Speaker Recognition. IEEE Signal Process. Lett. 14(3): 181-184 (2007) - [j13]Chen Yang, Frank K. Soong, Tan Lee:
Static and Dynamic Spectral Features: Their Noise Robustness and Optimal Weights for ASR. IEEE Trans. Speech Audio Process. 15(3): 1087-1097 (2007) - [j12]Wai Nang Chan, Nengheng Zheng, Tan Lee:
Discrimination Power of Vocal Source and Vocal Tract Related Features for Speaker Segmentation. IEEE Trans. Speech Audio Process. 15(6): 1884-1892 (2007) - 2006
- [j11]Tan Lee, Patgi Kam, Frank K. Soong:
Modeling Cantonese Pronunciation Variations for Large-Vocabulary Continuous Speech Recognition. Int. J. Comput. Linguistics Chin. Lang. Process. 11(1) (2006) - [j10]Yu Zhu, Tan Lee:
Using Duration Information in Cantonese Connected-Digit Recognition. Int. J. Comput. Linguistics Chin. Lang. Process. 11(1) (2006) - [j9]Meng Yuan, Tan Lee, P. C. Ching, Yu Zhu:
Speech recognition on DSP: issues on computational efficiency and performance analysis. Microprocess. Microsystems 30(3): 155-164 (2006) - 2004
- [j8]Yujia Li, Tan Lee, Yao Qian:
Analysis and modeling of F0 contours for cantonese text-to-speech. ACM Trans. Asian Lang. Inf. Process. 3(3): 169-180 (2004) - 2002
- [j7]Ge Gao, P. C. Ching, Tan Lee:
A new approach to generating Pitch Cycle Waveform (PCW) for Waveform Interpolation codec. Microprocess. Microsystems 25(9-10): 421-426 (2002) - [j6]Tan Lee, Wai Kit Lo, P. C. Ching, Helen M. Meng:
Spoken language resources for Cantonese speech processing. Speech Commun. 36(3-4): 327-342 (2002) - [j5]Tan Lee, Wai H. Lau, Yiu Wing Wong, P. C. Ching:
Using tone information in Cantonese continuous speech recognition. ACM Trans. Asian Lang. Inf. Process. 1(1): 83-102 (2002) - 1999
- [j4]Tan Lee, P. C. Ching:
Cantonese syllable recognition using neural networks. IEEE Trans. Speech Audio Process. 7(4): 466-472 (1999) - 1998
- [j3]Tan Lee, P. C. Ching, Lai-Wan Chan:
Isolated word recognition using modular recurrent neural networks. Pattern Recognit. 31(6): 751-760 (1998) - 1995
- [j2]Tan Lee, P. C. Ching, Lai-Wan Chan, Y. H. Cheng, Brian Mak:
Tone recognition of isolated Cantonese syllables. IEEE Trans. Speech Audio Process. 3(3): 204-209 (1995) - 1992
- [j1]Fu-Lai Chung, Tan Lee:
A Node Pruning Algorithm for Backpropagation Networks. Int. J. Neural Syst. 3(3): 301-314 (1992)
Conference and Workshop Papers
- 2024
- [c197]Yusheng Tian, Jingyu Li, Tan Lee:
Creating Personalized Synthetic Voices from Articulation Impaired Speech Using Augmented Reconstruction Loss. ICASSP 2024: 11501-11505 - [c196]Wei Liu, Ying Qin, Zhiyuan Peng, Tan Lee:
Sparsely Shared Lora on Whisper for Child Speech Recognition. ICASSP 2024: 11751-11755 - [c195]Dehua Tao, Tan Lee, Harold Chui, Sarah Luk:
Modeling Intrapersonal and Interpersonal Influences for Automatic Estimation of Therapist Empathy in Counseling Conversation. ICASSP 2024: 12692-12696 - [c194]Jingyu Li, Tan Lee:
Efficient Black-Box Speaker Verification Model Adaptation With Reprogramming And Backend Learning. ICASSP 2024: 12732-12736 - [c193]Yujia Xiao, Xi Wang, Xu Tan, Lei He, Xinfa Zhu, Sheng Zhao, Tan Lee:
Contrastive Context-Speech Pretraining for Expressive Text-to-Speech Synthesis. ACM Multimedia 2024: 2099-2107 - 2023
- [c192]Yusheng Tian, Wei Liu, Tan Lee:
Diffusion-Based Mel-Spectrogram Enhancement for Personalized Speech Synthesis with Found Data. ASRU 2023: 1-7 - [c191]Monira Islam, Tan Lee:
Functional Connectivity Analysis in Multi-channel EEG for Emotion Detection with Phase Locking Value and 3D CNN. EMBC 2023: 1-4 - [c190]Jingyu Li, Yusheng Tian, Tan Lee:
Convolution-Based Channel-Frequency Attention for Text-Independent Speaker Verification. ICASSP 2023: 1-5 - [c189]Wei Liu, Kaiqi Fu, Xiaohai Tian, Shuju Shi, Wei Li, Zejun Ma, Tan Lee:
Leveraging Phone-Level Linguistic-Acoustic Similarity For Utterance-Level Pronunciation Scoring. ICASSP 2023: 1-5 - [c188]Wei Liu, Kaiqi Fu, Xiaohai Tian, Shuju Shi, Wei Li, Zejun Ma, Tan Lee:
An ASR-Free Fluency Scoring Approach with Self-Supervised Learning. ICASSP 2023: 1-5 - [c187]Zhiyuan Peng, Mingjie Shao, Xuanji He, Xu Li, Tan Lee, Ke Ding, Guanglu Wan:
Covariance Regularization for Probabilistic Linear Discriminant Analysis. ICASSP 2023: 1-5 - [c186]Jingyu Li, Wei Liu, Zhaoyang Zhang, Jiong Wang, Tan Lee:
Model Compression for DNN-based Speaker Verification Using Weight Quantization. INTERSPEECH 2023: 1988-1992 - [c185]Wei Liu, Zhiyuan Peng, Tan Lee:
CoMFLP: Correlation Measure Based Fast Search on ASR Layer Pruning. INTERSPEECH 2023: 3282-3286 - [c184]Dehua Tao, Tan Lee, Harold Chui, Sarah Luk:
A Study on Prosodic Entrainment in Relation to Therapist Empathy in Counseling Conversation. INTERSPEECH 2023: 3662-3666 - [c183]Si Ioi Ng, Cymie Wing-Yee Ng, Tan Lee:
A Study on Using Duration and Formant Features in Automatic Detection of Speech Sound Disorder in Children. INTERSPEECH 2023: 4643-4647 - [c182]Yujia Xiao, Shaofei Zhang, Xi Wang, Xu Tan, Lei He, Sheng Zhao, Frank K. Soong, Tan Lee:
ContextSpeech: Expressive and Efficient Text-to-Speech for Paragraph Reading. INTERSPEECH 2023: 4883-4887 - [c181]Yusheng Tian, Guangyan Zhang, Tan Lee:
Creating Personalized Synthetic Voices from Post-Glossectomy Speech with Guided Diffusion Models. INTERSPEECH 2023: 4893-4897 - 2022
- [c180]Monira Islam, Tan Lee:
MEMD-HHT based Emotion Detection from EEG using 3D CNN. EMBC 2022: 284-287 - [c179]Monira Islam, Tan Lee:
Multivariate Empirical Mode Decomposition of EEG for Mental State Detection at Localized Brain Lobes. EMBC 2022: 3694-3697 - [c178]Guangyan Zhang, Yichong Leng, Daxin Tan, Ying Qin, Kaitao Song, Xu Tan, Sheng Zhao, Tan Lee:
A Study on the Efficacy of Model Pre-Training In Developing Neural Text-to-Speech System. ICASSP 2022: 6087-6091 - [c177]Yusheng Tian, Jingyu Li, Tan Lee:
Transport-Oriented Feature Aggregation for Speaker Embedding Learning. INTERSPEECH 2022: 316-320 - [c176]Zhiyuan Peng, Xuanji He, Ke Ding, Tan Lee, Guanglu Wan:
Unifying Cosine and PLDA Back-ends for Speaker Verification. INTERSPEECH 2022: 336-340 - [c175]Guangyan Zhang, Kaitao Song, Xu Tan, Daxin Tan, Yuzi Yan, Yanqing Liu, Gang Wang, Wei Zhou, Tao Qin, Tan Lee, Sheng Zhao:
Mixed-Phoneme BERT: Improving BERT with Mixed Phoneme and Sup-Phoneme Representations for Text to Speech. INTERSPEECH 2022: 456-460 - [c174]Daxin Tan, Guangyan Zhang, Tan Lee:
Environment Aware Text-to-Speech Synthesis. INTERSPEECH 2022: 481-485 - [c173]Dehua Tao, Tan Lee, Harold Chui, Sarah Luk:
Characterizing Therapist's Speaking Style in Relation to Empathy in Psychotherapy. INTERSPEECH 2022: 2003-2007 - [c172]Dehua Tao, Tan Lee, Harold Chui, Sarah Luk:
Hierarchical Attention Network for Evaluating Therapist Empathy in Counseling Session. INTERSPEECH 2022: 2008-2012 - [c171]Si Ioi Ng, Cymie Wing-Yee Ng, Jiarui Wang, Tan Lee:
Automatic Detection of Speech Sound Disorder in Child Speech Using Posterior-based Speaker Representations. INTERSPEECH 2022: 2853-2857 - [c170]Jingyu Li, Wei Liu, Tan Lee:
EDITnet: A Lightweight Network for Unsupervised Domain Adaptation in Speaker Verification. INTERSPEECH 2022: 3694-3698 - [c169]Jonathan Him Nok Lee, Dehua Tao, Harold Chui, Tan Lee, Sarah Luk, Nicolette Wing Tung Lee, Koonkan Fung:
Durational Patterning at Discourse Boundaries in Relation to Therapist Empathy in Psychotherapy. INTERSPEECH 2022: 5248-5252 - [c168]Daxin Tan, Liqun Deng, Nianzu Zheng, Yu Ting Yeung, Xin Jiang, Xiao Chen, Tan Lee:
CorrectSpeech: A Fully Automated System for Speech Correction and Accent Reduction. ISCSLP 2022: 81-85 - [c167]Zhiyuan Peng, Xuanji He, Ke Ding, Tan Lee, Guanglu Wan:
Label-free Knowledge Distillation with Contrastive Loss for Light-weight Speaker Recognition. ISCSLP 2022: 324-328 - [c166]Dehua Tao, Harold Chui, Sarah Luk, Tan Lee:
CUEMPATHY: A Counseling Speech Dataset for Psychotherapy Research. ISCSLP 2022: 354-358 - [c165]Ying Qin, Tan Lee, Anthony Pak-Hin Kong, Feng Lin:
Aphasia Detection for Cantonese-Speaking and Mandarin-Speaking Patients Using Pre-Trained Language Models. ISCSLP 2022: 359-363 - 2021
- [c164]Jingyu Li, Si Ioi Ng, Tan Lee:
Improving Text-Independent Speaker Verification with Auxiliary Speakers Using Graph. ASRU 2021: 198-205 - [c163]Wei Liu, Tan Lee:
Utterance-Level Neural Confidence Measure for End-to-End Children Speech Recognition. ASRU 2021: 449-456 - [c162]Daxin Tan, Liqun Deng, Yu Ting Yeung, Xin Jiang, Xiao Chen, Tan Lee:
EditSpeech: A Text Based Speech Editing System Using Partial Inference and Bidirectional Fusion. ASRU 2021: 626-633 - [c161]Si Ioi Ng, Cymie Wing-Yee Ng, Jingyu Li, Tan Lee:
Detection of Consonant Errors in Disordered Speech Based on Consonant-Vowel Segment Embedding. Interspeech 2021: 2931-2935 - [c160]Guangyan Zhang, Ying Qin, Daxin Tan, Tan Lee:
Applying the Information Bottleneck Principle to Prosodic Representation Learning. Interspeech 2021: 3156-3160 - [c159]Zhiyuan Peng, Xu Li, Tan Lee:
Pairing Weak with Strong: Twin Models for Defending Against Adversarial Attack on Speaker Verification. Interspeech 2021: 4284-4288 - [c158]Daxin Tan, Tan Lee:
Fine-Grained Style Modeling, Transfer and Prediction in Text-to-Speech Synthesis via Phone-Level Content-Style Disentanglement. Interspeech 2021: 4683-4687 - [c157]Ying Qin, Yao Qian, Anastassia Loukina, Patrick L. Lange, Abhinav Misra, Keelan Evanini, Tan Lee:
Automatic Detection of Word-Level Reading Errors in Non-native English Speech Based on ASR Output. ISCSLP 2021: 1-5 - [c156]Guangyan Zhang, Shirong Qiu, Ying Qin, Tan Lee:
Estimating Mutual Information in Prosody Representation for Emotional Prosody Transfer in Speech Synthesis. ISCSLP 2021: 1-5 - [c155]Hei Yi Mak, Tan Lee:
Low-Resource NMT: A Case Study on the Written and Spoken Languages in Hong Kong. NLPIR 2021: 81-87 - 2020
- [c154]Yuzhong Wu, Tan Lee:
Searching for Efficient Network Architectures for Acoustic Scene Classification. DCASE 2020: 220-224 - [c153]Yuzhong Wu, Tan Lee:
Time-Frequency Feature Decomposition Based on Sound Duration for Acoustic Scene Classification. ICASSP 2020: 716-720 - [c152]Matthew King-Hang Ma, Tan Lee, Manson Cheuk-Man Fong, William Shi-Yuan Wang:
Resting-State EEG-Based Biometrics with Signals Features Extracted by Multivariate Empirical Mode Decomposition. ICASSP 2020: 991-995 - [c151]Zhiyuan Peng, Siyuan Feng, Tan Lee:
Mixture Factorized Auto-Encoder for Unsupervised Hierarchical Deep Factorization of Speech Signal. ICASSP 2020: 6774-6778 - [c150]Si Ioi Ng, Cymie Wing-Yee Ng, Jiarui Wang, Tan Lee, Kathy Yuet-Sheung Lee, Michael Chi-Fai Tong:
CUCHILD: A Large-Scale Cantonese Corpus of Child Speech for Phonology and Articulation Assessment. INTERSPEECH 2020: 424-428 - [c149]Shuiyang Mao, Pak-Chung Ching, Tan Lee:
Emotion Profile Refinery for Speech Emotion Classification. INTERSPEECH 2020: 531-535 - [c148]Jingyu Li, Tan Lee:
Text-Independent Speaker Verification with Dual Attention Network. INTERSPEECH 2020: 956-960 - [c147]Shuiyang Mao, P. C. Ching, Tan Lee:
EigenEmo: Spectral Utterance Representation Using Dynamic Mode Decomposition for Speech Emotion Classification. INTERSPEECH 2020: 2352-2356 - [c146]Shuiyang Mao, P. C. Ching, C.-C. Jay Kuo, Tan Lee:
Advancing Multiple Instance Learning with Attention Modeling for Categorical Speech Emotion Recognition. INTERSPEECH 2020: 2357-2361 - [c145]Guangyan Zhang, Ying Qin, Tan Lee:
Learning Syllable-Level Discrete Prosodic Representation for Expressive Speech Generation. INTERSPEECH 2020: 3426-3430 - [c144]Si Ioi Ng, Tan Lee:
Automatic Detection of Phonological Errors in Child Speech Using Siamese Recurrent Autoencoder. INTERSPEECH 2020: 4476-4480 - 2019
- [c143]Yuzhong Wu, Tan Lee:
Enhancing Sound Texture in CNN-based Acoustic Scene Classification. ICASSP 2019: 815-819 - [c142]Xurong Xie, Xunying Liu, Tan Lee, Shoukang Hu, Lan Wang:
BLHUC: Bayesian Learning of Hidden Unit Contributions for Deep Neural Network Speaker Adaptation. ICASSP 2019: 5711-5715 - [c141]Zhiyuan Peng, Siyuan Feng, Tan Lee:
Adversarial Multi-task Deep Features and Unsupervised Back-end Adaptation for Language Recognition. ICASSP 2019: 5961-5965 - [c140]Ying Qin, Tan Lee, Anthony Pak-Hin Kong:
Combining Phone Posteriorgrams from Strong and Weak Recognizers for Automatic Speech Assessment of People with Aphasia. ICASSP 2019: 6420-6424 - [c139]Shuiyang Mao, Dehua Tao, Guangyan Zhang, P. C. Ching, Tan Lee:
Revisiting Hidden Markov Models for Speech Emotion Recognition. ICASSP 2019: 6715-6719 - [c138]Siyuan Feng, Tan Lee:
Improving Unsupervised Subword Modeling via Disentangled Speech Representation Learning and Transformation. INTERSPEECH 2019: 281-285 - [c137]Xurong Xie, Xunying Liu, Tan Lee, Lan Wang:
Fast DNN Acoustic Model Speaker Adaptation by Learning Hidden Unit Contribution Features. INTERSPEECH 2019: 759-763 - [c136]Siyuan Feng, Tan Lee, Zhiyuan Peng:
Combining Adversarial Training and Disentangled Speech Representation for Robust Zero-Resource Subword Modeling. INTERSPEECH 2019: 1093-1097 - [c135]Shuiyang Mao, P. C. Ching, Tan Lee:
Deep Learning of Segment-Level Feature Representation with Multiple Instance Learning for Utterance-Level Speech Emotion Recognition. INTERSPEECH 2019: 1686-1690 - [c134]Ying Qin, Tan Lee, Anthony Pak-Hin Kong:
Automatic Assessment of Language Impairment Based on Raw ASR Output. INTERSPEECH 2019: 3078-3082 - [c133]Jiarui Wang, Ying Qin, Zhiyuan Peng, Tan Lee:
Child Speech Disorder Detection with Siamese Recurrent Network Using Speech Attribute Features. INTERSPEECH 2019: 3885-3889 - [c132]Tan Lee:
22nd oriental COCOSDA conference region report 2019. O-COCOSDA 2019: 1-5 - 2018
- [c131]Man-Ling Sung, Siyuan Feng, Tan Lee:
Unsupervised Pattern Discovery from Thematic Speech Archives Based on Multilingual Bottleneck Features. APSIPA 2018: 1448-1455 - [c130]Yuzhong Wu, Tan Lee:
Reducing Model Complexity for DNN Based Large-Scale Audio Classification. ICASSP 2018: 331-335 - [c129]Ying Qin, Tan Lee, Anthony Pak-Hin Kong:
Automatic Speech Assessment for Aphasic Patients Based on Syllable-Level Embedding and Supra-Segmental Duration Features. ICASSP 2018: 5994-5998 - [c128]Hansjörg Mixdorff, Albert Rilliard, Tan Lee, Matthew K. H. Ma, Angelika Hönemann:
Cross-cultural (A)symmetries in Audio-visual Attitude Perception. INTERSPEECH 2018: 426-430 - [c127]Siyuan Feng, Tan Lee:
Improving Cross-Lingual Knowledge Transferability Using Multilingual TDNN-BLSTM with Language-Dependent Pre-Final Layer. INTERSPEECH 2018: 2439-2443 - [c126]Siyuan Feng, Tan Lee:
Exploiting Speaker and Phonetic Diversity of Mismatched Language Resources for Unsupervised Subword Modeling. INTERSPEECH 2018: 2673-2677 - [c125]Ying Qin, Tan Lee, Siyuan Feng, Anthony Pak-Hin Kong:
Automatic Speech Assessment for People with Aphasia Using TDNN-BLSTM with Multi-Task Learning. INTERSPEECH 2018: 3418-3422 - [c124]Xurong Xie, Xunying Liu, Tan Lee, Lan Wang:
Investigation of Stacked Deep Neural Networks and Mixture Density Networks for Acoustic-to-Articulatory Inversion. ISCSLP 2018: 36-40 - [c123]Yuanyuan Liu, Ying Qin, Siyuan Feng, Tan Lee, P. C. Ching:
Disordered Speech Assessment Using Kullback-Leibler Divergence Features with Multi-Task Acoustic Modeling. ISCSLP 2018: 61-65 - [c122]Ying Qin, Tan Lee, Yuzhong Wu, Anthony Pak-Hin Kong:
An End-to-End Approach to Automatic Speech Assessment for People with Aphasia. ISCSLP 2018: 66-70 - [c121]Yuanyuan Liu, Tan Lee, Thomas K. T. Law, Kathy Y. S. Lee, P. C. Ching:
Prediction of Voice Disorder Severity: Contributions from Sustained Vowels and Continuous Speech. ISCSLP 2018: 290-294 - [c120]Jiarui Wang, Si Ioi Ng, Dehua Tao, Wing Yee Ng, Tan Lee:
A Study on Acoustic Modeling for Child Speech Based on Multi-Task Learning. ISCSLP 2018: 389-393 - [c119]Si Ioi Ng, Dehua Tao, Jiarui Wang, Yi Jiang, Wing Yee Ng, Tan Lee:
An Automated Assessment Tool for Child Speech Disorders. ISCSLP 2018: 493-494 - 2017
- [c118]Hansjörg Mixdorff, Angelika Hönemann, Albert Rilliard, Tan Lee, Matthew K. H. Ma:
Cross-Language Perception of Audio-visual Attitudinal Expressions. AVSP 2017: 119-124 - [c117]Lufei Gao, Li Su, Yi-Hsuan Yang, Tan Lee:
Polyphonic piano note transcription with non-negative matrix factorization of differential spectrogram. ICASSP 2017: 291-295 - [c116]Raymond W. M. Ng, Alvin C. M. Kwan, Tan Lee, Thomas Hain:
Shefce: A Cantonese-English bilingual speech corpus for pronunciation assessment. ICASSP 2017: 5825-5829 - [c115]Siyuan Feng, Tan Lee:
On the Linguistic Relevance of Speech Units Learned by Unsupervised Acoustic Modeling. INTERSPEECH 2017: 2068-2072 - [c114]Xurong Xie, Xunying Liu, Tan Lee, Lan Wang:
RNN-LDA Clustering for Feature Based DNN Adaptation. INTERSPEECH 2017: 2396-2400 - [c113]Yuanyuan Liu, Tan Lee, P. C. Ching, Thomas K. T. Law, Kathy Y. S. Lee:
Acoustic Assessment of Disordered Voice with Continuous Speech Based on Utterance-Level ASR Posterior Features. INTERSPEECH 2017: 2680-2684 - 2016
- [c112]Tan Lee, Yuanyuan Liu, Pei-Wen Huang, Jen-Tzung Chien, Wang-Kong Lam, Yu Ting Yeung, Thomas K. T. Law, Kathy Y. S. Lee, Anthony Pak-Hin Kong, Sam-Po Law:
Automatic speech recognition for acoustical analysis and assessment of cantonese pathological voice and speech. ICASSP 2016: 6475-6479 - [c111]Tan Lee, Yuanyuan Liu, Yu Ting Yeung, Thomas K. T. Law, Kathy Y. S. Lee:
Predicting Severity of Voice Disorder from DNN-HMM Acoustic Posteriors. INTERSPEECH 2016: 97-101 - [c110]Jen-Tzung Chien, Pei-Wen Huang, Tan Lee:
Hybrid Accelerated Optimization for Speech Recognition. INTERSPEECH 2016: 3399-3403 - [c109]Siyuan Feng, Tan Lee, Haipeng Wang:
Exploiting language-mismatched phoneme recognizers for unsupervised acoustic modeling. ISCSLP 2016: 1-5 - [c108]Ying Qin, Tan Lee, Anthony Pak-Hin Kong, Sam-Po Law:
Towards automatic assessment of aphasia speech using automatic speech recognition techniques. ISCSLP 2016: 1-4 - [c107]Raymond W. M. Ng, Mauro Nicolao, Oscar Saz, Madina Hasan, Bhusan Chettri, Mortaza Doulaty, Tan Lee, Thomas Hain:
The Sheffield language recognition system in NIST LRE 2015. Odyssey 2016: 181-187 - 2015
- [c106]Chun Hoy Wong, Tan Lee, Yu Ting Yeung, Pak-Chung Ching:
Modeling temporal dependency for robust estimation of LP model parameters in speech enhancement. INTERSPEECH 2015: 1730-1734 - [c105]Lufei Gao, Tan Lee:
Multi-pitch estimation based on sparse representation with pre-screened dictionary. MMSP 2015: 1-6 - [c104]Tan Lee, Wang-Kong Lam, Anthony Pak-Hin Kong, Sam-Po Law:
Analysis of intonation patterns in Cantonese aphasia speech. O-COCOSDA/CASLRE 2015: 86-89 - 2014
- [c103]Nan Yan, Manwa L. Ng, Tan Lee:
Improving the sound quality of an electronic voice box. APSIPA 2014: 1-4 - [c102]Haipeng Wang, Tan Lee, Cheung-Chi Leung, Bin Ma, Haizhou Li:
A graph-based Gaussian component clustering approach to unsupervised acoustic modeling. INTERSPEECH 2014: 875-879 - [c101]Yu Ting Yeung, Tan Lee, Cheung-Chi Leung:
Large-margin conditional random fields for single-microphone speech separation. INTERSPEECH 2014: 983-987 - [c100]Shing Yu, Tan Lee, Manwa L. Ng:
Surface electromyographic activity of non-laryngeal neck muscles in Cantonese tone production. ISCSLP 2014: 304-307 - [c99]Feng Huang, Tan Lee:
Multipitch tracking based on linear programming relaxation and sparsity-based pitch candidate estimation. ISCSLP 2014: 331-335 - [c98]Wang-Kong Lam, Tan Lee:
Correcting Chord Classification Errors Based on Tonal Organization Information of Classical Music. ISM 2014: 131-134 - [c97]Wang-Kong Lam, Tan Lee:
Automatic Key Partition Based on Tonal Organization Information of Classical Music. ISMIR 2014: 501-506 - [c96]Haipeng Wang, Tan Lee:
CUHK System for QUESST Task of MediaEval 2014. MediaEval 2014 - 2013
- [c95]Manwa L. Ng, Tan Lee, Nan Yan:
Improving the sound quality of an electronic voice box. BMEI 2013: 368-372 - [c94]Wang-Kong Lam, Tan Lee:
Chord classification of multi-instrumental music using exemplar-based sparse representation. ChinaSIP 2013: 113-117 - [c93]Yu Ting Yeung, Tan Lee:
Structured mean field method for single-microphone speech separation with factorial Hidden Markov Model. ChinaSIP 2013: 122-126 - [c92]Meng Yuan, Yang Sun, Haihong Feng, Tan Lee:
A speech enhancement method for cochlear implant listeners. EMBC 2013: 2036-2039 - [c91]Yu Ting Yeung, Tan Lee, Cheung-Chi Leung:
Using dynamic conditional random field on single-microphone speech separation. ICASSP 2013: 146-150 - [c90]Feng Huang, Yu Ting Yeung, Tan Lee:
Evaluation of pitch estimation algorithms on separated speech. ICASSP 2013: 6807-6811 - [c89]Haipeng Wang, Tan Lee, Cheung-Chi Leung, Bin Ma, Haizhou Li:
Using parallel tokenizers with DTW matrix combination for low-resource spoken term detection. ICASSP 2013: 8545-8549 - [c88]Haipeng Wang, Tan Lee, Cheung-Chi Leung, Bin Ma, Haizhou Li:
Unsupervised mining of acoustic subword units with segment-level Gaussian posteriorgrams. INTERSPEECH 2013: 2297-2301 - [c87]Haipeng Wang, Tan Lee:
The CUHK Spoken Web Search System for MediaEval 2013. MediaEval 2013 - 2012
- [c86]Nengheng Zheng, Yi Cai, Xia Li, Tan Lee:
Classifying NMF components based on vector similarity for speech and music separation. APSIPA 2012: 1-6 - [c85]Ning Wang, P. C. Ching, Tan Lee:
Exploration of Phase and Vocal Excitation Modulation Features for Speaker Recognition. CCBR 2012: 251-259 - [c84]Yu Ting Yeung, Tan Lee, Cheung-Chi Leung:
Integrating multiple observations for model-based single-microphone speech separation with conditional random fields. ICASSP 2012: 257-260 - [c83]Feng Huang, Tan Lee, W. Bastiaan Kleijn:
Transform-domain Wiener filter for speech periodicity enhancement. ICASSP 2012: 4577-4580 - [c82]Feng Huang, Tan Lee:
Sparsity-based confidence measure for pitch estimation in noisy speech. ICASSP 2012: 4601-4604 - [c81]Haipeng Wang, Cheung-Chi Leung, Tan Lee, Bin Ma, Haizhou Li:
An acoustic segment modeling approach to query-by-example spoken term detection. ICASSP 2012: 5157-5160 - [c80]Feng Huang, Tan Lee:
Robust Pitch Estimation Using l1-regularized Maximum Likelihood Estimation. INTERSPEECH 2012: 378-381 - [c79]Huijun Ding, Tan Lee, Ing Yann Soon:
Two objective measures for speech distortion and noise reduction evaluation of enhanced speech signals. ISCSLP 2012: 117-121 - [c78]Haipeng Wang, Tan Lee:
CUHK System for the Spoken Web Search task at Mediaeval 2012. MediaEval 2012 - 2011
- [c77]Raymond W. M. Ng, Cheung-Chi Leung, Tan Lee, Bin Ma, Haizhou Li:
Score fusion and calibration in multiple language detectors with large performance variation. ICASSP 2011: 4404-4407 - [c76]Feng Huang, Tan Lee, W. Bastiaan Kleijn:
Transform-domain speech periodicity enhancement with adaptive coefficient weighting. ISPACS 2011: 1-5 - 2010
- [c75]Feng Huang, Tan Lee, W. Bastiaan Kleijn:
A method of speech periodicity enhancement based on transform-domain signal decomposition. EUSIPCO 2010: 984-988 - [c74]Meng Yuan, Haihong Feng, Tan Lee:
Improved Cantonese Tone Recognition with Approximated F0 Contour: Implications for Cochlear Implants. IALP 2010: 315-318 - [c73]Raymond W. M. Ng, Cheung-Chi Leung, Tan Lee, Bin Ma, Haizhou Li:
Prosodic attribute model for spoken language identification. ICASSP 2010: 5022-5025 - [c72]Feng Huang, Tan Lee:
Pitch estimation in noisy speech based on temporal accumulation of spectrum peaks. INTERSPEECH 2010: 641-644 - [c71]Houwei Cao, Tan Lee, P. C. Ching:
Cross-lingual speaker adaptation via Gaussian component mapping. INTERSPEECH 2010: 869-872 - [c70]Yujia Li, Tan Lee:
Perception-based automatic approximation of F0 contours in Cantonese speech. INTERSPEECH 2010: 1425-1428 - [c69]Raymond W. M. Ng, Cheung-Chi Leung, Ville Hautamäki, Tan Lee, Bin Ma, Haizhou Li:
Towards long-range prosodic attribute modeling for language recognition. INTERSPEECH 2010: 1792-1795 - [c68]Ning Wang, P. C. Ching, Tan Lee:
Exploitation of phase information for speaker recognition. INTERSPEECH 2010: 2126-2129 - [c67]Chun-Man Mak, Tan Lee, Siu Wa Lee:
Spectral trajectory estimation using nonnegative matrix factorization for model-based monaural speech separation. ISCSLP 2010: 23-28 - [c66]Houwei Cao, P. C. Ching, Tan Lee, Yu Ting Yeung:
Semantics-based language modeling for Cantonese-English code-mixing speech recognition. ISCSLP 2010: 246-250 - [c65]Nengheng Zheng, Xia Li, Thierry Blu, Tan Lee:
SURE-MSE speech enhancement for robust speech recognition. ISCSLP 2010: 271-274 - [c64]Yujia Li, Tan Lee:
Perception and analysis of linearly approximated F0 contours in Cantonese speech. ISCSLP 2010: 435-439 - [c63]Ning Wang, P. C. Ching, Tan Lee:
Robust speaker verification using phase information of speech. ISCSLP 2010: 483-487 - [c62]Chun-Man Mak, Tan Lee, Suman Senapati, Yu Ting Yeung, Wang-Kong Lam:
Similarity Measures for Chinese Pop Music Based on Low-level Audio Signal Attributes. ISMIR 2010: 513-518 - [c61]Raymond W. M. Ng, Cheung-Chi Leung, Tan Lee, Bin Ma, Haizhou Li:
Detection target dependent score calibration for language recognition. Odyssey 2010: 18 - 2009
- [c60]Raymond W. M. Ng, Tan Lee, Cheung-Chi Leung, Bin Ma, Haizhou Li:
Analysis and Selection of Prosodic Features for Language Identification. IALP 2009: 123-128 - [c59]Ning Wang, P. C. Ching, Tan Lee:
Exploration of vocal excitation modulation features for speaker recognition. INTERSPEECH 2009: 892-895 - [c58]Siu Wa Lee, Frank K. Soong, Tan Lee:
Model-based speech separation: identifying transcription using orthogonality. INTERSPEECH 2009: 1343-1346 - [c57]Houwei Cao, P. C. Ching, Tan Lee:
Effects of language mixing for automatic recognition of Cantonese-English code-mixing utterances. INTERSPEECH 2009: 3011-3014 - 2008
- [c56]Yu Ting Yeung, Yao Qian, Tan Lee, Frank K. Soong:
Prosody for Mandarin speech recognition: a comparative study of read and spontaneous speech. INTERSPEECH 2008: 1133-1136 - [c55]Yu Ting Yeung, Houwei Cao, Nengheng Zheng, Tan Lee, P. C. Ching:
Language modeling for speech recognition of spoken Cantonese. INTERSPEECH 2008: 1570-1573 - [c54]Yujia Li, Tan Lee:
A Perceptual Study of Approximated Cantonese Tone Contours. ISCSLP 2008: 49-52 - [c53]Raymond W. M. Ng, Tan Lee:
Entropy-Based Analysis of the Prosodic Features of Chinese Dialects. ISCSLP 2008: 65-68 - [c52]Nengheng Zheng, Xia Li, Houwei Cao, Tan Lee, P. C. Ching:
Deriving MFCC Parameters from the Dynamic Spectrum for Robust Speech Recognition. ISCSLP 2008: 85-88 - [c51]Siu Wa Lee, Frank K. Soong, P. C. Ching, Tan Lee:
Pitch Tracking for Model-Based Speech Separation. ISCSLP 2008: 145-148 - [c50]Meng Yuan, Tan Lee, Sigfrid D. Soli:
Mandarin Tone Perception with Temporal Envelope and Periodicity Cues from Different Frequency Regions. ISCSLP 2008: 338-341 - [c49]Wentao Gu, Tan Lee, P. C. Ching:
Prosodic Variation in Cantonese-English Code-Mixed Speech. ISCSLP 2008: 342-345 - 2007
- [c48]Wentao Gu, Rerrario Shui-Ching Ho, Tan Lee:
Modeling tones in hakka on the basis of the command-response model. INTERSPEECH 2007: 2633-2636 - [c47]Yujia Li, Tan Lee:
Perceptual equivalence of approximated Cantonese tone contours. INTERSPEECH 2007: 2677-2680 - [c46]Wentao Gu, Tan Lee:
Quantitative analysis of F0 contours of emotional speech of Mandarin. SSW 2007: 228-233 - 2006
- [c45]Yao Qian, Frank K. Soong, Tan Lee:
Tone-Enhanced Generalized Character Posterior Probability (GCPP) for Cantonese LVCSR. ICASSP (1) 2006: 133-136 - [c44]Hua Ouyang, Tan Lee, Wai Nang Chan:
Feature Extraction From Talking Mouths for Video-Based Bi-Modal Speaker Verification. ICASSP (5) 2006: 513-516 - [c43]Wai Nang Chan, Tan Lee, Nengheng Zheng, Hua Ouyang:
Use of Vocal Source Features in Speaker Segmentation. ICASSP (1) 2006: 657-660 - [c42]Joyce Y. C. Chan, P. C. Ching, Tan Lee, Houwei Cao:
Automatic speech recognition of Cantonese-English code-mixing utterances. INTERSPEECH 2006 - [c41]Xin Lei, Man-Hung Siu, Mei-Yuh Hwang, Mari Ostendorf, Tan Lee:
Improved tone modeling for Mandarin broadcast news speech recognition. INTERSPEECH 2006 - [c40]Raymond W. M. Ng, Tan Lee, Wentao Gu:
Towards automatic parameter extraction of command-response model for Cantonese. INTERSPEECH 2006 - [c39]Nengheng Zheng, Ning Wang, Tan Lee, P. C. Ching:
Speaker Verification Using Complementary Information from Vocal Source and Vocal Tract. ISCSLP (Selected Papers) 2006: 518-528 - [c38]Nengheng Zheng, P. C. Ching, Ning Wang, Tan Lee:
Integrating Complementary Features with a Confidence Measure for Speaker Identification. ISCSLP (Selected Papers) 2006: 549-557 - 2005
- [c37]Chen Yang, Frank K. Soong, Tan Lee:
Static and Dynamic Spectral Features: Their Noise Robustness and Optimal Weights for ASR. ICASSP (1) 2005: 241-244 - [c36]Joyce Y. C. Chan, P. C. Ching, Tan Lee:
Development of a Cantonese-English code-mixing speech corpus. INTERSPEECH 2005: 1533-1536 - 2004
- [c35]Yu Zhu, Tan Lee:
Explicit duration modeling for Cantonese connected-digit recognition. INTERSPEECH 2004: 685-688 - [c34]Yao Qian, Tan Lee, Frank K. Soong:
Tone information as a confidence measure for improving Cantonese LVCSR. INTERSPEECH 2004: 1965-1968 - [c33]Nengheng Zheng, P. C. Ching, Tan Lee:
Time -frequency analysis of vocal source signal for speaker recognition. INTERSPEECH 2004: 2333-2336 - [c32]Siu Wa Lee, Pak-Chung Ching, Tan Lee:
Noise-robust automatic speech recognition using mainlobe-resilient time-frequency quantile-based noise estimation. ISCAS (3) 2004: 425-428 - [c31]Chen Yang, Frank K. Soong, Tan Lee:
On noise robustness of dynamic and static features for continuous Cantonese digit recognition. ISCSLP 2004: 277-280 - [c30]Joyce Y. C. Chan, P. C. Ching, Tan Lee, Helen M. Meng:
Detection of language boundary in code-switching utterances by bi-phone probabilities. ISCSLP 2004: 293-296 - [c29]Chao Qin, Tan Lee:
Cantonese verbal information verification system using GMM-based anti-model. ISCSLP 2004: 297-300 - 2003
- [c28]Patgi Kam, Tan Lee, Frank K. Soong:
Modeling Cantonese pronunciation variation by acoustic model refinement. INTERSPEECH 2003: 1477-1480 - [c27]Yao Qian, Tan Lee, Yujia Li:
Overlapped di-tone modeling for tone recognition in continuous Cantonese speech. INTERSPEECH 2003: 1845-1848 - [c26]Wei Han, Kwok-Wai Hon, Cheong-Fat Chan, Tan Lee, Chiu-sing Choy, Kong-Pang Pun, Pak-Chung Ching:
An HMM-based speech recognition IC. ISCAS (2) 2003: 744-747 - 2002
- [c25]Ka-Yan Kwan, Tan Lee, Chen Yang:
Unsupervised n-best based model adaptation using model-level confidence measures. INTERSPEECH 2002: 69-72 - [c24]Tan Lee, Greg Kochanski, Chilin Shih, Yujia Li:
Modeling tones in continuous Cantonese speech. INTERSPEECH 2002: 2401-2404 - [c23]Yujia Li, Tan Lee, Yao Qian:
Acoustical F0 analysis of continuous cantonese speech. ISCSLP 2002 - 2001
- [c22]Ka Man Law, Tan Lee, Wai H. Lau:
Cantonese text-to-speech synthesis using sub-syllable units. INTERSPEECH 2001: 991-994 - [c21]Helen M. Meng, Shuk Fong Chan, Yee Fong Wong, Cheong Chat Chan, Yiu Wing Wong, Tien Ying Fung, Wai Ching Tsui, Ke Chen, Lan Wang, Ting-Yao Wu, Xiaolong Li, Tan Lee, Wing Nin Choi, P. C. Ching, Huisheng Chi:
ISIS: a learning system with combined interaction and delegation dialogs. INTERSPEECH 2001: 1551-1554 - [c20]H. S. Lam, Tan Lee, P. C. Ching:
A Low Missing Rate Audio Search Technique for Cantonese Radio Broadcast Recording. IEEE Pacific Rim Conference on Multimedia 2001: 546-549 - [c19]Wai Kit Lo, P. C. Ching, Tan Lee, Helen Meng:
Design, Compilation and Processing of CUCall: A Set of Cantonese Spoken Language Corpora Collected Over Telephone Networks. ROCLING 2001 - 2000
- [c18]Sheng Gao, Tan Lee, Yiu Wing Wong, Bo Xu, Pak-Chung Ching, Taiyi Huang:
Acoustic modeling for Chinese speech recognition: a comparative study of Mandarin and Cantonese. ICASSP 2000: 1261-1264 - [c17]Helen M. Meng, Shuk Fong Chan, Yee Fong Wong, Tien Ying Fung, Wai Ching Tsui, Tin Hang Lo, Cheong Chat Chan, Ke Chen, Lan Wang, Ting-Yao Wu, Xiaolong Li, Tan Lee, Wing Nin Choi, Yiu Wing Wong, P. C. Ching, Huisheng Chi:
ISIS: A multilingual spoken dialog system developed with CORBA and KQML agents. INTERSPEECH 2000: 150-153 - [c16]Wing Nin Choi, Yiu Wing Wong, Tan Lee, P. C. Ching:
Lexical tree decoding with a class-based language model for Chinese speech recognition. INTERSPEECH 2000: 174-177 - [c15]Ka Man Law, Tan Lee:
Using cross-syllable units for Cantonese speech synthesis. INTERSPEECH 2000: 407-410 - [c14]Wai H. Lau, Tan Lee, Yiu Wing Wong, P. C. Ching:
Incorporating tone information into Cantonese large-vocabulary continuous speech recognition. INTERSPEECH 2000: 883-886 - [c13]Wai H. Lau, Yiu Wing Wong, Wai Kit Lo, Tan Lee, P. C. Ching:
A Study on the Contribution of Lexical Tones in Chinese LVCSR. ISCSLP 2000 - [c12]Ka Man Law, Ka-Yan Kwan, Tan Lee:
Corpus-based Cantonese Speech Synthesis With Non-uniform Units. ISCSLP 2000 - 1999
- [c11]Chun-Ping Chan, Yiu Wing Wong, Tan Lee, Pak-Chung Ching:
Two-dimensional multi-resolution analysis of speech signals and its application to speech recognition. ICASSP 1999: 405-408 - [c10]Tan Lee, Helen M. Meng, Wai H. Lau, Wai Kit Lo, P. C. Ching:
Micro-prosodic control in cantonese text-to-speech synthesis. EUROSPEECH 1999: 1855-1858 - [c9]Yiu Wing Wong, Ka-Fai Chow, Wai H. Lau, Wai Kit Lo, Tan Lee, Pak-Chung Ching:
Acoustic modeling and language modeling for cantonese LVCSR. EUROSPEECH 1999 - 1998
- [c8]Tan Lee, Rolf Carlson, Björn Granström:
Context-dependent duration modelling for continuous speech recognition. ICSLP 1998 - [c7]Ka-Fai Chow, Tan Lee, P. C. Ching:
Sub-Syllable Acoustic Modelling for Cantonese Speech Recognition. ISCSLP 1998 - [c6]Wai Kit Lo, Tan Lee, P. C. Ching:
Development of Cantonese Spoken Language Corpora for Speech Application. ISCSLP 1998 - 1997
- [c5]Pak-Chung Ching, Ka-Fai Chow, Tan Lee, Alfred Ying Pang Ng, Lai-Wan Chan:
Development of a large vocabulary speech database for Cantonese. ICASSP 1997: 1775-1778 - [c4]Tan Lee, Pak-Chung Ching:
A neural network based speech recognition system for isolated Cantonese syllables. ICASSP 1997: 3269-3272 - 1996
- [c3]Tan Lee, P. C. Ching:
On improving discrimination capability of an RNN based recognizer. ICSLP 1996: 526-529 - 1995
- [c2]Tan Lee, Pak-Chung Ching, Lai-Wan Chan:
Recurrent neural networks for speech modeling and speech recognition. ICASSP 1995: 3319-3322 - [c1]Tan Lee, P. C. Ching, Lai-Wan Chan:
An RNN based speech recognition system with discriminative training. EUROSPEECH 1995: 1667-1670
Informal and Other Publications
- 2024
- [i50]Wei Liu, Jingyong Hou, Dong Yang, Muyong Cao, Tan Lee:
LUPET: Incorporating Hierarchical Information Path into Multilingual ASR. CoRR abs/2401.03689 (2024) - [i49]Yusheng Tian, Jingyu Li, Tan Lee:
Creating Personalized Synthetic Voices from Articulation Impaired Speech Using Augmented Reconstruction Loss. CoRR abs/2401.03816 (2024) - [i48]Wei Liu, Jingyong Hou, Dong Yang, Muyong Cao, Tan Lee:
A Parameter-efficient Language Extension Framework for Multilingual ASR. CoRR abs/2406.06329 (2024) - [i47]Dehua Tao, Daxin Tan, Yu Ting Yeung, Xiao Chen, Tan Lee:
ToneUnit: A Speech Discretization Approach for Tonal Language Speech Synthesis. CoRR abs/2406.08989 (2024) - [i46]Yusheng Tian, Junbin Liu, Tan Lee:
User-Driven Voice Generation and Editing through Latent Space Navigation. CoRR abs/2408.17068 (2024) - [i45]Dehua Tao, Harold Chui, Sarah Luk, Tan Lee:
CUEMPATHY: A Counseling Speech Dataset for Psychotherapy Research. CoRR abs/2409.02466 (2024) - 2023
- [i44]Wei Liu, Kaiqi Fu, Xiaohai Tian, Shuju Shi, Wei Li, Zejun Ma, Tan Lee:
Leveraging phone-level linguistic-acoustic similarity for utterance-level pronunciation scoring. CoRR abs/2302.10444 (2023) - [i43]Yujia Xiao, Shaofei Zhang, Xi Wang, Xu Tan, Lei He, Sheng Zhao, Frank K. Soong, Tan Lee:
ContextSpeech: Expressive and Efficient Text-to-Speech for Paragraph Reading. CoRR abs/2307.00782 (2023) - [i42]Wei Liu, Ying Qin, Zhiyuan Peng, Tan Lee:
Sparsely Shared LoRA on Whisper for Child Speech Recognition. CoRR abs/2309.11756 (2023) - [i41]Wei Liu, Zhiyuan Peng, Tan Lee:
CoMFLP: Correlation Measure based Fast Search on ASR Layer Pruning. CoRR abs/2309.11768 (2023) - [i40]Jingyu Li, Tan Lee:
Efficient Black-Box Speaker Verification Model Adaptation with Reprogramming and Backend Learning. CoRR abs/2309.13605 (2023) - 2022
- [i39]Si Ioi Ng, Cymie Wing-Yee Ng, Jiarui Wang, Tan Lee:
Automatic Detection of Speech Sound Disorder in Child Speech Using Posterior-based Speaker Representations. CoRR abs/2203.15405 (2022) - [i38]Guangyan Zhang, Kaitao Song, Xu Tan, Daxin Tan, Yuzi Yan, Yanqing Liu, Gang Wang, Wei Zhou, Tao Qin, Tan Lee, Sheng Zhao:
Mixed-Phoneme BERT: Improving BERT with Mixed Phoneme and Sup-Phoneme Representations for Text to Speech. CoRR abs/2203.17190 (2022) - [i37]Daxin Tan, Liqun Deng, Nianzu Zheng, Yu Ting Yeung, Xin Jiang, Xiao Chen, Tan Lee:
CorrectSpeech: A Fully Automated System for Speech Correction and Accent Reduction. CoRR abs/2204.05460 (2022) - [i36]Zhiyuan Peng, Xuanji He, Ke Ding, Tan Lee, Guanglu Wan:
Unifying Cosine and PLDA Back-ends for Speaker Verification. CoRR abs/2204.10523 (2022) - [i35]Wei Liu, Jingyu Li, Tan Lee:
An Investigation on Applying Acoustic Feature Conversion to ASR of Adult and Child Speech. CoRR abs/2205.12477 (2022) - [i34]Yusheng Tian, Jingyu Li, Tan Lee:
Transport-Oriented Feature Aggregation for Speaker Embedding Learning. CoRR abs/2206.12857 (2022) - [i33]Xu Yang, Daoyuan Wu, Xiao Yi, Jimmy H. M. Lee, Tan Lee:
iExam: A Novel Online Exam Monitoring and Analysis System Based on Face Detection and Recognition. CoRR abs/2206.13356 (2022) - [i32]Guangyan Zhang, Ying Qin, Wenjie Zhang, Jialun Wu, Mei Li, Yutao Gai, Feijun Jiang, Tan Lee:
iEmoTTS: Toward Robust Cross-Speaker Emotion Transfer and Control for Speech Synthesis based on Disentanglement between Prosody and Timbre. CoRR abs/2206.14866 (2022) - [i31]Jingyu Li, Yusheng Tian, Tan Lee:
Convolution-Based Channel-Frequency Attention for Text-Independent Speaker Verification. CoRR abs/2210.17310 (2022) - [i30]Zhiyuan Peng, Mingjie Shao, Xuanji He, Xu Li, Tan Lee, Ke Ding, Guanglu Wan:
Covariance Regularization for Probabilistic Linear Discriminant Analysis. CoRR abs/2212.03039 (2022) - [i29]Zhiyuan Peng, Xuanji He, Ke Ding, Tan Lee, Guanglu Wan:
Label-free Knowledge Distillation with Contrastive Loss for Light-weight Speaker Recognition. CoRR abs/2212.03090 (2022) - 2021
- [i28]Daxin Tan, Hing-Pang Huang, Guangyan Zhang, Tan Lee:
CUHK-EE voice cloning system for ICASSP 2021 M2VoC challenge. CoRR abs/2103.04699 (2021) - [i27]Shuiyang Mao, P. C. Ching, Tan Lee:
Enhancing Segment-Based Speech Emotion Recognition by Deep Self-Learning. CoRR abs/2103.16456 (2021) - [i26]Si Ioi Ng, Cymie Wing-Yee Ng, Jingyu Li, Tan Lee:
Detection of Consonant Errors in Disordered Speech Based on Consonant-vowel Segment Embedding. CoRR abs/2106.08536 (2021) - [i25]Daxin Tan, Liqun Deng, Yu Ting Yeung, Xin Jiang, Xiao Chen, Tan Lee:
EditSpeech: A Text Based Speech Editing System Using Partial Inference and Bidirectional Fusion. CoRR abs/2107.01554 (2021) - [i24]Guangyan Zhang, Ying Qin, Daxin Tan, Tan Lee:
Applying the Information Bottleneck Principle to Prosodic Representation Learning. CoRR abs/2108.02821 (2021) - [i23]Yuzhong Wu, Tan Lee:
Robust Feature Learning on Long-Duration Sounds for Acoustic Scene Classification. CoRR abs/2108.05008 (2021) - [i22]Wei Liu, Tan Lee:
Utterance-level neural confidence measure for end-to-end children speech recognition. CoRR abs/2109.07750 (2021) - [i21]Jingyu Li, Si Ioi Ng, Tan Lee:
Improving Text-Independent Speaker Verification with Auxiliary Speakers Using Graph. CoRR abs/2109.09674 (2021) - [i20]Guangyan Zhang, Yichong Leng, Daxin Tan, Ying Qin, Kaitao Song, Xu Tan, Sheng Zhao, Tan Lee:
A study on the efficacy of model pre-training in developing neural text-to-speech system. CoRR abs/2110.03857 (2021) - [i19]Daxin Tan, Guangyan Zhang, Tan Lee:
Environment Aware Text-to-Speech Synthesis. CoRR abs/2110.03887 (2021) - [i18]Si Ioi Ng, Tan Lee:
Data Augmentation with Locally-time Reversed Speech for Automatic Speech Recognition. CoRR abs/2110.04511 (2021) - [i17]Si Ioi Ng, Rui-Si Ma, Tan Lee, Raymond Kim-Wai Sum:
Acoustical Analysis of Speech Under Physical Stress in Relation to Physical Activities and Physical Literacy. CoRR abs/2111.12566 (2021) - 2020
- [i16]Si Ioi Ng, Cymie Wing-Yee Ng, Jiarui Wang, Tan Lee, Kathy Yuet-Sheung Lee, Michael Chi-Fai Tong:
CUCHILD: A Large-Scale Cantonese Corpus of Child Speech for Phonology and Articulation Assessment. CoRR abs/2008.03188 (2020) - [i15]Si Ioi Ng, Tan Lee:
Automatic Detection of Phonological Errors in Child Speech Using Siamese Recurrent Autoencoder. CoRR abs/2008.03193 (2020) - [i14]Shuiyang Mao, P. C. Ching, Tan Lee:
Emotion Profile Refinery for Speech Emotion Classification. CoRR abs/2008.05259 (2020) - [i13]Shuiyang Mao, P. C. Ching, Tan Lee:
EigenEmo: Spectral Utterance Representation Using Dynamic Mode Decomposition for Speech Emotion Classification. CoRR abs/2008.06665 (2020) - [i12]Shuiyang Mao, P. C. Ching, C.-C. Jay Kuo, Tan Lee:
Advancing Multiple Instance Learning with Attention Modeling for Categorical Speech Emotion Recognition. CoRR abs/2008.06667 (2020) - [i11]Man-Ling Sung, Siyuan Feng, Tan Lee:
Unsupervised Pattern Discovery from Thematic Speech Archives Based on Multilingual Bottleneck Features. CoRR abs/2011.01986 (2020) - [i10]Daxin Tan, Tan Lee:
Fine-grained style modelling and transfer in text-to-speech synthesis via content-style disentanglement. CoRR abs/2011.03943 (2020) - [i9]Si Ioi Ng, Wei Liu, Zhiyuan Peng, Siyuan Feng, Hing-Pang Huang, Odette Scharenborg, Tan Lee:
The CUHK-TUDELFT System for The SLT 2021 Children Speech Recognition Challenge. CoRR abs/2011.06239 (2020) - [i8]Man-Ling Sung, Tan Lee:
Unsupervised Spoken Term Discovery Based on Re-clustering of Hypothesized Speech Segments with Siamese and Triplet Networks. CoRR abs/2011.14062 (2020) - [i7]Xurong Xie, Xunying Liu, Tan Lee, Lan Wang:
Bayesian Learning for Deep Neural Network Adaptation. CoRR abs/2012.07460 (2020) - 2019
- [i6]Yuzhong Wu, Tan Lee:
Enhancing Sound Texture in CNN-Based Acoustic Scene Classification. CoRR abs/1901.01502 (2019) - [i5]Siyuan Feng, Tan Lee, Zhiyuan Peng:
Combining Adversarial Training and Disentangled Speech Representation for Robust Zero-Resource Subword Modeling. CoRR abs/1906.07234 (2019) - [i4]Siyuan Feng, Tan Lee:
Improving Unsupervised Subword Modeling via Disentangled Speech Representation Learning and Transformation. CoRR abs/1906.07245 (2019) - [i3]Siyuan Feng, Tan Lee:
Exploiting Cross-Lingual Speaker and Phonetic Diversity for Unsupervised Subword Modeling. CoRR abs/1908.03538 (2019) - [i2]Zhiyuan Peng, Siyuan Feng, Tan Lee:
Mixture factorized auto-encoder for unsupervised hierarchical deep factorization of speech signal. CoRR abs/1911.01806 (2019) - 2017
- [i1]Yuzhong Wu, Tan Lee:
Reducing Model Complexity for DNN Based Large-Scale Audio Classification. CoRR abs/1711.00229 (2017)
Coauthor Index
aka: P. C. Ching
aka: Wing Yee Ng
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-12-04 21:13 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint