default search action
Chin-Hui Lee 0001
Person information
- affiliation: Georgia Institute of Technology, School of Electrical and Computer Engineering, USA
- affiliation (1981-2001): Bell Laboratories, Dialogue Systems Research Department, Murray Hill, New Jersey, NY, USA
Other persons with the same name
- Chin-Hui Lee — disambiguation page
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j60]Hang Chen, Qing Wang, Jun Du, Bao-Cai Yin, Jia Pan, Chin-Hui Lee:
Optimizing Audio-Visual Speech Enhancement Using Multi-Level Distortion Measures for Audio-Visual Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 32: 2508-2521 (2024) - [j59]Zilu Guo, Qing Wang, Jun Du, Jia Pan, Qing-Feng Liu, Chin-Hui Lee:
A Variance-Preserving Interpolation Approach for Diffusion Models With Applications to Single Channel Speech Enhancement and Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 32: 3025-3038 (2024) - [j58]Hang Chen, Qing Wang, Jun Du, Genshun Wan, Shifu Xiong, Baocai Yin, Jia Pan, Chin-Hui Lee:
Collaborative Viseme Subword and End-to-End Modeling for Word-Level Lip Reading. IEEE Trans. Multim. 26: 9358-9371 (2024) - [c208]Yusheng Dai, Hang Chen, Jun Du, Ruoyu Wang, Shihao Chen, Haotian Wang, Chin-Hui Lee:
A Study of Dropout-Induced Modality Bias on Robustness to Missing Video Frames for Audio-Visual Speech Recognition. CVPR 2024: 27435-27445 - [c207]Hang Chen, Shilong Wu, Chenxi Wang, Jun Du, Chin-Hui Lee, Sabato Marco Siniscalchi, Shinji Watanabe, Jingdong Chen, Odette Scharenborg, Zhong-Qiu Wang, Bao-Cai Yin, Jia Pan:
Summary on the Multimodal Information-Based Speech Processing (MISP) 2023 Challenge. ICASSP Workshops 2024: 123-124 - [c206]Shilong Wu, Chenxi Wang, Hang Chen, Yusheng Dai, Chenyue Zhang, Ruoyu Wang, Hongbo Lan, Jun Du, Chin-Hui Lee, Jingdong Chen, Sabato Marco Siniscalchi, Odette Scharenborg, Zhong-Qiu Wang, Jia Pan, Jianqing Gao:
The Multimodal Information Based Speech Processing (MISP) 2023 Challenge: Audio-Visual Target Speaker Extraction. ICASSP 2024: 8351-8355 - [c205]Gaobin Yang, Maokui He, Shutong Niu, Ruoyu Wang, Yanyan Yue, Shuangqing Qian, Shilong Wu, Jun Du, Chin-Hui Lee:
Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding with Sequence-to-Sequence Architecture. ICASSP 2024: 11626-11630 - [c204]Haotian Wang, Jun Du, Yusheng Dai, Chin-Hui Lee, Yuling Ren, Yu Liu:
Improving Multi-Modal Emotion Recognition Using Entropy-Based Fusion and Pruning-Based Network Architecture Optimization. ICASSP 2024: 11766-11770 - [c203]Hao Yen, Sabato Marco Siniscalchi, Chin-Hui Lee:
Boosting End-to-End Multilingual Phoneme Recognition Through Exploiting Universal Speech Attributes Constraints. ICASSP 2024: 11876-11880 - [c202]Feng Ma, Yanhui Tu, Maokui He, Ruoyu Wang, Shutong Niu, Lei Sun, Zhongfu Ye, Jun Du, Jia Pan, Chin-Hui Lee:
A Spatial Long-Term Iterative Mask Estimation Approach for Multi-Channel Speaker Diarization and Speech Recognition. ICASSP 2024: 12331-12335 - [c201]Ya Jiang, Qing Wang, Jun Du, Maocheng Hu, Pengfei Hu, Zeyan Liu, Shi Cheng, Zhaoxu Nian, Yuxuan Dong, Mingqi Cai, Xin Fang, Chin-Hui Lee:
Exploring Audio-Visual Information Fusion for Sound Event Localization and Detection In Low-Resource Realistic Scenarios. ICME 2024: 1-6 - [c200]Chen-Yue Zhang, Hang Chen, Jun Du, Sabato Marco Siniscalchi, Ya Jiang, Chin-Hui Lee:
Summary on the Chat-Scenario Chinese Lipreading (ChatCLR) Challenge. ICME Workshops 2024: 1-6 - [i46]Hu Hu, Sabato Marco Siniscalchi, Chin-Hui Lee:
Bayesian adaptive learning to latent variables via Variational Bayes and Maximum a Posteriori. CoRR abs/2401.13766 (2024) - [i45]Yusheng Dai, Hang Chen, Jun Du, Ruoyu Wang, Shihao Chen, Jiefeng Ma, Haotian Wang, Chin-Hui Lee:
A Study of Dropout-Induced Modality Bias on Robustness to Missing Video Frames for Audio-Visual Speech Recognition. CoRR abs/2403.04245 (2024) - [i44]Hao Yen, Pin-Jui Ku, Sabato Marco Siniscalchi, Chin-Hui Lee:
Language-Universal Speech Attributes Modeling for Zero-Shot Multilingual Spoken Keyword Recognition. CoRR abs/2406.02488 (2024) - [i43]Ming Gao, Hang Chen, Jun Du, Xin Xu, Hongxiao Guo, Hui Bu, Jianxing Yang, Ming Li, Chin-Hui Lee:
Enhancing Voice Wake-Up for Dysarthria: Mandarin Dysarthria Speech Corpus Release and Customized System Design. CoRR abs/2406.10304 (2024) - [i42]Pin-Jui Ku, Chun-Wei Ho, Hao Yen, Sabato Marco Siniscalchi, Chin-Hui Lee:
An Explicit Consistency-Preserving Loss Function for Phase Reconstruction and Speech Enhancement. CoRR abs/2409.16282 (2024) - [i41]Mao-Kui He, Jun Du, Shutong Niu, Qing-Feng Liu, Chin-Hui Lee:
Quality-Aware End-to-End Audio-Visual Neural Speaker Diarization. CoRR abs/2410.22350 (2024) - 2023
- [j57]Shi Cheng, Jun Du, Shutong Niu, Alejandrina Cristià, Xin Wang, Qing Wang, Chin-Hui Lee:
Using iterative adaptation and dynamic mask for child speech extraction under real-world multilingual conditions. Speech Commun. 152: 102956 (2023) - [j56]Li Chai, Hang Chen, Jun Du, Qing-Feng Liu, Chin-Hui Lee:
Space-and-speaker-aware acoustic modeling with effective data augmentation for recognition of multi-array conversational speech. Speech Commun. 153: 102958 (2023) - [j55]Shutong Niu, Jun Du, Lei Sun, Yu Hu, Chin-Hui Lee:
QDM-SSD: Quality-Aware Dynamic Masking for Separation-Based Speaker Diarization. IEEE ACM Trans. Audio Speech Lang. Process. 31: 1037-1049 (2023) - [j54]Qing Wang, Jun Du, Huaxin Wu, Jia Pan, Feng Ma, Chin-Hui Lee:
A Four-Stage Data Augmentation Approach to ResNet-Conformer Based Acoustic Modeling for Sound Event Localization and Detection. IEEE ACM Trans. Audio Speech Lang. Process. 31: 1251-1264 (2023) - [j53]Mao-Kui He, Jun Du, Qing-Feng Liu, Chin-Hui Lee:
ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding. IEEE ACM Trans. Audio Speech Lang. Process. 31: 1561-1573 (2023) - [c199]Hang Chen, Jun Du, Zhe Wang, Chenxi Wang, Yuling Ren, Qinglong Li, Ruibo Liu, Chin-Hui Lee:
Correlated Multi-Level Speech Enhancement for Robust Real-World ASR Applications Using Mask-Waveform-Feature Optimization. APSIPA ASC 2023: 96-101 - [c198]Chang Wang, Jun Du, Hang Chen, Ruoyu Wang, Chao-Han Huck Yang, Jiangjiang Zhao, Yuling Ren, Qinglong Li, Chin-Hui Lee:
Enhancing Privacy Preservation with Quantum Computing for Word-Level Audio-Visual Speech Recognition. APSIPA ASC 2023: 635-642 - [c197]Shi Cheng, Jun Du, Qing Wang, Ya Jiang, Zhaoxu Nian, Shutong Niu, Chin-Hui Lee, Yu Gao, Wenbin Zhang:
Improving Sound Event Localization and Detection with Class-Dependent Sound Separation for Real-World Scenarios. APSIPA ASC 2023: 2068-2073 - [c196]Shilong Wu, Jun Du, Mao-Kui He, Shutong Niu, Hang Chen, Haitao Tang, Chin-Hui Lee:
Semi-Supervised Multi-Channel Speaker Diarization With Cross-Channel Attention. ASRU 2023: 1-8 - [c195]Hang Chen, Shilong Wu, Yusheng Dai, Zhe Wang, Jun Du, Chin-Hui Lee, Jingdong Chen, Shinji Watanabe, Sabato Marco Siniscalchi, Odette Scharenborg, Diyuan Liu, Bao-Cai Yin, Jia Pan, Jianqing Gao, Cong Liu:
Summary on the Multimodal Information Based Speech Processing (MISP) 2022 Challenge. ICASSP 2023: 1-2 - [c194]Ya Jiang, Hang Chen, Jun Du, Qing Wang, Chin-Hui Lee:
Incorporating Lip Features into Audio-Visual Multi-Speaker DOA Estimation by Gated Fusion. ICASSP 2023: 1-5 - [c193]Shutong Niu, Jun Du, Qing Wang, Li Chai, Huaxin Wu, Zhaoxu Nian, Lei Sun, Yi Fang, Jia Pan, Chin-Hui Lee:
An Experimental Study on Sound Event Localization and Detection Under Realistic Testing Conditions. ICASSP 2023: 1-5 - [c192]Qing Wang, Jun Du, Zhaoxu Nian, Shutong Niu, Li Chai, Huaxin Wu, Jia Pan, Chin-Hui Lee:
Loss Function Design for DNN-Based Sound Event Localization and Detection on Low-Resource Realistic Data. ICASSP 2023: 1-5 - [c191]Zhe Wang, Shilong Wu, Hang Chen, Mao-Kui He, Jun Du, Chin-Hui Lee, Jingdong Chen, Shinji Watanabe, Sabato Marco Siniscalchi, Odette Scharenborg, Diyuan Liu, Baocai Yin, Jia Pan, Jianqing Gao, Cong Liu:
The Multimodal Information Based Speech Processing (Misp) 2022 Challenge: Audio-Visual Diarization And Recognition. ICASSP 2023: 1-5 - [c190]Chao-Han Huck Yang, Bo Li, Yu Zhang, Nanxin Chen, Tara N. Sainath, Sabato Marco Siniscalchi, Chin-Hui Lee:
A Quantum Kernel Learning Approach to Acoustic Modeling for Spoken Command Recognition. ICASSP 2023: 1-5 - [c189]Chenyue Zhang, Hang Chen, Jun Du, Bao-Cai Yin, Jia Pan, Chin-Hui Lee:
Incorporating Visual Information Reconstruction into Progressive Learning for Optimizing audio-visual Speech Enhancement. ICASSP 2023: 1-5 - [c188]Yusheng Dai, Hang Chen, Jun Du, Xiaofei Ding, Ning Ding, Feijun Jiang, Chin-Hui Lee:
Improving Audio-Visual Speech Recognition by Lip-Subword Correlation Based Visual Pre-training and Cross-Modal Fusion Encoder. ICME 2023: 2627-2632 - [c187]Gaobin Yang, Jun Du, Maokui He, Shutong Niu, Baoxiang Li, Jiakui Li, Chin-Hui Lee:
AD-TUNING: An Adaptive CHILD-TUNING Approach to Efficient Hyperparameter Optimization of Child Networks for Speech Processing Tasks in the SUPERB Benchmark. INTERSPEECH 2023: 421-425 - [c186]Zilu Guo, Jun Du, Chin-Hui Lee, Yu Gao, Wenbin Zhang:
Variance-Preserving-Based Interpolation Diffusion Models for Speech Enhancement. INTERSPEECH 2023: 1065-1069 - [c185]Pin-Jui Ku, Chao-Han Huck Yang, Sabato Marco Siniscalchi, Chin-Hui Lee:
A Multi-dimensional Deep Structured State Space Approach to Speech Enhancement Using Small-footprint Models. INTERSPEECH 2023: 2453-2457 - [c184]Haotian Wang, Jun Du, Hengshun Zhou, Chin-Hui Lee, Yuling Ren, Jiangjiang Zhao:
A Multiple-Teacher Pruning Based Self-Distillation (MT-PSD) Approach to Model Compression for Audio-Visual Wake Word Spotting. INTERSPEECH 2023: 2678-2682 - [c183]Shutong Niu, Jun Du, Maokui He, Chin-Hui Lee, Baoxiang Li, Jiakui Li:
Unsupervised Adaptation with Quality-Aware Masking to Improve Target-Speaker Voice Activity Detection for Speaker Diarization. INTERSPEECH 2023: 3482-3486 - [i40]Zhe Wang, Shilong Wu, Hang Chen, Mao-Kui He, Jun Du, Chin-Hui Lee, Jingdong Chen, Shinji Watanabe, Sabato Marco Siniscalchi, Odette Scharenborg, Diyuan Liu, Baocai Yin, Jia Pan, Jianqing Gao, Cong Liu:
The Multimodal Information based Speech Processing (MISP) 2022 Challenge: Audio-Visual Diarization and Recognition. CoRR abs/2303.06326 (2023) - [i39]Pin-Jui Ku, Chao-Han Huck Yang, Sabato Marco Siniscalchi, Chin-Hui Lee:
A Multi-dimensional Deep Structured State Space Approach to Speech Enhancement Using Small-footprint Models. CoRR abs/2306.00331 (2023) - [i38]Zilu Guo, Jun Du, Chin-Hui Lee, Yu Gao, Wenbin Zhang:
Variance-Preserving-Based Interpolation Diffusion Models for Speech Enhancement. CoRR abs/2306.08527 (2023) - [i37]Yusheng Dai, Hang Chen, Jun Du, Xiaofei Ding, Ning Ding, Feijun Jiang, Chin-Hui Lee:
Improving Audio-Visual Speech Recognition by Lip-Subword Correlation Based Visual Pre-training and Cross-Modal Fusion Encoder. CoRR abs/2308.08488 (2023) - [i36]Ruoyu Wang, Maokui He, Jun Du, Hengshun Zhou, Shutong Niu, Hang Chen, Yanyan Yue, Gaobin Yang, Shilong Wu, Lei Sun, Yanhui Tu, Haitao Tang, Shuangqing Qian, Tian Gao, Mengzhi Wang, Genshun Wan, Jia Pan, Jianqing Gao, Chin-Hui Lee:
The USTC-NERCSLIP Systems for the CHiME-7 DASR Challenge. CoRR abs/2308.14638 (2023) - [i35]Shilong Wu, Chenxi Wang, Hang Chen, Yusheng Dai, Chenyue Zhang, Ruoyu Wang, Hongbo Lan, Jun Du, Chin-Hui Lee, Jingdong Chen, Shinji Watanabe, Sabato Marco Siniscalchi, Odette Scharenborg, Zhong-Qiu Wang, Jia Pan, Jianqing Gao:
The Multimodal Information Based Speech Processing (MISP) 2023 Challenge: Audio-Visual Target Speaker Extraction. CoRR abs/2309.08348 (2023) - [i34]Hao Yen, Sabato Marco Siniscalchi, Chin-Hui Lee:
Boosting End-to-End Multilingual Phoneme Recognition through Exploiting Universal Speech Attributes Constraints. CoRR abs/2309.08828 (2023) - [i33]Gaobin Yang, Maokui He, Shutong Niu, Ruoyu Wang, Yanyan Yue, Shuangqing Qian, Shilong Wu, Jun Du, Chin-Hui Lee:
Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding with Sequence-to-Sequence Architecture. CoRR abs/2309.09180 (2023) - 2022
- [c182]Hu Hu, Sabato Marco Siniscalchi, Chao-Han Huck Yang, Chin-Hui Lee:
A Variational Bayesian Approach to Learning Latent Variables for Acoustic Knowledge Transfer. ICASSP 2022: 4041-4045 - [c181]Hengshun Zhou, Jun Du, Chao-Han Huck Yang, Shifu Xiong, Chin-Hui Lee:
A Study of Designing Compact Audio-Visual Wake Word Spotting System Based on Iterative Fine-Tuning in Neural Network Pruning. ICASSP 2022: 7572-7576 - [c180]Shutong Niu, Jun Du, Lei Sun, Chin-Hui Lee:
Improving Separation-Based Speaker Diarization Via Iterative Model Refinement And Speaker Embedding Based Post-Processing. ICASSP 2022: 8387-8391 - [c179]Maokui He, Xiang Lv, Weilin Zhou, Jingjing Yin, Xiaoqi Zhang, Yuxuan Wang, Shutong Niu, Yuhang Cao, Heng Lu, Jun Du, Chin-Hui Lee:
The USTC-Ximalaya System for the ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription (M2met) Challenge. ICASSP 2022: 9166-9170 - [c178]Hang Chen, Hengshun Zhou, Jun Du, Chin-Hui Lee, Jingdong Chen, Shinji Watanabe, Sabato Marco Siniscalchi, Odette Scharenborg, Diyuan Liu, Bao-Cai Yin, Jia Pan, Jianqing Gao, Cong Liu:
The First Multimodal Information Based Speech Processing (Misp) Challenge: Data, Tasks, Baselines And Results. ICASSP 2022: 9266-9270 - [c177]Hengshun Zhou, Jun Du, Gongzhen Zou, Zhaoxu Nian, Chin-Hui Lee, Sabato Marco Siniscalchi, Shinji Watanabe, Odette Scharenborg, Jingdong Chen, Shifu Xiong, Jianqing Gao:
Audio-Visual Wake Word Spotting in MISP2021 Challenge: Dataset Release and Deep Analysis. INTERSPEECH 2022: 1111-1115 - [c176]Mao-Kui He, Jun Du, Chin-Hui Lee:
End-to-End Audio-Visual Neural Speaker Diarization. INTERSPEECH 2022: 1461-1465 - [c175]Hang Chen, Jun Du, Yusheng Dai, Chin-Hui Lee, Sabato Marco Siniscalchi, Shinji Watanabe, Odette Scharenborg, Jingdong Chen, Baocai Yin, Jia Pan:
Audio-Visual Speech Recognition in MISP2021 Challenge: Dataset Release and Deep Analysis. INTERSPEECH 2022: 1766-1770 - [c174]Yajian Wang, Jun Du, Hang Chen, Qing Wang, Chin-Hui Lee:
Deep Segment Model for Acoustic Scene Classification. INTERSPEECH 2022: 4177-4181 - [c173]Chao-Han Huck Yang, Jun Qi, Sabato Marco Siniscalchi, Chin-Hui Lee:
An Ensemble Teacher-Student Learning Approach with Poisson Sub-sampling to Differential Privacy Preserving Speech Recognition. ISCSLP 2022: 1-5 - [c172]Qing Wang, Hang Chen, Ya Jiang, Zhe Wang, Yuyang Wang, Jun Du, Chin-Hui Lee:
Deep Learning Based Audio-Visual Multi-Speaker DOA Estimation Using Permutation-Free Loss Function. ISCSLP 2022: 250-254 - [c171]Qing Wang, Jun Du, Siyuan Zheng, Yunqing Li, Yajian Wang, Yuzhong Wu, Hu Hu, Chao-Han Huck Yang, Sabato Marco Siniscalchi, Yannan Wang, Chin-Hui Lee:
A Study on Joint Modeling and Data Augmentation of Multi-Modalities for Audio-Visual Scene Classification. ISCSLP 2022: 453-457 - [c170]Chao-Han Huck Yang, I-Fan Chen, Andreas Stolcke, Sabato Marco Siniscalchi, Chin-Hui Lee:
An Experimental Study on Private Aggregation of Teacher Ensemble Learning for End-to-End Speech Recognition. SLT 2022: 1074-1080 - [i32]Maokui He, Xiang Lv, Weilin Zhou, Jingjing Yin, Xiaoqi Zhang, Yuxuan Wang, Shutong Niu, Yuhang Cao, Heng Lu, Jun Du, Chin-Hui Lee:
The USTC-Ximalaya system for the ICASSP 2022 multi-channel multi-party meeting transcription (M2MeT) challenge. CoRR abs/2202.04855 (2022) - [i31]Hengshun Zhou, Jun Du, Chao-Han Huck Yang, Shifu Xiong, Chin-Hui Lee:
A Study of Designing Compact Audio-Visual Wake Word Spotting System Based on Iterative Fine-Tuning in Neural Network Pruning. CoRR abs/2202.08509 (2022) - [i30]Qing Wang, Jun Du, Siyuan Zheng, Yunqing Li, Yajian Wang, Yuzhong Wu, Hu Hu, Chao-Han Huck Yang, Sabato Marco Siniscalchi, Yannan Wang, Chin-Hui Lee:
A study on joint modeling and data augmentation of multi-modalities for audio-visual scene classification. CoRR abs/2203.04114 (2022) - [i29]Chao-Han Huck Yang, I-Fan Chen, Andreas Stolcke, Sabato Marco Siniscalchi, Chin-Hui Lee:
An Experimental Study on Private Aggregation of Teacher Ensemble Learning for End-to-End Speech Recognition. CoRR abs/2210.05614 (2022) - [i28]Chao-Han Huck Yang, Jun Qi, Sabato Marco Siniscalchi, Chin-Hui Lee:
An Ensemble Teacher-Student Learning Approach with Poisson Sub-sampling to Differential Privacy Preserving Speech Recognition. CoRR abs/2210.06382 (2022) - [i27]Qing Wang, Hang Chen, Ya Jiang, Zhe Wang, Yuyang Wang, Jun Du, Chin-Hui Lee:
Deep Learning Based Audio-Visual Multi-Speaker DOA Estimation Using Permutation-Free Loss Function. CoRR abs/2210.14581 (2022) - [i26]Chao-Han Huck Yang, Bo Li, Yu Zhang, Nanxin Chen, Tara N. Sainath, Sabato Marco Siniscalchi, Chin-Hui Lee:
A Quantum Kernel Learning Approach to Acoustic Modeling for Spoken Command Recognition. CoRR abs/2211.01263 (2022) - 2021
- [j52]Hang Chen, Jun Du, Yu Hu, Li-Rong Dai, Bao-Cai Yin, Chin-Hui Lee:
Correlating subword articulation with lip shapes for embedding aware audio-visual speech enhancement. Neural Networks 143: 171-182 (2021) - [j51]Li Chai, Jun Du, Qing-Feng Liu, Chin-Hui Lee:
A Cross-Entropy-Guided Measure (CEGM) for Assessing Speech Recognition Performance and Optimizing DNN-Based Speech Enhancement. IEEE ACM Trans. Audio Speech Lang. Process. 29: 106-117 (2021) - [j50]Hengshun Zhou, Jun Du, Yuanyuan Zhang, Qing Wang, Qing-Feng Liu, Chin-Hui Lee:
Information Fusion in Attention Networks Using Adaptive and Multi-Level Factorized Bilinear Pooling for Audio-Visual Emotion Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 29: 2617-2629 (2021) - [c169]Koen Oostermeijer, Jun Du, Qing Wang, Chin-Hui Lee:
Speech Enhancement Autoencoder with Hierarchical Latent Structure. ICASSP 2021: 671-675 - [c168]Hu Hu, Chao-Han Huck Yang, Xianjun Xia, Xue Bai, Xin Tang, Yajian Wang, Shutong Niu, Li Chai, Juanjuan Li, Hongning Zhu, Feng Bao, Yuanjun Zhao, Sabato Marco Siniscalchi, Yannan Wang, Jun Du, Chin-Hui Lee:
A Two-Stage Approach to Device-Robust Acoustic Scene Classification. ICASSP 2021: 845-849 - [c167]Chao-Han Huck Yang, Jun Qi, Samuel Yen-Chi Chen, Pin-Yu Chen, Sabato Marco Siniscalchi, Xiaoli Ma, Chin-Hui Lee:
Decentralizing Feature Extraction with Quantum Convolutional Neural Network for Automatic Speech Recognition. ICASSP 2021: 6523-6527 - [c166]Zhaoxu Nian, Yan-Hui Tu, Jun Du, Chin-Hui Lee:
A Progressive Learning Approach to Adaptive Noise and Speech Estimation for Speech Enhancement and Noisy Speech Recognition. ICASSP 2021: 6913-6917 - [c165]Hengshun Zhou, Jun Du, Hang Chen, Zijun Jing, Shifu Xiong, Chin-Hui Lee:
Audio-Visual Information Fusion Using Cross-Modal Teacher-Student Learning for Voice Activity Detection in Realistic Environments. Interspeech 2021: 341-345 - [c164]Chao-Han Huck Yang, Sabato Marco Siniscalchi, Chin-Hui Lee:
PATE-AAE: Incorporating Adversarial Autoencoder into Private Aggregation of Teacher Ensembles for Spoken Command Classification. Interspeech 2021: 881-885 - [c163]Xiaoqi Zhang, Jun Du, Li Chai, Chin-Hui Lee:
A Maximum Likelihood Approach to SNR-Progressive Learning Using Generalized Gaussian Distribution for LSTM-Based Speech Enhancement. Interspeech 2021: 2701-2705 - [c162]Hang Chen, Jun Du, Yu Hu, Li-Rong Dai, Bao-Cai Yin, Chin-Hui Lee:
Automatic Lip-Reading with Hierarchical Pyramidal Convolution and Self-Attention for Image Sequences with No Word Boundaries. Interspeech 2021: 3001-3005 - [c161]Yu-Xuan Wang, Jun Du, Maokui He, Shutong Niu, Lei Sun, Chin-Hui Lee:
Scenario-Dependent Speaker Diarization for DIHARD-III Challenge. Interspeech 2021: 3106-3110 - [c160]Qing Wang, Huaxin Wu, Zijun Jing, Feng Ma, Yi Fang, Yuxuan Wang, Tairan Chen, Jia Pan, Jun Du, Chin-Hui Lee:
A Model Ensemble Approach for Sound Event Localization and Detection. ISCSLP 2021: 1-5 - [c159]Siyuan Zheng, Jun Du, Hengshun Zhou, Xue Bai, Chin-Hui Lee, Shipeng Li:
Speech Emotion Recognition Based on Acoustic Segment Model. ISCSLP 2021: 1-5 - [c158]Li Chai, Jun Du, Diyuan Liu, Yanhui Tu, Chin-Hui Lee:
Acoustic Modeling for Multi-Array Conversational Speech Recognition in the Chime-6 Challenge. SLT 2021: 912-918 - [i25]Qing Wang, Jun Du, Huaxin Wu, Jia Pan, Feng Ma, Chin-Hui Lee:
A Four-Stage Data Augmentation Approach to ResNet-Conformer Based Acoustic Modeling for Sound Event Localization and Detection. CoRR abs/2101.02919 (2021) - [i24]Yuxuan Wang, Mao-Kui He, Shutong Niu, Lei Sun, Tian Gao, Xin Fang, Jia Pan, Jun Du, Chin-Hui Lee:
USTC-NELSLIP System Description for DIHARD-III Challenge. CoRR abs/2103.10661 (2021) - [i23]Chao-Han Huck Yang, Sabato Marco Siniscalchi, Chin-Hui Lee:
PATE-AAE: Incorporating Adversarial Autoencoder into Private Aggregation of Teacher Ensembles for Spoken Command Classification. CoRR abs/2104.01271 (2021) - [i22]Chao-Han Huck Yang, Hu Hu, Sabato Marco Siniscalchi, Qing Wang, Yuyang Wang, Xianjun Xia, Yuanjun Zhao, Yuzhong Wu, Yannan Wang, Jun Du, Chin-Hui Lee:
A Lottery Ticket Hypothesis Framework for Low-Complexity Device-Robust Neural Acoustic Scene Classification. CoRR abs/2107.01461 (2021) - [i21]Shutong Niu, Jun Du, Lei Sun, Chin-Hui Lee:
Separation Guided Speaker Diarization in Realistic Mismatched Conditions. CoRR abs/2107.02357 (2021) - [i20]Hu Hu, Sabato Marco Siniscalchi, Chao-Han Huck Yang, Chin-Hui Lee:
A Variational Bayesian Approach to Learning Latent Variables for Acoustic Knowledge Transfer. CoRR abs/2110.08598 (2021) - [i19]Hengshun Zhou, Jun Du, Yuanyuan Zhang, Qing Wang, Qing-Feng Liu, Chin-Hui Lee:
Information Fusion in Attention Networks Using Adaptive and Multi-level Factorized Bilinear Pooling for Audio-visual Emotion Recognition. CoRR abs/2111.08910 (2021) - 2020
- [j49]Jun Qi, Jun Du, Sabato Marco Siniscalchi, Xiaoli Ma, Chin-Hui Lee:
On Mean Absolute Error for Deep Neural Network Based Vector-to-Vector Regression. IEEE Signal Process. Lett. 27: 1485-1489 (2020) - [j48]Yanhui Tu, Jun Du, Tian Gao, Chin-Hui Lee:
A Multi-Target SNR-Progressive Learning Approach to Regression Based Speech Enhancement. IEEE ACM Trans. Audio Speech Lang. Process. 28: 1608-1619 (2020) - [j47]Jun Qi, Jun Du, Sabato Marco Siniscalchi, Xiaoli Ma, Chin-Hui Lee:
Analyzing Upper Bounds on Mean Absolute Errors for Deep Neural Network-Based Vector-to-Vector Regression. IEEE Trans. Signal Process. 68: 3411-3422 (2020) - [c157]Jun Qi, Xiaoli Ma, Chin-Hui Lee, Jun Du, Sabato Marco Siniscalchi:
Performance Analysis for Tensor-Train Decomposition to Deep Neural Network Based Vector-to-Vector Regression. CISS 2020: 1-6 - [c156]Xue Bai, Jun Du, Jia Pan, Hengshun Zhou, Yanhui Tu, Chin-Hui Lee:
High-Resolution Attention Network with Acoustic Segment Model for Acoustic Scene Classification. ICASSP 2020: 656-660 - [c155]Chao-Han Huck Yang, Jun Qi, Pin-Yu Chen, Xiaoli Ma, Chin-Hui Lee:
Characterizing Speech Adversarial Examples Using Self-Attention U-Net Enhancement. ICASSP 2020: 3107-3111 - [c154]Chao-Han Huck Yang, Jun Qi, Pin-Yu Chen, Yi Ouyang, I-Te Danny Hung, Chin-Hui Lee, Xiaoli Ma:
Enhanced Adversarial Strategically-Timed Attacks Against Deep Reinforcement Learning. ICASSP 2020: 3407-3411 - [c153]Sicheng Wang, Wei Li, Sabato Marco Siniscalchi, Chin-Hui Lee:
A Cross-Task Transfer Learning Approach to Adapting Deep Speech Enhancement Models to Unseen Background Noise Using Paired Senone Classifiers. ICASSP 2020: 6219-6223 - [c152]Shutong Niu, Jun Du, Li Chai, Chin-Hui Lee:
A Maximum Likelihood Approach to Multi-Objective Learning Using Generalized Gaussian Distributions for Dnn-Based Speech Enhancement. ICASSP 2020: 6229-6233 - [c151]Yanhui Tu, Jun Du, Chin-Hui Lee:
2D-to-2D Mask Estimation for Speech Enhancement Based on Fully Convolutional Neural Network. ICASSP 2020: 6664-6668 - [c150]Lei Sun, Jun Du, Xueyang Zhang, Tian Gao, Xin Fang, Chin-Hui Lee:
Progressive Multi-Target Network Based Speech Enhancement with Snr-Preselection for Robust Speaker Diarization. ICASSP 2020: 7099-7103 - [c149]Xin Wang, Jun Du, Alejandrina Cristià, Lei Sun, Chin-Hui Lee:
A Study of Child Speech Extraction Using Joint Speech Enhancement and Separation in Realistic Conditions. ICASSP 2020: 7304-7308 - [c148]Zhong Meng, Hu Hu, Jinyu Li, Changliang Liu, Yan Huang, Yifan Gong, Chin-Hui Lee:
L-Vector: Neural Label Embedding for Domain Adaptation. ICASSP 2020: 7389-7393 - [c147]Jun Qi, Hu Hu, Yannan Wang, Chao-Han Huck Yang, Sabato Marco Siniscalchi, Chin-Hui Lee:
Tensor-To-Vector Regression for Multi-Channel Speech Enhancement Based on Tensor-Train Network. ICASSP 2020: 7504-7508 - [c146]Xin Tang, Jun Du, Li Chai, Yannan Wang, Qing Wang, Chin-Hui Lee:
Geometry Constrained Progressive Learning for Lstm-Based Speech Enhancement. ICASSP 2020: 7514-7518 - [c145]Jun Qi, Hu Hu, Yannan Wang, Chao-Han Huck Yang, Sabato Marco Siniscalchi, Chin-Hui Lee:
Exploring Deep Hybrid Tensor-to-Vector Network Architectures for Regression Based Speech Enhancement. INTERSPEECH 2020: 76-80 - [c144]Yanhui Tu, Jun Du, Lei Sun, Feng Ma, Jia Pan, Chin-Hui Lee:
A Space-and-Speaker-Aware Iterative Mask Estimation Approach to Multi-Channel Speech Recognition in the CHiME-6 Challenge. INTERSPEECH 2020: 96-100 - [c143]Hu Hu, Sabato Marco Siniscalchi, Yannan Wang, Chin-Hui Lee:
Relational Teacher Student Learning with Neural Label Embedding for Device Adaptation in Acoustic Scene Classification. INTERSPEECH 2020: 1196-1200 - [c142]Hu Hu, Sabato Marco Siniscalchi, Yannan Wang, Xue Bai, Jun Du, Chin-Hui Lee:
An Acoustic Segment Model Based Segment Unit Selection Approach to Acoustic Scene Classification with Partial Utterances. INTERSPEECH 2020: 1201-1205 - [c141]Hengshun Zhou, Jun Du, Yanhui Tu, Chin-Hui Lee:
Using Speech Enhancement Preprocessing for Speech Emotion Recognition in Realistic Noisy Conditions. INTERSPEECH 2020: 4098-4102 - [c140]Yu-Xuan Wang, Jun Du, Li Chai, Chin-Hui Lee, Jia Pan:
A Noise-Aware Memory-Attention Network Architecture for Regression-Based Speech Enhancement. INTERSPEECH 2020: 4501-4505 - [i18]Jun Qi, Hu Hu, Yannan Wang, Chao-Han Huck Yang, Sabato Marco Siniscalchi, Chin-Hui Lee:
Tensor-to-Vector Regression for Multi-channel Speech Enhancement based on Tensor-Train Network. CoRR abs/2002.00544 (2020) - [i17]Chao-Han Huck Yang, Jun Qi, Pin-Yu Chen, Yi Ouyang, I-Te Danny Hung, Chin-Hui Lee, Xiaoli Ma:
Enhanced Adversarial Strategically-Timed Attacks against Deep Reinforcement Learning. CoRR abs/2002.09027 (2020) - [i16]Chao-Han Huck Yang, Jun Qi, Pin-Yu Chen, Xiaoli Ma, Chin-Hui Lee:
Characterizing Speech Adversarial Examples Using Self-Attention U-Net Enhancement. CoRR abs/2003.13917 (2020) - [i15]Zhong Meng, Hu Hu, Jinyu Li, Changliang Liu, Yan Huang, Yifan Gong, Chin-Hui Lee:
L-Vector: Neural Label Embedding for Domain Adaptation. CoRR abs/2004.13480 (2020) - [i14]Hu Hu, Chao-Han Huck Yang, Xianjun Xia, Xue Bai, Xin Tang, Yajian Wang, Shutong Niu, Li Chai, Juanjuan Li, Hongning Zhu, Feng Bao, Yuanjun Zhao, Sabato Marco Siniscalchi, Yannan Wang, Jun Du, Chin-Hui Lee:
Device-Robust Acoustic Scene Classification Based on Two-Stage Categorization and Data Augmentation. CoRR abs/2007.08389 (2020) - [i13]Jun Qi, Hu Hu, Yannan Wang, Chao-Han Huck Yang, Sabato Marco Siniscalchi, Chin-Hui Lee:
Exploring Deep Hybrid Tensor-to-Vector Network Architectures for Regression Based Speech Enhancement. CoRR abs/2007.13024 (2020) - [i12]Hu Hu, Sabato Marco Siniscalchi, Yannan Wang, Xue Bai, Jun Du, Chin-Hui Lee:
An Acoustic Segment Model Based Segment Unit Selection Approach to Acoustic Scene Classification with Partial Utterances. CoRR abs/2008.00107 (2020) - [i11]Hu Hu, Sabato Marco Siniscalchi, Yannan Wang, Chin-Hui Lee:
Relational Teacher Student Learning with Neural Label Embedding for Device Adaptation in Acoustic Scene Classification. CoRR abs/2008.00110 (2020) - [i10]Jun Qi, Jun Du, Sabato Marco Siniscalchi, Xiaoli Ma, Chin-Hui Lee:
Analyzing Upper Bounds on Mean Absolute Errors for Deep Neural Network Based Vector-to-Vector Regression. CoRR abs/2008.05459 (2020) - [i9]Jun Qi, Jun Du, Sabato Marco Siniscalchi, Xiaoli Ma, Chin-Hui Lee:
On Mean Absolute Error for Deep Neural Network Based Vector-to-Vector Regression. CoRR abs/2008.07281 (2020) - [i8]Hang Chen, Jun Du, Yu Hu, Li-Rong Dai, Bao-Cai Yin, Chin-Hui Lee:
Correlating Subword Articulation with Lip Shapes for Embedding Aware Audio-Visual Speech Enhancement. CoRR abs/2009.09561 (2020) - [i7]Chao-Han Huck Yang, Jun Qi, Samuel Yen-Chi Chen, Pin-Yu Chen, Sabato Marco Siniscalchi, Xiaoli Ma, Chin-Hui Lee:
Decentralizing Feature Extraction with Quantum Convolutional Neural Network for Automatic Speech Recognition. CoRR abs/2010.13309 (2020) - [i6]Hu Hu, Chao-Han Huck Yang, Xianjun Xia, Xue Bai, Xin Tang, Yajian Wang, Shutong Niu, Li Chai, Juanjuan Li, Hongning Zhu, Feng Bao, Yuanjun Zhao, Sabato Marco Siniscalchi, Yannan Wang, Jun Du, Chin-Hui Lee:
A Two-Stage Approach to Device-Robust Acoustic Scene Classification. CoRR abs/2011.01447 (2020) - [i5]Hang Chen, Jun Du, Yu Hu, Li-Rong Dai, Chin-Hui Lee, Bao-Cai Yin:
Lip-reading with Hierarchical Pyramidal Convolution and Self-Attention. CoRR abs/2012.14360 (2020)
2010 – 2019
- 2019
- [j46]Lei Sun, Jun Du, Tian Gao, Yi Fang, Feng Ma, Chin-Hui Lee:
A Speaker-Dependent Approach to Separation of Far-Field Multi-Talker Microphone Array Speech for Front-End Processing in the CHiME-5 Challenge. IEEE J. Sel. Top. Signal Process. 13(4): 827-840 (2019) - [j45]Yanhui Tu, Jun Du, Lei Sun, Feng Ma, Hai-Kun Wang, Jingdong Chen, Chin-Hui Lee:
An iterative mask estimation approach to deep learning based multi-channel speech recognition. Speech Commun. 106: 31-43 (2019) - [j44]Li Chai, Jun Du, Qing-Feng Liu, Chin-Hui Lee:
Using Generalized Gaussian Distributions to Improve Regression Error Modeling for Deep Learning-Based Speech Enhancement. IEEE ACM Trans. Audio Speech Lang. Process. 27(12): 1919-1931 (2019) - [j43]Jun Qi, Jun Du, Sabato Marco Siniscalchi, Chin-Hui Lee:
A Theory on Deep Neural Network Based Vector-to-Vector Regression With an Illustration of Its Expressive Power in Speech Enhancement. IEEE ACM Trans. Audio Speech Lang. Process. 27(12): 1932-1943 (2019) - [j42]Wei Li, Nancy F. Chen, Sabato Marco Siniscalchi, Chin-Hui Lee:
Improving Mispronunciation Detection of Mandarin Tones for Non-Native Learners With Soft-Target Tone Labels and BLSTM-Based Deep Tone Models. IEEE ACM Trans. Audio Speech Lang. Process. 27(12): 2012-2024 (2019) - [j41]Yanhui Tu, Jun Du, Chin-Hui Lee:
Speech Enhancement Based on Teacher-Student Deep Learning Using Improved Speech Presence Probability for Noise-Robust Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 27(12): 2080-2091 (2019) - [c139]Xin Tang, Jun Du, Li Chai, Yannan Wang, Qing Wang, Chin-Hui Lee:
A LSTM-Based Joint Progressive Learning Framework for Simultaneous Speech Dereverberation and Denoising. APSIPA 2019: 274-278 - [c138]Nan Zhou, Jun Du, Yanhui Tu, Tian Gao, Chin-Hui Lee:
A Speech Enhancement Neural Network Architecture with SNR-Progressive Multi-Target Learning for Robust Speech Recognition. APSIPA 2019: 873-877 - [c137]Yanhui Tu, Jun Du, Chin-Hui Lee:
DNN Training Based on Classic Gain Function for Single-channel Speech Enhancement and Recognition. ICASSP 2019: 910-914 - [c136]Wei Li, Sicheng Wang, Ming Lei, Sabato Marco Siniscalchi, Chin-Hui Lee:
Improving Audio-visual Speech Recognition Performance with Cross-modal Student-teacher Training. ICASSP 2019: 6560-6564 - [c135]Lei Sun, Jun Du, Tian Gao, Yi Fang, Feng Ma, Jia Pan, Chin-Hui Lee:
A Two-stage Single-channel Speaker-dependent Speech Separation Approach for Chime-5 Challenge. ICASSP 2019: 6650-6654 - [c134]Feng Ma, Li Chai, Jun Du, Diyuan Liu, Zhongfu Ye, Chin-Hui Lee:
Acoustic Model Ensembling Using Effective Data Augmentation for CHiME-5 Challenge. INTERSPEECH 2019: 1258-1262 - [c133]Li Chai, Jun Du, Chin-Hui Lee:
KL-Divergence Regularized Deep Neural Network Adaptation for Low-Resource Speaker-Dependent Speech Enhancement. INTERSPEECH 2019: 1806-1810 - [c132]Li Chai, Jun Du, Chin-Hui Lee:
A Cross-Entropy-Guided (CEG) Measure for Speech Enhancement Front-End Assessing Performances of Back-End Automatic Speech Recognition. INTERSPEECH 2019: 3431-3435 - [c131]Xue Bai, Jun Du, Zi-Rui Wang, Chin-Hui Lee:
A Hybrid Approach to Acoustic Scene Classification Based on Universal Acoustic Models. INTERSPEECH 2019: 3619-3623 - 2018
- [j40]Qing Wang, Jun Du, Li-Rong Dai, Chin-Hui Lee:
A Multiobjective Learning and Ensembling Approach to High-Performance Speech Enhancement With Compact Neural Network Architectures. IEEE ACM Trans. Audio Speech Lang. Process. 26(7): 1181-1193 (2018) - [j39]Yanhui Tu, Jun Du, Chin-Hui Lee:
A Speaker-Dependent Approach to Single-Channel Joint Speech Separation and Acoustic Modeling Based on Deep Neural Networks for Robust Recognition of Multi-Talker Speech. J. Signal Process. Syst. 90(7): 963-973 (2018) - [j38]Zhengqi Wen, Kehuang Li, Zhen Huang, Chin-Hui Lee, Jianhua Tao:
Improving Deep Neural Network Based Speech Synthesis through Contextual Feature Parametrization and Multi-Task Learning. J. Signal Process. Syst. 90(7): 1025-1037 (2018) - [j37]Ju Lin, Wei Li, Yingming Gao, Yanlu Xie, Nancy F. Chen, Sabato Marco Siniscalchi, Jinsong Zhang, Chin-Hui Lee:
Improving Mandarin Tone Recognition Based on DNN by Combining Acoustic and Articulatory Features Using Extended Recognition Networks. J. Signal Process. Syst. 90(7): 1077-1087 (2018) - [c130]Yanhui Tu, Jun Du, Nan Zhou, Chin-Hui Lee:
Online LSTM-based Iterative Mask Estimation for Multi-Channel Speech Enhancement and ASR. APSIPA 2018: 362-366 - [c129]Tian Gao, Jun Du, Li-Rong Dai, Chin-Hui Lee:
Densely Connected Progressive Learning for LSTM-Based Speech Enhancement. ICASSP 2018: 5054-5058 - [c128]Lei Sun, Jun Du, Tian Gao, Yu-Ding Lu, Yu Tsao, Chin-Hui Lee, Neville Ryant:
A Novel LSTM-Based Speech Preprocessor for Speaker Diarization in Realistic Mismatch Conditions. ICASSP 2018: 5234-5238 - [c127]Wei Li, Nancy F. Chen, Sabato Marco Siniscalchi, Chin-Hui Lee:
Improving Mandarin Tone Mispronunciation Detection for Non-Native Learners with Soft-Target Tone Labels and BLSTM-Based Deep Models. ICASSP 2018: 6249-6253 - [c126]Lei Sun, Jun Du, Chao Jiang, Xueyang Zhang, Shan He, Bing Yin, Chin-Hui Lee:
Speaker Diarization with Enhancing Speech for the First DIHARD Challenge. INTERSPEECH 2018: 2793-2797 - [c125]Li Chai, Jun Du, Chin-Hui Lee:
Error Modeling via Asymmetric Laplace Distribution for Deep Neural Network Based Single-Channel Speech Enhancement. INTERSPEECH 2018: 3269-3273 - [c124]Quandong Wang, Sicheng Wang, Fengpei Ge, Chang Woo Han, Jaewon Lee, Lianghao Guo, Chin-Hui Lee:
Two-Stage Enhancement of Noisy and Reverberant Microphone Array Speech for Automatic Speech Recognition Systems Trained with Only Clean Speech. ISCSLP 2018: 21-25 - [c123]Xin Wang, Jun Du, Lei Sun, Qing Wang, Chin-Hui Lee:
A Progressive Deep Learning Approach to Child Speech Separation. ISCSLP 2018: 76-80 - [c122]Qing Wang, Jun Du, Li Chai, Li-Rong Dai, Chin-Hui Lee:
A Maximum Likelihood Approach to Masking-based Speech Enhancement Using Deep Neural Network. ISCSLP 2018: 295-299 - [i4]Li Chai, Jun Du, Chin-Hui Lee:
Acoustics-guided evaluation (AGE): a new measure for estimating performance of speech enhancement algorithms for robust ASR. CoRR abs/1811.11517 (2018) - 2017
- [j36]Yanhui Tu, Jun Du, Qing Wang, Xiao Bao, Li-Rong Dai, Chin-Hui Lee:
An information fusion framework with multi-channel feature concatenation and multi-perspective system combination for the deep-learning-based robust recognition of microphone array speech. Comput. Speech Lang. 46: 517-534 (2017) - [j35]Bo Wu, Minglei Yang, Kehuang Li, Zhen Huang, Sabato Marco Siniscalchi, Tong Wang, Chin-Hui Lee:
A reverberation-time-aware DNN approach leveraging spatial information for microphone array dereverberation. EURASIP J. Adv. Signal Process. 2017: 81 (2017) - [j34]Bo Wu, Kehuang Li, Fengpei Ge, Zhen Huang, Minglei Yang, Sabato Marco Siniscalchi, Chin-Hui Lee:
An End-to-End Deep Learning Approach to Simultaneous Speech Dereverberation and Acoustic Modeling for Robust Speech Recognition. IEEE J. Sel. Top. Signal Process. 11(8): 1289-1300 (2017) - [j33]Zhen Huang, Sabato Marco Siniscalchi, Chin-Hui Lee:
Hierarchical Bayesian combination of plug-in maximum a posteriori decoders in deep neural networks-based speech recognition and speaker adaptation. Pattern Recognit. Lett. 98: 1-7 (2017) - [j32]Tian Gao, Jun Du, Li-Rong Dai, Chin-Hui Lee:
A unified DNN approach to speaker-dependent simultaneous speech enhancement and speech separation in low SNR environments. Speech Commun. 95: 28-39 (2017) - [j31]Zhen Huang, Sabato Marco Siniscalchi, Chin-Hui Lee:
Bayesian Unsupervised Batch and Online Speaker Adaptation of Activation Function Parameters in Deep Models for Automatic Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 25(1): 60-71 (2017) - [j30]Bo Wu, Kehuang Li, Minglei Yang, Chin-Hui Lee:
A Reverberation-Time-Aware Approach to Speech Dereverberation Based on Deep Neural Networks. IEEE ACM Trans. Audio Speech Lang. Process. 25(1): 98-107 (2017) - [j29]Yannan Wang, Jun Du, Li-Rong Dai, Chin-Hui Lee:
A Gender Mixture Detection Approach to Unsupervised Single-Channel Speech Separation Based on Deep Neural Networks. IEEE ACM Trans. Audio Speech Lang. Process. 25(7): 1535-1546 (2017) - [j28]Ying-Hui Lai, Fei Chen, Syu-Siang Wang, Xugang Lu, Yu Tsao, Chin-Hui Lee:
A Deep Denoising Autoencoder Approach to Improving the Intelligibility of Vocoded Speech in Cochlear Implant Simulation. IEEE Trans. Biomed. Eng. 64(7): 1568-1578 (2017) - [c121]Yanhui Tu, Jun Du, Lei Sun, Chin-Hui Lee:
LSTM-based iterative mask estimation and post-processing for multi-channel speech enhancement. APSIPA 2017: 488-491 - [c120]Bo Wu, Kehuang Li, Zhen Huang, Sabato Marco Siniscalchi, Minglei Yang, Chin-Hui Lee:
A unified deep modeling approach to simultaneous speech dereverberation and recognition for the reverb challenge. HSCMA 2017: 36-40 - [c119]Qing Wang, Jun Du, Li-Rong Dai, Chin-Hui Lee:
Joint noise and mask aware training for DNN-based speech enhancement with SUB-band features. HSCMA 2017: 101-105 - [c118]Lei Sun, Jun Du, Li-Rong Dai, Chin-Hui Lee:
Multiple-target deep learning for LSTM-RNN based speech enhancement. HSCMA 2017: 136-140 - [c117]Sicheng Wang, Kehuang Li, Zhen Huang, Sabato Marco Siniscalchi, Chin-Hui Lee:
A transfer learning and progressive stacking approach to reducing deep model sizes with an application to speech enhancement. ICASSP 2017: 5575-5579 - [c116]Yanhui Tu, Jun Du, Lei Sun, Feng Ma, Chin-Hui Lee:
On Design of Robust Deep Models for CHiME-4 Multi-Channel Speech Recognition with Multiple Configurations of Array Microphones. INTERSPEECH 2017: 394-398 - [c115]Yannan Wang, Jun Du, Li-Rong Dai, Chin-Hui Lee:
A Maximum Likelihood Approach to Deep Neural Network Based Nonlinear Spectral Mapping for Single-Channel Speech Separation. INTERSPEECH 2017: 1178-1182 - [c114]Wei Li, Nancy F. Chen, Sabato Marco Siniscalchi, Chin-Hui Lee:
Improving Mispronunciation Detection for Non-Native Learners with Multisource Information and LSTM-Based Deep Models. INTERSPEECH 2017: 2759-2763 - [c113]Fengpei Ge, Kehuang Li, Bo Wu, Sabato Marco Siniscalchi, Yonghong Yan, Chin-Hui Lee:
Joint Training of Multi-Channel-Condition Dereverberation and Acoustic Modeling of Microphone Array Speech for Robust Distant Speech Recognition. INTERSPEECH 2017: 3847-3851 - [c112]Shi-Xue Wen, Jun Du, Chin-Hui Lee:
On generating mixing noise signals with basis functions for simulating noisy speech and learning dnn-based speech enhancement models. MLSP 2017: 1-6 - [i3]Yong Xu, Jun Du, Zhen Huang, Li-Rong Dai, Chin-Hui Lee:
Multi-Objective Learning and Mask-Based Post-Processing for Deep Neural Network Based Speech Enhancement. CoRR abs/1703.07172 (2017) - 2016
- [j27]Tian Gao, Jun Du, Yong Xu, Cong Liu, Li-Rong Dai, Chin-Hui Lee:
Joint training of DNNs by incorporating an explicit dereverberation structure for distant speech recognition. EURASIP J. Adv. Signal Process. 2016: 86 (2016) - [j26]Zhen Huang, Sabato Marco Siniscalchi, Chin-Hui Lee:
A unified approach to transfer learning of deep neural networks with applications to speaker adaptation in automatic speech recognition. Neurocomputing 218: 448-459 (2016) - [j25]Hamid Behravan, Ville Hautamäki, Sabato Marco Siniscalchi, Tomi Kinnunen, Chin-Hui Lee:
i-Vector Modeling of Speech Attributes for Automatic Foreign Accent Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 24(1): 29-41 (2016) - [j24]Jun Du, Yanhui Tu, Li-Rong Dai, Chin-Hui Lee:
A Regression Approach to Single-Channel Speech Separation Via High-Resolution Deep Neural Networks. IEEE ACM Trans. Audio Speech Lang. Process. 24(8): 1424-1437 (2016) - [j23]I-Fan Chen, Chongjia Ni, Boon Pang Lim, Nancy F. Chen, Chin-Hui Lee:
A Keyword-Aware Language Modeling Approach to Spoken Keyword Search. J. Signal Process. Syst. 82(2): 197-206 (2016) - [c111]Zhen Huang, Sabato Marco Siniscalchi, I-Fan Chen, Chin-Hui Lee:
Towards a direct Bayesian adaptation framework for deep models. APSIPA 2016: 1-4 - [c110]Su Jun Leow, Eng Siong Chng, Chin-Hui Lee:
Zero resource anti-spoofing detection for unit selection based synthetic speech using image spectrogram artifacts. APSIPA 2016: 1-6 - [c109]Wei Li, Sabato Marco Siniscalchi, Nancy F. Chen, Chin-Hui Lee:
Using tone-based extended recognition network to detect non-native Mandarin tone mispronunciations. APSIPA 2016: 1-4 - [c108]Yannan Wang, Jun Du, Li-Rong Dai, Chin-Hui Lee:
Unsupervised single-channel speech separation via deep neural network for different gender mixtures. APSIPA 2016: 1-4 - [c107]Zhengqi Wen, Kehuang Li, Jianhua Tao, Chin-Hui Lee:
Deep neural network based voice conversion with a large synthesized parallel corpus. APSIPA 2016: 1-5 - [c106]Bo Wu, Kehuang Li, Minglei Yang, Chin-Hui Lee:
A study on target feature activation and normalization and their impacts on the performance of DNN based speech dereverberation systems. APSIPA 2016: 1-4 - [c105]Bo Wu, Kehuang Li, Minglei Yang, Chin-Hui Lee:
A study on sampling of STFT modifications in time and frequency domains for DNN-based speech dereverberation. APSIPA 2016: 1-4 - [c104]Nancy F. Chen, Van Tung Pham, Haihua Xu, Xiong Xiao, Van Hai Do, Chongjia Ni, I-Fan Chen, Sunil Sivadas, Chin-Hui Lee, Eng Siong Chng, Bin Ma, Haizhou Li:
Exemplar-inspired strategies for low-resource spoken keyword search in Swahili. ICASSP 2016: 6040-6044 - [c103]Wei Li, Sabato Marco Siniscalchi, Nancy F. Chen, Chin-Hui Lee:
Improving non-native mispronunciation detection and enriching diagnostic feedback with DNN-based speech attribute modeling. ICASSP 2016: 6135-6139 - [c102]Jianqing Gao, Jun Du, Changqing Kong, Huaifang Lu, Enhong Chen, Chin-Hui Lee:
An experimental study on joint modeling of mixed-bandwidth data via deep neural networks for robust speech recognition. IJCNN 2016: 588-594 - [c101]Wei Li, Kehuang Li, Sabato Marco Siniscalchi, Nancy F. Chen, Chin-Hui Lee:
Detecting Mispronunciations of L2 Learners and Providing Corrective Feedback Using Knowledge-Guided and Data-Driven Decision Trees. INTERSPEECH 2016: 3127-3131 - [c100]Tian Gao, Jun Du, Li-Rong Dai, Chin-Hui Lee:
SNR-Based Progressive Learning of Deep Neural Network for Speech Enhancement. INTERSPEECH 2016: 3713-3717 - [c99]Kehuang Li, Bo Wu, Chin-Hui Lee:
An Iterative Phase Recovery Framework with Phase Mask for Spectral Mapping with an Application to Speech Enhancement. INTERSPEECH 2016: 3773-3777 - [c98]Yanhui Tu, Jun Du, Li-Rong Dai, Chin-Hui Lee:
A speaker-dependent deep learning approach to joint speech separation and acoustic modeling for multi-talker automatic speech recognition. ISCSLP 2016: 1-5 - [c97]Zhengqi Wen, Kehuang Li, Zhen Huang, Jianhua Tao, Chin-Hui Lee:
Learning auxiliary categorical information for speech synthesis based on deep and recurrent neural networks. ISCSLP 2016: 1-5 - 2015
- [j22]Yong Xu, Jun Du, Li-Rong Dai, Chin-Hui Lee:
A Regression Approach to Speech Enhancement Based on Deep Neural Networks. IEEE ACM Trans. Audio Speech Lang. Process. 23(1): 7-19 (2015) - [j21]Ji Wu, Miao Li, Chin-Hui Lee:
A Probabilistic Framework for Representing Dialog Systems and Entropy-Based Dialog Management Through Dynamic Stochastic State Evolution. IEEE ACM Trans. Audio Speech Lang. Process. 23(11): 2026-2035 (2015) - [c96]Jun Du, Qing Wang, Yanhui Tu, Xiao Bao, Li-Rong Dai, Chin-Hui Lee:
An information fusion approach to recognizing microphone array speech in the CHiME-3 challenge based on a deep learning framework. ASRU 2015: 430-435 - [c95]Tian Gao, Jun Du, Li Xu, Cong Liu, Li-Rong Dai, Chin-Hui Lee:
A unified speaker-dependent speech separation and enhancement system based on deep neural networks. ChinaSIP 2015: 687-691 - [c94]Tian Gao, Jun Du, Yong Xu, Cong Liu, Li-Rong Dai, Chin-Hui Lee:
Improving Deep Neural Network Based Speech Enhancement in Low SNR Environments. LVA/ICA 2015: 75-82 - [c93]Yanhui Tu, Jun Du, Li-Rong Dai, Chin-Hui Lee:
Speech Separation based on signal-noise-dependent deep neural networks for robust speech recognition. ICASSP 2015: 61-65 - [c92]Tian Gao, Jun Du, Li-Rong Dai, Chin-Hui Lee:
Joint training of front-end and back-end deep neural networks for robust speech recognition. ICASSP 2015: 4375-4379 - [c91]I-Fan Chen, Chongjia Ni, Boon Pang Lim, Nancy F. Chen, Chin-Hui Lee:
A keyword-aware grammar framework for LVCSR-based spoken keyword search. ICASSP 2015: 5196-5200 - [c90]Nancy F. Chen, Chongjia Ni, I-Fan Chen, Sunil Sivadas, Van Tung Pham, Haihua Xu, Xiong Xiao, Tze Siong Lau, Su Jun Leow, Boon Pang Lim, Cheung-Chi Leung, Lei Wang, Chin-Hui Lee, Alvina Goh, Engsiong Chng, Bin Ma, Haizhou Li:
Low-resource keyword search strategies for tamil. ICASSP 2015: 5366-5370 - [c89]Su Jun Leow, Engsiong Chng, Chin-Hui Lee:
Language-resource independent speech segmentation using cues from a spectrogram image. ICASSP 2015: 5813-5817 - [c88]Yannan Wang, Jun Du, Li-Rong Dai, Chin-Hui Lee:
High-resolution acoustic modeling and compact language modeling of language-universal speech attributes for spoken language identification. INTERSPEECH 2015: 992-996 - [c87]Zhen Huang, Sabato Marco Siniscalchi, I-Fan Chen, Jinyu Li, Jiadong Wu, Chin-Hui Lee:
Maximum a posteriori adaptation of network parameters in deep models. INTERSPEECH 2015: 1076-1080 - [c86]Yong Xu, Jun Du, Zhen Huang, Li-Rong Dai, Chin-Hui Lee:
Multi-objective learning and mask-based post-processing for deep neural network based speech enhancement. INTERSPEECH 2015: 1508-1512 - [c85]Ji Wu, Miao Li, Chin-Hui Lee:
An entropy minimization framework for goal-driven dialogue management. INTERSPEECH 2015: 2027-2031 - [c84]Qing Wang, Jun Du, Xiao Bao, Zi-Rui Wang, Li-Rong Dai, Chin-Hui Lee:
A universal VAD based on jointly trained deep neural networks. INTERSPEECH 2015: 2282-2286 - [c83]Kehuang Li, Zhen Huang, Yong Xu, Chin-Hui Lee:
DNN-based speech bandwidth expansion and its application to adding high-frequency missing features for automatic speech recognition of narrowband speech. INTERSPEECH 2015: 2578-2582 - [c82]Zhen Huang, Jinyu Li, Sabato Marco Siniscalchi, I-Fan Chen, Ji Wu, Chin-Hui Lee:
Rapid adaptation for deep neural networks through multi-task learning. INTERSPEECH 2015: 3625-3629 - [i2]Zhen Huang, Sabato Marco Siniscalchi, I-Fan Chen, Jiadong Wu, Chin-Hui Lee:
Maximum a Posteriori Adaptation of Network Parameters in Deep Models. CoRR abs/1503.02108 (2015) - [i1]Ji Wu, Miao Li, Chin-Hui Lee:
A Probabilistic Framework for Representing Dialog Systems and Entropy-Based Dialog Management through Dynamic Stochastic State Evolution. CoRR abs/1504.07182 (2015) - 2014
- [j20]Sabato Marco Siniscalchi, Torbjørn Svendsen, Chin-Hui Lee:
An artificial neural network approach to automatic speech processing. Neurocomputing 140: 326-338 (2014) - [j19]Yong Xu, Jun Du, Li-Rong Dai, Chin-Hui Lee:
An Experimental Study on Speech Enhancement Based on Deep Neural Networks. IEEE Signal Process. Lett. 21(1): 65-68 (2014) - [j18]Yu Tsao, Shigeki Matsuda, Chiori Hori, Hideki Kashioka, Chin-Hui Lee:
A MAP-based Online Estimation Approach to Ensemble Speaker and Speaking Environment Modeling. IEEE ACM Trans. Audio Speech Lang. Process. 22(2): 403-416 (2014) - [j17]Ilseo Kim, Chin-Hui Lee:
An Efficient Gradient-based Approach to Optimizing Average Precision Through Maximal Figure-of-Merit Learning. J. Signal Process. Syst. 74(3): 285-295 (2014) - [c81]Yong Xu, Jun Du, Li-Rong Dai, Chin-Hui Lee:
Global variance equalization for improving deep neural network based speech enhancement. ChinaSIP 2014: 71-75 - [c80]Zhen Huang, Chao Weng, Kehuang Li, You-Chi Cheng, Chin-Hui Lee:
Deep learning vector quantization for acoustic information retrieval. ICASSP 2014: 1350-1354 - [c79]I-Fan Chen, Sabato Marco Siniscalchi, Chin-Hui Lee:
Attribute based lattice rescoring in spontaneous speech recognition. ICASSP 2014: 3325-3329 - [c78]Kehuang Li, Zhen Huang, You-Chi Cheng, Chin-Hui Lee:
A maximal figure-of-merit learning approach to maximizing mean average precision with deep neural network based classifiers. ICASSP 2014: 4503-4507 - [c77]Hamid Behravan, Ville Hautamäki, Sabato Marco Siniscalchi, Tomi Kinnunen, Chin-Hui Lee:
Introducing attribute features to foreign accent recognition. ICASSP 2014: 5332-5336 - [c76]You-Chi Cheng, Ville Hautamäki, Zhen Huang, Kehuang Li, Chin-Hui Lee:
An i-vector based descriptor for alphabetical gesture recognition. ICASSP 2014: 6593-6597 - [c75]Jun Du, Qing Wang, Tian Gao, Yong Xu, Li-Rong Dai, Chin-Hui Lee:
Robust speech recognition with speech enhanced deep neural networks. INTERSPEECH 2014: 616-620 - [c74]Zhen Huang, Jinyu Li, Chao Weng, Chin-Hui Lee:
Beyond cross-entropy: towards better frame-level objective functions for deep neural network training in automatic speech recognition. INTERSPEECH 2014: 1214-1218 - [c73]Hamid Behravan, Ville Hautamäki, Sabato Marco Siniscalchi, Elie Khoury, Tommi Kurki, Tomi Kinnunen, Chin-Hui Lee:
Dialect levelling in Finnish: a universal speech attribute approach. INTERSPEECH 2014: 2165-2169 - [c72]Yong Xu, Jun Du, Li-Rong Dai, Chin-Hui Lee:
Dynamic noise aware training for speech enhancement based on deep neural networks. INTERSPEECH 2014: 2670-2674 - [c71]I-Fan Chen, Nancy F. Chen, Chin-Hui Lee:
A keyword-boosted sMBR criterion to enhance keyword search performance in deep neural network based acoustic modeling. INTERSPEECH 2014: 2779-2783 - [c70]Zhen Huang, Jinyu Li, Sabato Marco Siniscalchi, I-Fan Chen, Chao Weng, Chin-Hui Lee:
Feature space maximum a posteriori linear regression for adaptation of deep neural networks. INTERSPEECH 2014: 2992-2996 - [c69]Yannan Wang, Jun Du, Li-Rong Dai, Chin-Hui Lee:
A fusion approach to spoken language identification based on combining multiple phone recognizers and speech attribute detectors. ISCSLP 2014: 158-162 - [c68]I-Fan Chen, Chongjia Ni, Boon Pang Lim, Nancy F. Chen, Chin-Hui Lee:
A novel keyword+LVCSR-filler based grammar network representation for spoken keyword search. ISCSLP 2014: 192-196 - [c67]Yanhui Tu, Jun Du, Yong Xu, Li-Rong Dai, Chin-Hui Lee:
Speech separation based on improved deep neural networks with dual outputs of speech features for both target and interfering speakers. ISCSLP 2014: 250-254 - [c66]Yong Xu, Jun Du, Li-Rong Dai, Chin-Hui Lee:
Cross-language transfer learning for deep neural network based speech enhancement. ISCSLP 2014: 336-340 - 2013
- [j16]Sabato Marco Siniscalchi, Jeremy Reed, Torbjørn Svendsen, Chin-Hui Lee:
Universal attribute characterization of spoken languages for automatic spoken language recognition. Comput. Speech Lang. 27(1): 209-227 (2013) - [j15]Sabato Marco Siniscalchi, Jinyu Li, Chin-Hui Lee:
Model-based margin estimation for hidden Markov model learning and generalisation. IET Signal Process. 7(8): 704-709 (2013) - [j14]Sabato Marco Siniscalchi, Dong Yu, Li Deng, Chin-Hui Lee:
Exploiting deep neural networks for detection-based speech recognition. Neurocomputing 106: 148-157 (2013) - [j13]Chin-Hui Lee, Sabato Marco Siniscalchi:
An Information-Extraction Approach to Speech Processing: Analysis, Detection, Verification, and Recognition. Proc. IEEE 101(5): 1089-1115 (2013) - [j12]Sabato Marco Siniscalchi, Dong Yu, Li Deng, Chin-Hui Lee:
Speech Recognition Using Long-Span Temporal Patterns in a Deep Network Model. IEEE Signal Process. Lett. 20(3): 201-204 (2013) - [j11]Sabato Marco Siniscalchi, Torbjørn Svendsen, Chin-Hui Lee:
A Bottom-Up Modular Search Approach to Large Vocabulary Continuous Speech Recognition. IEEE Trans. Speech Audio Process. 21(4): 786-797 (2013) - [j10]Sabato Marco Siniscalchi, Jinyu Li, Chin-Hui Lee:
Hermitian Polynomial for Speaker Adaptation of Connectionist Speech Recognition Systems. IEEE Trans. Speech Audio Process. 21(10): 2152-2161 (2013) - [c65]I-Fan Chen, Sabato Marco Siniscalchi, Seokyong Moon, Daejin Shin, Myoung-Wan Koo, Minhwa Chung, Chin-Hui Lee:
An experimental study on structural-MAP approaches to implementing very large vocabulary speech recognition systems for real-world tasks. APSIPA 2013: 1-10 - [c64]Duc Hoang Ha Nguyen, Aleem Mushtaq, Xiong Xiao, Engsiong Chng, Haizhou Li, Chin-Hui Lee:
A particle filter compensation approach to robust LVCSR. APSIPA 2013: 1-7 - [c63]Chen-Yu Chiang, Sabato Marco Siniscalchi, Sin-Horng Chen, Chin-Hui Lee:
Knowledge integration for improving performance in LVCSR. INTERSPEECH 2013: 1786-1790 - [c62]Zhen Huang, You-Chi Cheng, Kehuang Li, Ville Hautamäki, Chin-Hui Lee:
A blind segmentation approach to acoustic event detection based on i-vector. INTERSPEECH 2013: 2282-2286 - [c61]Ville Hautamäki, You-Chi Cheng, Padmanabhan Rajan, Chin-Hui Lee:
Minimax i-vector extractor for short duration speaker verification. INTERSPEECH 2013: 3708-3712 - [c60]Sangmin Oh, A. G. Amitha Perera, Ilseo Kim, Megha Pandey, Kevin J. Cannons, Hossein Hajimirsadeghi, Arash Vahdat, Greg Mori, Ben Miller, Scott McCloskey, You-Chi Cheng, Zhen Huang, Chin-Hui Lee, Chenliang Xu, Rohit Kumar, Wei Chen, Jason J. Corso, Li Fei-Fei, Daphne Koller, Vignesh Ramanathan, Kevin Tang, Armand Joulin, Alexandre Alahi:
TRECVID 2013 GENIE: Multimedia Event Detection and Recounting. TRECVID 2013 - 2012
- [j9]Sabato Marco Siniscalchi, Dau-Cheng Lyu, Torbjørn Svendsen, Chin-Hui Lee:
Experiments on Cross-Language Attribute Detection and Phone Recognition With Minimal Target-Specific Training Data. IEEE Trans. Speech Audio Process. 20(3): 875-887 (2012) - [c59]Ilseo Kim, Sangmin Oh, Byungki Byun, A. G. Amitha Perera, Chin-Hui Lee:
Explicit Performance Metric Optimization for Fusion-Based Video Retrieval. ECCV Workshops (3) 2012: 395-405 - [c58]Dong Yu, Sabato Marco Siniscalchi, Li Deng, Chin-Hui Lee:
Boosting attribute and phone estimation accuracies with deep neural networks for detection-based speech recognition. ICASSP 2012: 4169-4172 - [c57]Ilseo Kim, Sangmin Oh, A. G. Amitha Perera, Chin-Hui Lee:
Per-Exemplar Fusion Learning for Video Retrieval and Recounting. ICME 2012: 146-151 - [c56]Byungki Byun, Ilseo Kim, Sabato Marco Siniscalchi, Chin-Hui Lee:
Consumer-level multimedia event detection through unsupervised audio signal modeling. INTERSPEECH 2012: 2081-2084 - [c55]Sabato Marco Siniscalchi, Jinyu Li, Chin-Hui Lee:
Hermitian based Hidden Activation Functions for Adaptation of Hybrid HMM/ANN Models. INTERSPEECH 2012: 2590-2593 - [c54]Su Jun Leow, Tze Siong Lau, Alvina Goh, Han Meng Peh, Teck Khim Ng, Sabato Marco Siniscalchi, Chin-Hui Lee:
A new confidence measure combining Hidden Markov Models and Artificial Neural Networks of phonemes for effective keyword spotting. ISCSLP 2012: 112-116 - [c53]Chen-Yu Chiang, Sabato Marco Siniscalchi, Yih-Ru Wang, Sin-Horng Chen, Chin-Hui Lee:
A study on cross-language knowledge integration in Mandarin LVCSR. ISCSLP 2012: 315-319 - [c52]A. G. Amitha Perera, Sangmin Oh, Megha Pandey, Tianyang Ma, Anthony Hoogs, Arash Vahdat, Kevin J. Cannons, Hossein Hajimirsadeghi, Greg Mori, Scott McCloskey, Ben Miller, Sharath Venkatesha, Pedro Davalos, Pradipto Das, Chenliang Xu, Jason J. Corso, Rohini K. Srihari, Ilseo Kim, You-Chi Cheng, Zhen Huang, Chin-Hui Lee, Kevin Tang, Li Fei-Fei, Daphne Koller:
TRECVID 2012 GENIE: Multimedia Event Detection and Recounting. TRECVID 2012 - 2011
- [c51]Sabato Marco Siniscalchi, Torbjørn Svendsen, Chin-Hui Lee:
A Bottom-Up Stepwise Knowledge-Integration Approach to Large Vocabulary Continuous Speech Recognition Using Weighted Finite State Machines. INTERSPEECH 2011: 901-904 - [c50]A. G. Amitha Perera, Sangmin Oh, Matthew J. Leotta, Ilseo Kim, Byungki Byun, Chin-Hui Lee, Scott McCloskey, Jingchen Liu, Ben Miller, Zhi Feng Huang, Arash Vahdat, Weilong Yang, Greg Mori, Kevin Tang, Daphne Koller, Li Fei-Fei, Kang Li, Gang Chen, Jason J. Corso, Yun Fu, Rohini K. Srihari:
GENIE TRECVID 2011 Multimedia Event Detection: Late-Fusion Approaches to Combine Multiple Audio-Visual features. TRECVID 2011 - 2010
- [j8]Xiong Xiao, Jinyu Li, Engsiong Chng, Haizhou Li, Chin-Hui Lee:
A Study on the Generalization Capability of Acoustic Models for Robust Speech Recognition. IEEE Trans. Speech Audio Process. 18(6): 1158-1169 (2010) - [c49]Yu Tsao, Hanwu Sun, Haizhou Li, Chin-Hui Lee:
An acoustic segment model approach to incorporating temporal information into speaker modeling for text-independent speaker recognition. ICASSP 2010: 4422-4425 - [c48]Sabato Marco Siniscalchi, Torbjørn Svendsen, Filippo Sorbello, Chin-Hui Lee:
Experimental studies on continuous speech recognition using neural architectures with "adaptive" hidden activation functions. ICASSP 2010: 4882-4885 - [c47]Jinyu Li, Yu Tsao, Chin-Hui Lee:
Shrinkage model adaptation in automatic speech recognition. INTERSPEECH 2010: 1656-1659 - [c46]Aleem Mushtaq, Yu Tsao, Chin-Hui Lee:
A particle filter feature compensation approach to robust speech recognition. INTERSPEECH 2010: 2054-2057 - [c45]Sabato Marco Siniscalchi, Jeremy Reed, Torbjørn Svendsen, Chin-Hui Lee:
Exploiting context-dependency and acoustic resolution of universal speech attribute models in spoken language recognition. INTERSPEECH 2010: 2718-2721 - [c44]Sabato Marco Siniscalchi, Torbjørn Svendsen, Chin-Hui Lee:
A survey on recent progress in the ASAT/SIRKUS paradigm. ISCSLP 2010: 465-470
2000 – 2009
- 2009
- [j7]Sabato Marco Siniscalchi, Chin-Hui Lee:
A study on integrating acoustic-phonetic information into lattice rescoring for automatic speech recognition. Speech Commun. 51(11): 1139-1153 (2009) - [j6]Janet M. Baker, Li Deng, James R. Glass, Sanjeev Khudanpur, Chin-Hui Lee, Nelson Morgan, Douglas D. O'Shaughnessy:
Developments and directions in speech recognition and understanding, Part 1 [DSP Education]. IEEE Signal Process. Mag. 26(3): 75-80 (2009) - [j5]Janet M. Baker, Li Deng, Sanjeev Khudanpur, Chin-Hui Lee, James R. Glass, Nelson Morgan, Douglas D. O'Shaughnessy:
Updated MINDS report on speech recognition and understanding, Part 2 [DSP Education]. IEEE Signal Process. Mag. 26(4): 78-85 (2009) - [j4]Yu Tsao, Chin-Hui Lee:
An Ensemble Speaker and Speaking Environment Modeling Approach to Robust Speech Recognition. IEEE Trans. Speech Audio Process. 17(5): 1025-1037 (2009) - [c43]Xiong Xiao, Jinyu Li, Engsiong Chng, Haizhou Li, Chin-Hui Lee:
A study on hidden Markov model's generalization capability for speech recognition. ASRU 2009: 255-260 - [c42]Yu Tsao, Shigeki Matsuda, Satoshi Nakamura, Chin-Hui Lee:
MAP estimation of online mapping parameters in ensemble speaker and speaking environment modeling. ASRU 2009: 271-275 - [c41]Yu Tsao, Jinyu Li, Chin-Hui Lee:
Ensemble speaker and speaking environment modeling approach with advanced online estimation process. ICASSP 2009: 3833-3836 - [c40]Sabato Marco Siniscalchi, Torbjørn Svendsen, Chin-Hui Lee:
A phonetic feature based lattice rescoring approach to LVCSR. ICASSP 2009: 3865-3868 - [c39]Hui Lin, Li Deng, Dong Yu, Yifan Gong, Alex Acero, Chin-Hui Lee:
A study on multilingual acoustic modeling for large vocabulary ASR. ICASSP 2009: 4333-4336 - [c38]Sabato Marco Siniscalchi, Jeremy Reed, Torbjørn Svendsen, Chin-Hui Lee:
Exploring universal attribute characterization of spoken languages for spoken language recognition. INTERSPEECH 2009: 168-171 - [c37]Shigeki Matsuda, Yu Tsao, Jinyu Li, Satoshi Nakamura, Chin-Hui Lee:
A study on soft margin estimation of linear regression parameters for speaker adaptation. INTERSPEECH 2009: 1603-1606 - [c36]Jeremy Reed, Yushi Ueda, Sabato Marco Siniscalchi, Yuuki Uchiyama, Shigeki Sagayama, Chin-Hui Lee:
Minimum Classification Error Training to Improve Isolated Chord Recognition. ISMIR 2009: 609-614 - [c35]Yu Tsao, Jinyu Li, Chin-Hui Lee, Satoshi Nakamura:
Soft margin estimation on improving environment structures for ensemble speaker and speaking environment modeling. IUCS 2009: 404-408 - 2008
- [j3]Donglai Zhu, Haizhou Li, Bin Ma, Chin-Hui Lee:
Optimizing the Performance of Spoken Language Recognition With Discriminative Training. IEEE Trans. Speech Audio Process. 16(8): 1642-1653 (2008) - [c34]Donglai Zhu, Haizhou Li, Bin Ma, Chin-Hui Lee:
Discriminative learning for optimizing detection performance in spoken language recognition. ICASSP 2008: 4161-4164 - [c33]Sabato Marco Siniscalchi, Torbjørn Svendsen, Chin-Hui Lee:
Toward a detector-based universal phone recognizer. ICASSP 2008: 4261-4264 - [c32]Jinyu Li, Zhi-Jie Yan, Chin-Hui Lee, Ren-Hua Wang:
Soft margin estimation with various separation levels for LVCSR. INTERSPEECH 2008: 269-272 - [c31]Yu Tsao, Chin-Hui Lee:
Improving the ensemble speaker and speaking environment modeling approach by enhancing the precision of the online estimation process. INTERSPEECH 2008: 1265-1268 - [c30]Jinyu Li, Chin-Hui Lee:
On a generalization of margin-based discriminative training to robust speech recognition. INTERSPEECH 2008: 1992-1995 - [c29]Sabato Marco Siniscalchi, Torbjørn Svendsen, Chin-Hui Lee:
A penalized logistic regression approach to detection based phone classification. INTERSPEECH 2008: 2390-2393 - [c28]Dau-Cheng Lyu, Sabato Marco Siniscalchi, Tae-Yoon Kim, Chin-Hui Lee:
Continuous phone recognition without target language training data. INTERSPEECH 2008: 2687-2690 - 2007
- [j2]Haizhou Li, Bin Ma, Chin-Hui Lee:
A Vector Space Modeling Approach to Spoken Language Identification. IEEE Trans. Speech Audio Process. 15(1): 271-284 (2007) - [j1]Jinyu Li, Ming Yuan, Chin-Hui Lee:
Approximate Test Risk Bound Minimization Through Soft Margin Estimation. IEEE Trans. Speech Audio Process. 15(8): 2393-2404 (2007) - [c27]Yu Tsao, Chin-Hui Lee:
Two extensions to ensemble speaker and speaking environment modeling for robust automatic speech recognition. ASRU 2007: 77-80 - [c26]Jinyu Li, Zhi-Jie Yan, Chin-Hui Lee, Ren-Hua Wang:
A study on soft margin estimation for LVCSR. ASRU 2007: 268-271 - [c25]Sabato Marco Siniscalchi, Torbjørn Svendsen, Chin-Hui Lee:
Towards bottom-up continuous phone recognition. ASRU 2007: 566-569 - [c24]Jinyu Li, Sabato Marco Siniscalchi, Chin-Hui Lee:
Approximate Test Risk Minimization Through Soft Margin Estimation. ICASSP (4) 2007: 653-656 - [c23]Sabato Marco Siniscalchi, Petr Schwarz, Chin-Hui Lee:
High-Accuracy Phone Recognition By Combining High-Performance Lattice Generation and Knowledge Based Rescoring. ICASSP (4) 2007: 869-872 - [c22]Filippo Vella, Chin-Hui Lee, Salvatore Gaglio:
Boosting of Maximal Figure of Merit Classifiers for Automatic Image Annotation. ICIP (2) 2007: 217-220 - [c21]Jinyu Li, Chin-Hui Lee:
Soft margin feature extraction for automatic speech recognition. INTERSPEECH 2007: 30-33 - [c20]Yu Tsao, Chin-Hui Lee:
An ensemble modeling approach to joint characterization of speaker and speaking environments. INTERSPEECH 2007: 1050-1053 - [c19]Yang Xiao, Tat-Seng Chua, Chin-Hui Lee:
Fusion of Region and Image-Based Techniques for Automatic Image Annotation. MMM (1) 2007: 247-258 - [c18]Mary P. Harper, Alex Acero, Srinivas Bangalore, Jaime Carbonell, Jordan Cohen, Barbara Cuthill, Carol Y. Espy-Wilson, Christiane Fellbaum, John Garofolo, Chin-Hui Lee, Jim Lester, Andrew McCallum, Nelson Morgan, Michael Picheney, Joe Picone, Lance Ramshaw, Jeffrey C. Reynar, Hadar Shemtov, Clare Voss:
Report on the NSF-sponsored Human Language Technology Workshop on Industrial Centers. MTSummit 2007 - [c17]Filippo Vella, Chin-Hui Lee:
Information fusion techniques for automatic image annotation. VISAPP (2) 2007: 60-67 - 2006
- [c16]Rui Shi, Tat-Seng Chua, Chin-Hui Lee, Sheng Gao:
Bayesian Learning of Hierarchical Multinomial Mixture Models of Concepts for Automatic Image Annotation. CIVR 2006: 102-112 - [c15]Jinyu Li, Ming Yuan, Chin-Hui Lee:
Soft margin estimation of hidden Markov model parameters. INTERSPEECH 2006 - [c14]Chengyuan Ma, Yu Tsao, Chin-Hui Lee:
A study on detection based automatic speech recognition. INTERSPEECH 2006 - [c13]Sabato Marco Siniscalchi, Jinyu Li, Chin-Hui Lee:
A study on lattice rescoring with knowledge scores for automatic speech recognition. INTERSPEECH 2006 - [c12]Yu Tsao, Chin-Hui Lee:
A vector space approach to environment modeling for robust speech recognition. INTERSPEECH 2006 - [c11]Jinyu Li, Sibel Yaman, Chin-Hui Lee, Bin Ma, Rong Tong, Donglai Zhu, Haizhou Li:
Language Recognition Based on Score Distribution Feature Vectors and Discriminative Classifier Fusion. Odyssey 2006: 1-5 - 2005
- [c10]Weon-Goo Kim, MinSeok Jang, Chin-Hui Lee:
Iterative Training Techniques for Phonetic Template Based Speech Recognition with a Speaker-Independent Phonetic Recognizer. Australian Conference on Artificial Intelligence 2005: 577-584 - [c9]Weon-Goo Kim, MinSeok Jang, Chin-Hui Lee:
Unsupervised Speaker Adaptation for Phonetic Transcription Based Voice Dialing. FSKD (2) 2005: 249-254 - [c8]Jinyu Li, Yu Tsao, Chin-Hui Lee:
A Study on Knowledge Source Integration for Candidate Rescoring in Automatic Speech Recognition. ICASSP (1) 2005: 837-840 - [c7]Yu Tsao, Jinyu Li, Chin-Hui Lee:
A study on separation between acoustic models and its applications. INTERSPEECH 2005: 1109-1112 - [c6]Bin Ma, Haizhou Li, Chin-Hui Lee:
An acoustic segment modeling approach to automatic language identification. INTERSPEECH 2005: 2829-2832 - [c5]Sheng Gao, Bin Ma, Haizhou Li, Chin-Hui Lee:
A text categorization approach to automatic language identification. INTERSPEECH 2005: 2837-2840 - [c4]Jinyu Li, Chin-Hui Lee:
On designing and evaluating speech event detectors. INTERSPEECH 2005: 3365-3368 - 2002
- [c3]Bin Ma, Cuntai Guan, Haizhou Li, Chin-Hui Lee:
Multilingual speech recognition with language identification. INTERSPEECH 2002: 505-508 - [c2]Sheng Gao, Jinsong Zhang, Satoshi Nakamura, Chin-Hui Lee, Tat-Seng Chua:
Weighted graph based decision tree optimization for high accuracy acoustic modeling. INTERSPEECH 2002: 1233-1236 - 2000
- [c1]Chin-Hui Lee:
From Graphical to Voice User Interface: The Next Revolution. ISCSLP 2000
Coauthor Index
aka: Mao-Kui He
aka: Bao-Cai Yin
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-12-04 21:08 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint