default search action
Ming Li 0026
Person information
- affiliation: Duke Kunshan University, Data Science Research Center, China
- affiliation (former): Sun Yat-Sen University Carnegie Mellon University Joint Institute of Engineering, China
- affiliation (former): University of Southern California, Los Angeles, CA, USA
- affiliation (former): Chinese Academy of Sciences, Institute of Acoustics, China
Other persons with the same name
- Ming Li — disambiguation page
- Ming Li 0001 — University of Waterloo, ON, Canada (and 3 more)
- Ming Li 0002 — East China Normal University, Shanghai, China (and 1 more)
- Ming Li 0003 (aka: Ming (Fred) Li) — University of Arizona, Tucson, AZ, USA (and 2 more)
- Ming Li 0004 — Xidian University, National Key Lab of Radar Signal Processing, Xi'an, China
- Ming Li 0005 — Nanjing University, National Key Laboratory for Novel Software Technology, China
- Ming Li 0006 — University of Texas at Arlington, TX, USA (and 2 more)
- Ming Li 0007 — California State University, Fresno, CA, USA (and 1 more)
- Ming Li 0008 — Worcester Polytechnic Institute, MA, USA
- Ming Li 0009 — IBM T. J. Watson Research Center, Yorktown Heights, NY, USA (and 1 more)
- Ming Li 0010 — Deakin University, VIC, Australia
- Ming Li 0011 — Dalian University of Technology, School of Information and Communication Engineering, China (and 2 more)
- Ming Li 0012 — Taiyuan University of Technology, College of Mathematics, China (and 1 more)
- Ming Li 0013 — China University of Mining & Technology, Xuzhou, China
- Ming Li 0014 — Unilever Corporate Research, Sharnbrook, Bedford, UK
- Ming Li 0015 — Lanzhou University of Technology, China
- Ming Li 0016 — RWTH Aachen University, Germany
- Ming Li 0017 — Zhejiang University, State Key Laboratory of CAD&CG, China
- Ming Li 0018 — Max-Planck-Institut für Informatik, Saarbrücken, Germany
- Ming Li 0019 — Carleton University, Ottawa, ON, Canada
- Ming Li 0020 — Google (and 1 more)
- Ming Li 0021 — Oracle (and 1 more)
- Ming Li 0022 — Vanderbilt University, Department of Biostatistics, Nashville, TN, USA
- Ming Li 0023 — Simon Fraser University, Burnaby, BC, Canada
- Ming Li 0024 — Concordia University, Department of Economics, Montreal, QC, Canada
- Ming Li 0025 — Chinese Academy of Sciences, Institute of Semiconductors, China (and 1 more)
- Ming Li 0028 — National University of Defense Technology, College of Mechatronic Engineering and Automation, Changsha, China
- Ming Li 0029 — Beihang University, School of Automation Science and Electrical Engineering, Beijing, China (and 2 more)
- Ming Li 0030 — Auburn University MRI Research Center, Auburn, USA
- Ming Li 0031 — China University of Mining and Technology, School of Computer Science and Technology, Xuzhou, China
- Ming Li 0032 — Heidelberg University, Institute of Geography, Germany
- Ming Li 0033 — Chinese Academy of Sciences, Institute of Information Engineering, State Key Laboratory of Information Security, Beijing, China
- Ming Li 0034 — Beihang University, Institute of Solid Mechanics, Beijing, China
- Ming Li 0035 — Aalto University, Department of Computer Science, Espoo, Finland
- Ming Li 0036 — Honghe University, Department of Mathematics, Mengzi, Yunnan, China
- Ming Li 0037 — Wuhan University, State Key Laboratory of Information Engineering in Surveying Mapping and Remote Sensing, China
- Ming Li 0038 — Second Military Medical University, Changhai Hospital, Department of Orthopaedics, Shanghai, China
- Ming Li 0039 — China Jiliang University, Department of Mathematics, Hangzhou, China
- Ming Li 0040 — Nanchang University, ISST, China (and 1 more)
- Ming Li 0041 — Tianjin Normal University, Tianjin Key Laboratory of Wireless Mobile Communications and Power Transmission, China (and 1 more)
- Ming Li 0042 — Hamburg University of Technology, Germany
- Ming Li 0043 — Unilever China (and 1 more)
- Ming Li 0044 — Colorado School of Mines, Department of Electrical Engineering and Computer Science, Golden, CO, USA
- Ming Li 0045 — Shanghai Jiao Tong University, Institute of Image Processing and Pattern Recognition, China
- Ming Li 0046 — Yanshan University, College of Electrical Engineering, Qinhuangdao, China
- Ming Li 0047 — Beijing Jiaotong University, School of Electronic and Information Engineering, China
- Ming Li 0048 — Sun Yat-sen University, School of Geography and Planning, Guangzhou, China
- Ming Li 0049 — Jinan University, College of Information Science and Technology, China
- Ming Li 0051 — China University of Petroleum, School of Economics and Management, Beijing, China
- Ming Li 0052 — National Institutes of Health, Center for Interventional Oncology / National Heart, Lung, and Blood Institute, Bethesda, MD, USA (and 1 more)
- Ming Li 0053 — Macquarie University, Sydney, NSW, Australia (and 3 more)
- Ming Li 0054 — Beihang University, School of Transportation Science and Engineering / Beijing Advanced Innovation Center for Big Data and Brain Computing, Beijing, China
- Ming Li 0055 — Hong Kong Polytechnic University, Department of Industrial and Systems Engineering, Hong Kong (and 2 more)
- Ming Li 0056 — Nanchang Hangkong University, MOE Key Laboratory of Nondestructive Testing, China (and 1 more)
- Ming Li 0057 — Ocean University of China, College of Engineering, Department of Automation, Qingdao, China (and 1 more)
- Ming Li 0058 — Shenyang University of Technology, School of Electrical Engineering, China
- Ming Li 0059 — National University of Defense Technology, College of Meteorology and Oceanography, Nanjing, China
- Ming Li 0060 — Harbin Engineering University, College of Computer Science and Technology, China
- Ming Li 0061 — Rizhao People's Hospital, Department of Nuclear Medicine, China
- Ming Li 0062 — CRRC Tangshan Company, Ltd., Tangshan, China
- Ming Li 0063 — Harbin Institute of Technology, Communication Research Center, China
- Ming Li 0064 — Beijing Institute of Technology, State Key Laboratory of Explosion Science and Technology, China
- Ming Li 0065 — Zhejiang Normal University, Department of Computer Science, Jinhua, China (and 2 more)
- Ming Li 0066 — National University of Defense Technology, College of Electronic Science and Technology, State Key Laboratory of Complex Electromagnetic Environment Effects on Electronics and Information System, Changsha, China
- Ming Li 0067 — Lappeenranta University of Technology, LUT, Laboratory of Intelligent Machines, Department of Mechanical Engineering, Finland
- Ming Li 0068 — University of Amsterdam, IRLab, Netherlands (and 1 more)
- Ming Li 0069 — Nanjing University, School of Electronic Science and Engineering, China
- Ming Li 0070 — China University of Petroleum, School of Science, Qingdao, China (and 1 more)
- Ming Li 0071 — Jiangsu Ocean University, Department of Computer Science and Technology, China (and 1 more)
- Ming Li 0072 — Wuhan University of Technology, China (and 1 more)
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
Journal Articles
- 2024
- [j37]Zexin Cai, Ming Li:
Integrating frame-level boundary detection and deepfake detection for locating manipulated regions in partially spoofed audio forgery attacks. Comput. Speech Lang. 85: 101597 (2024) - [j36]Chengyan Yu, Dong Zhang, Wei Zou, Ming Li:
Joint Training on Multiple Datasets With Inconsistent Labeling Criteria for Facial Expression Recognition. IEEE Trans. Affect. Comput. 15(3): 1812-1825 (2024) - [j35]Xiaoyi Qin, Na Li, Shufei Duan, Ming Li:
Investigating Long-Term and Short-Term Time-Varying Speaker Verification. IEEE ACM Trans. Audio Speech Lang. Process. 32: 3408-3423 (2024) - [j34]Danwei Cai, Ming Li:
Leveraging ASR Pretrained Conformers for Speaker Verification Through Transfer Learning and Knowledge Distillation. IEEE ACM Trans. Audio Speech Lang. Process. 32: 3532-3545 (2024) - [j33]Chengyan Yu, Shihuan Wang, Dong Zhang, Yingying Zhang, Chaoqun Cen, Zhixiang You, Xiaobing Zou, Hongzhu Deng, Ming Li:
HSVRS: A Virtual Reality System of the Hide-and-Seek Game to Enhance Gaze Fixation Ability for Autistic Children. IEEE Trans. Learn. Technol. 17: 2065-2078 (2024) - 2023
- [j32]Yaogen Yang, Haozhe Zhang, Zexin Cai, Yao Shi, Ming Li, Dong Zhang, Xiaojun Ding, Jianhua Deng, Jie Wang:
Electrolaryngeal speech enhancement based on a two stage framework with bottleneck feature refinement and voice conversion. Biomed. Signal Process. Control. 80(Part): 104279 (2023) - [j31]Zexin Cai, Yaogen Yang, Ming Li:
Cross-lingual multi-speaker speech synthesis with limited bilingual training data. Comput. Speech Lang. 77: 101427 (2023) - [j30]Weicong Chen, Dong Zhang, Ming Li, Dah-Jye Lee:
STCAM: Spatial-Temporal and Channel Attention Module for Dynamic Facial Expression Recognition. IEEE Trans. Affect. Comput. 14(1): 800-810 (2023) - [j29]Jianing Teng, Dong Zhang, Wei Zou, Ming Li, Dah-Jye Lee:
Typical Facial Expression Network Using a Facial Feature Decoupler and Spatial-Temporal Learning. IEEE Trans. Affect. Comput. 14(2): 1125-1137 (2023) - [j28]Ming Cheng, Yingying Zhang, Yixiang Xie, Yueran Pan, Xiao Li, Wenxing Liu, Chengyan Yu, Dong Zhang, Yu Xing, Xiaoqian Huang, Fang Wang, Cong You, Yuanyuan Zou, Yuchong Liu, Fengjing Liang, Huilin Zhu, Chun Tang, Hongzhu Deng, Xiaobing Zou, Ming Li:
Computer-Aided Autism Spectrum Disorder Diagnosis With Behavior Signal Processing. IEEE Trans. Affect. Comput. 14(4): 2982-3000 (2023) - [j27]Zhesi Zhu, Dong Zhang, Cailong Chi, Ming Li, Dah-Jye Lee:
A Complementary Dual-Branch Network for Appearance-Based Gaze Estimation From Low-Resolution Facial Image. IEEE Trans. Cogn. Dev. Syst. 15(3): 1323-1334 (2023) - [j26]Xiaoyi Qin, Danwei Cai, Ming Li:
Robust Multi-Channel Far-Field Speaker Verification Under Different In-Domain Data Availability Scenarios. IEEE ACM Trans. Audio Speech Lang. Process. 31: 71-85 (2023) - [j25]Xiao Li, Dong Zhang, Ming Li, Dah-Jye Lee:
Accurate Head Pose Estimation Using Image Rectification and a Lightweight Convolutional Neural Network. IEEE Trans. Multim. 25: 2239-2251 (2023) - 2022
- [j24]Yanze Xu, Weiqing Wang, Huahua Cui, Mingyang Xu, Ming Li:
Paralinguistic singing attribute recognition using supervised machine learning for describing the classical tenor solo singing voice in vocal pedagogy. EURASIP J. Audio Speech Music. Process. 2022(1): 8 (2022) - [j23]Danwei Cai, Weiqing Wang, Ming Li:
Incorporating Visual Information in Audio Based Self-Supervised Speaker Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 30: 1422-1435 (2022) - [j22]Weiqing Wang, Qingjian Lin, Danwei Cai, Ming Li:
Similarity Measurement of Segment-Level Speaker Embeddings in Speaker Diarization. IEEE ACM Trans. Audio Speech Lang. Process. 30: 2645-2658 (2022) - 2021
- [j21]Wenbo Liu, Ming Li, Xiaobing Zou, Bhiksha Raj:
Discriminative Dictionary Learning for Autism Spectrum Disorder Identification. Frontiers Comput. Neurosci. 15: 662401 (2021) - [j20]Ming Li, Hao Xu, Xingchang Huang, Zhanmei Song, Xiaolin Liu, Xin Li:
Facial Expression Recognition with Identity and Emotion Joint Learning. IEEE Trans. Affect. Comput. 12(2): 544-550 (2021) - [j19]Weiqing Wang, Jin Pan, Hua Yi, Zhanmei Song, Ming Li:
Audio-Based Piano Performance Evaluation for Beginners With Convolutional Neural Network and Attention Mechanism. IEEE ACM Trans. Audio Speech Lang. Process. 29: 1119-1133 (2021) - 2020
- [j18]Weicheng Cai, Jinkun Chen, Jun Zhang, Ming Li:
On-the-Fly Data Loader and Utterance-Level Aggregation for Speaker and Language Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 28: 1038-1051 (2020) - 2019
- [j17]Ming Li, Dengke Tang, Junlin Zeng, Tianyan Zhou, Huilin Zhu, Biyuan Chen, Xiaobing Zou:
An automated assessment framework for atypical prosody and stereotyped idiosyncratic phrases related to autism spectrum disorder. Comput. Speech Lang. 56: 80-94 (2019) - [j16]Zhicheng Li, Bin Hu, Ming Li, Gengnan Luo:
String Stability Analysis for Vehicle Platooning Under Unreliable Communication Links With Event-Triggered Strategy. IEEE Trans. Veh. Technol. 68(3): 2152-2164 (2019) - 2018
- [j15]Kong-Yik Chee, Zhe Jin, Danwei Cai, Ming Li, Wun-She Yap, Yen-Lung Lai, Bok-Min Goi:
Cancellable speech template via random binary orthogonal matrices projection hashing. Pattern Recognit. 76: 273-287 (2018) - 2017
- [j14]Yinliang Xu, Zaiyue Yang, Wei Gu, Ming Li, Zicong Deng:
Robust Real-Time Distributed Optimal Control Based Energy Management in a Smart Grid. IEEE Trans. Smart Grid 8(4): 1568-1579 (2017) - 2016
- [j13]Ming Li, Jangwon Kim, Adam C. Lammert, Prasanta Kumar Ghosh, Vikram Ramanarayanan, Shrikanth S. Narayanan:
Speaker verification based on the fusion of speech acoustics and inverted articulatory signals. Comput. Speech Lang. 36: 196-211 (2016) - [j12]Ming Li, Lun Liu, Weicheng Cai, Wenbo Liu:
Generalized I-vector Representation with Phonetic Tokenizations and Tandem Features for both Text Independent and Text Dependent Speaker Verification. J. Signal Process. Syst. 82(2): 207-215 (2016) - 2015
- [j11]Jangwon Kim, Naveen Kumar, Andreas Tsiartas, Ming Li, Shrikanth S. Narayanan:
Automatic intelligibility classification of sentence-level pathological speech. Comput. Speech Lang. 29(1): 132-144 (2015) - 2014
- [j10]Daniel Bone, Ming Li, Matthew P. Black, Shrikanth S. Narayanan:
Intoxicated speech detection: A fusion framework with speaker-normalized hierarchical functionals and GMM supervectors. Comput. Speech Lang. 28(2): 375-391 (2014) - [j9]Ming Li, Shrikanth S. Narayanan:
Simplified supervised i-vector modeling with application to robust and efficient language identification and speaker verification. Comput. Speech Lang. 28(4): 940-958 (2014) - 2013
- [j8]Ming Li, Kyu Jeong Han, Shrikanth S. Narayanan:
Automatic speaker age and gender recognition using acoustic and prosodic level information fusion. Comput. Speech Lang. 27(1): 151-167 (2013) - 2012
- [j7]Urbashi Mitra, B. Adar Emken, Sangwon Lee, Ming Li, Viktor Rozgic, Gautam Thatte, Harshvardhan Vathsangam, Daphney-Stavroula Zois, Murali Annavaram, Shrikanth S. Narayanan, Marco Levorato, Donna Spruijt-Metz, Gaurav S. Sukhatme:
KNOWME: a case study in wireless body area sensor network design. IEEE Commun. Mag. 50(5): 116-125 (2012) - [j6]Gautam Thatte, Ming Li, Sangwon Lee, B. Adar Emken, Shrikanth S. Narayanan, Urbashi Mitra, Donna Spruijt-Metz, Murali Annavaram:
KNOWME: An Energy-Efficient Multimodal Body Area Network for Physical Activity Monitoring. ACM Trans. Embed. Comput. Syst. 11(S2): 48:1-48:24 (2012) - 2011
- [j5]Gautam Thatte, Ming Li, Sangwon Lee, B. Adar Emken, Murali Annavaram, Shrikanth S. Narayanan, Donna Spruijt-Metz, Urbashi Mitra:
Optimal Time-Resource Allocation for Energy-Efficient Physical Activity Detection. IEEE Trans. Signal Process. 59(4): 1843-1857 (2011) - 2009
- [j4]Chuan Cao, Ming Li, Xiao Wu, Hongbin Suo, Jian Liu, Yonghong Yan:
Automatic Singing Performance Evaluation for Untrained Singers. IEICE Trans. Inf. Syst. 92-D(8): 1596-1600 (2009) - 2008
- [j3]Hongbin Suo, Ming Li, Ping Lu, Yonghong Yan:
Using SVM as Back-End Classifier for Language Identification. EURASIP J. Audio Speech Music. Process. 2008 (2008) - [j2]Hongbin Suo, Ming Li, Ping Lu, Yonghong Yan:
Automatic Language Identification with Discriminative Language Characterization Based on SVM. IEICE Trans. Inf. Syst. 91-D(3): 567-575 (2008) - [j1]Xiao Wu, Ming Li, Hongbin Suo, Yonghong Yan:
Melody Track Selection Using Discriminative Language Model. IEICE Trans. Inf. Syst. 91-D(6): 1838-1840 (2008)
Conference and Workshop Papers
- 2024
- [c133]Rongqi Bei, Yajie Liu, Yihe Wang, Yuxuan Huang, Ming Li, Yuhang Zhao, Xin Tong:
StarRescue: the Design and Evaluation of A Turn-Taking Collaborative Game for Facilitating Autistic Children's Social Skills. CHI 2024: 67:1-67:19 - [c132]Zexin Cai, Ming Li:
Invertible Voice Conversion with Parallel Data. ICASSP 2024: 10041-10045 - [c131]Yuke Lin, Xiaoyi Qin, Guoqing Zhao, Ming Cheng, Ning Jiang, Haiying Wu, Ming Li:
Voxblink: A Large Scale Speaker Verification Dataset on Camera. ICASSP 2024: 10271-10275 - [c130]Weiqing Wang, Danwei Cai, Ming Cheng, Ming Li:
Joint Inference of Speaker Diarization and ASR with Multi-Stage Information Sharing. ICASSP 2024: 11011-11015 - [c129]Haoxu Wang, Ming Cheng, Qiang Fu, Ming Li:
Robust Wake Word Spotting With Frame-Level Cross-Modal Attention Based Audio-Visual Conformer. ICASSP 2024: 11556-11560 - [c128]Bang Zeng, Ming Cheng, Yao Tian, Haifeng Liu, Ming Li:
Efficient Personal Voice Activity Detection with Wake Word Reference Speech. ICASSP 2024: 12241-12245 - 2023
- [c127]Bang Zeng, Hongbin Suo, Yulong Wan, Ming Li:
Low-complexity Multi-Channel Speaker Extraction with Pure Speech Cues. APSIPA ASC 2023: 114-118 - [c126]Xiaoyi Qin, Xingming Wang, Yanli Chen, Qinglin Meng, Ming Li:
From Speaker Verification to Deepfake Algorithm Recognition: Our Learned Lessons from ADD2023 Track 3. DADA@IJCAI 2023: 107-112 - [c125]Danwei Cai, Zexin Cai, Ming Li:
Identifying Source Speakers for Voice Conversion Based Spoofing Attacks on Speaker Verification Systems. ICASSP 2023: 1-5 - [c124]Zexin Cai, Weiqing Wang, Ming Li:
Waveform Boundary Detection for Partially Spoofed Audio. ICASSP 2023: 1-5 - [c123]Danwei Cai, Weiqing Wang, Ming Li, Rui Xia, Chuanzeng Huang:
Pretraining Conformer with ASR for Speaker Verification. ICASSP 2023: 1-5 - [c122]Ming Cheng, Haoxu Wang, Ziteng Wang, Qiang Fu, Ming Li:
The WHU-Alibaba Audio-Visual Speaker Diarization System for the MISP 2022 Challenge. ICASSP 2023: 1-2 - [c121]Ming Cheng, Weiqing Wang, Yucong Zhang, Xiaoyi Qin, Ming Li:
Target-Speaker Voice Activity Detection Via Sequence-to-Sequence Prediction. ICASSP 2023: 1-5 - [c120]Haoxu Wang, Ming Cheng, Qiang Fu, Ming Li:
The DKU Post-Challenge Audio-Visual Wake Word Spotting System for the 2021 MISP Challenge: Deep Analysis. ICASSP 2023: 1-5 - [c119]Xingming Wang, Hao Wu, Chen Ding, Chuanzeng Huang, Ming Li:
Exploring Universal Singing Speech Language Identification Using Self-Supervised Learning Based Front-End Features. ICASSP 2023: 1-5 - [c118]Bang Zeng, Hongbin Suo, Yulong Wan, Ming Li:
SEF-Net: Speaker Embedding Free Target Speaker Extraction Network. INTERSPEECH 2023: 3452-3456 - [c117]Xingming Wang, Bang Zeng, Hongbin Suo, Yulong Wan, Ming Li:
Robust Audio Anti-spoofing Countermeasure with Joint Training of Front-end and Back-end Models. INTERSPEECH 2023: 4004-4008 - [c116]Yucong Zhang, Hongbin Suo, Yulong Wan, Ming Li:
Outlier-aware Inlier Modeling and Multi-scale Scoring for Anomalous Sound Detection via Multitask Learning. INTERSPEECH 2023: 5381-5385 - [c115]Wenxing Liu, Ming Cheng, Yueran Pan, Lynn Yuan, Suxiu Hu, Ming Li, Songtian Zeng:
Assessing the Social Skills of Children with Autism Spectrum Disorder via Language-Image Pre-training Models. PRCV (13) 2023: 260-271 - 2022
- [c114]Haozhe Zhang, Zexin Cai, Xiaoyi Qin, Ming Li:
SIG-VC: A Speaker Information Guided Zero-Shot Voice Conversion System for Both Human Beings and Machines. ICASSP 2022: 6567-65571 - [c113]Xiaoyi Qin, Na Li, Chao Weng, Dan Su, Ming Li:
Simple Attention Module Based Speaker Verification with Iterative Noisy Label Detection. ICASSP 2022: 6722-6726 - [c112]Qingjian Li, Lin Yang, Xuyang Wang, Xiaoyi Qin, Junjie Wang, Ming Li:
Towards Lightweight Applications: Asymmetric Enroll-Verify Structure for Speaker Verification. ICASSP 2022: 7067-7071 - [c111]Weiqing Wang, Ming Li:
Incorporating End-to-End Framework Into Target-Speaker Voice Activity Detection. ICASSP 2022: 8362-8366 - [c110]Weiqing Wang, Xiaoyi Qin, Ming Li:
Cross-Channel Attention-Based Target Speaker Voice Activity Detection: Experimental Results for the M2met Challenge. ICASSP 2022: 9171-9175 - [c109]Ming Cheng, Haoxu Wang, Yechen Wang, Ming Li:
The DKU Audio-Visual Wake Word Spotting System for the 2021 MISP Challenge. ICASSP 2022: 9256-9260 - [c108]Yueran Pan, Jiaxin Wu, Ran Ju, Ziang Zhou, Jiayue Gu, Songtian Zeng, Lynn Yuan, Ming Li:
A Multimodal Framework for Automated Teaching Quality Assessment of One-to-many Online Instruction Videos. ICPR 2022: 1777-1783 - [c107]Xiaoyi Qin, Na Li, Chao Weng, Dan Su, Ming Li:
Cross-Age Speaker Verification: Learning Age-Invariant Speaker Embeddings. INTERSPEECH 2022: 1436-1440 - [c106]Weiqing Wang, Ming Li, Qingjian Lin:
Online Target Speaker Voice Activity Detection for Speaker Diarization. INTERSPEECH 2022: 1441-1445 - [c105]Xingming Wang, Xiaoyi Qin, Yikang Wang, Yunfei Xu, Ming Li:
The DKU-OPPO System for the 2022 Spoofing-Aware Speaker Verification Challenge. INTERSPEECH 2022: 4396-4400 - [c104]Yikang Wang, Xingming Wang, Hiromitsu Nishizaki, Ming Li:
Low Pass Filtering and Bandwidth Extension for Robust Anti-spoofing Countermeasure Against Codec Variabilities. ISCSLP 2022: 438-442 - [c103]Yuxiang Zhang, Jingze Lu, Xingming Wang, Zhuo Li, Runqiu Xiao, Wenchao Wang, Ming Li, Pengyuan Zhang:
Deepfake Detection System for the ADD Challenge Track 3.2 Based on Score Fusion. DDAM@MM 2022: 43-52 - [c102]Hua Hua, Ziyi Chen, Yuxiang Zhang, Ming Li, Pengyuan Zhang:
Improving Spoofing Capability for End-to-end Any-to-many Voice Conversion. DDAM@MM 2022: 93-100 - [c101]Yucong Zhang, Qingjian Lin, Weiqing Wang, Lin Yang, Xuyang Wang, Junjie Wang, Ming Li:
Low-Latency Online Speaker Diarization with Graph-Based Label Generation. Odyssey 2022: 162-169 - [c100]Jincheng He, Yuanyuan Bao, Na Xu, Hongfeng Li, Shicong Li, Linzhang Wang, Fei Xiang, Ming Li:
Single-Channel Target Speaker Separation Using Joint Training with Target Speaker's Pitch Information. Odyssey 2022: 301-305 - [c99]Haoxu Wang, Yan Jia, Zeqing Zhao, Xuyang Wang, Junjie Wang, Ming Li:
Generating TTS Based Adversarial Samples for Training Wake-Up Word Detection Systems Against Confusing Words. Odyssey 2022: 402-406 - 2021
- [c98]Danwei Cai, Weiqing Wang, Ming Li:
An Iterative Framework for Self-Supervised Deep Speaker Representation Learning. ICASSP 2021: 6728-6732 - [c97]Huangrui Chu, Yechen Wang, Ran Ju, Yan Jia, Haoxu Wang, Ming Li, Qi Deng:
Call For Help Detection In Emergent Situations Using Keyword Spotting And Paralinguistic Analysis. ICMI Companion 2021: 104-111 - [c96]Ran Ju, Huangrui Chu, Yechen Wang, Qi Deng, Ming Cheng, Ming Li:
A Multimodal Dynamic Neural Network for Call for Help Recognition in Elevators. ICMI Companion 2021: 112-120 - [c95]Xinmeng Chen, Xuchen Gong, Ming Cheng, Qi Deng, Ming Li:
Cross-modal Assisted Training for Abnormal Event Recognition in Elevators. ICMI 2021: 530-538 - [c94]Tinglong Zhu, Xiaoyi Qin, Ming Li:
Binary Neural Network for Speaker Verification. Interspeech 2021: 86-90 - [c93]Weiqing Wang, Danwei Cai, Jin Wang, Qingjian Lin, Xuyang Wang, Mi Hong, Ming Li:
The DKU-Duke-Lenovo System Description for the Fearless Steps Challenge Phase III. Interspeech 2021: 1044-1048 - [c92]Xiaoyi Qin, Chao Wang, Yong Ma, Min Liu, Shilei Zhang, Ming Li:
Our Learned Lessons from Cross-Lingual Speaker Verification: The CRMI-DKU System Description for the Short-Duration Speaker Verification Challenge 2021. Interspeech 2021: 2317-2321 - [c91]Yao Shi, Hui Bu, Xin Xu, Shaoji Zhang, Ming Li:
AISHELL-3: A Multi-Speaker Mandarin TTS Corpus. Interspeech 2021: 2756-2760 - [c90]Yan Jia, Xingming Wang, Xiaoyi Qin, Yinping Zhang, Xuyang Wang, Junjie Wang, Dong Zhang, Ming Li:
The 2020 Personalized Voice Trigger Challenge: Open Datasets, Evaluation Metrics, Baseline System and Results. Interspeech 2021: 4239-4243 - [c89]Tingle Li, Jiawei Chen, Haowen Hou, Ming Li:
Sams-Net: A Sliced Attention-based Neural Network for Music Source Separation. ISCSLP 2021: 1-5 - [c88]Murong Ma, Haiwei Wu, Xuyang Wang, Lin Yang, Junjie Wang, Ming Li:
Acoustic Word Embedding System for Code-Switching Query-by-example Spoken Term Detection. ISCSLP 2021: 1-5 - [c87]Danwei Cai, Ming Li:
Embedding Aggregation for Far-Field Speaker Verification with Distributed Microphone Arrays. SLT 2021: 308-315 - 2020
- [c86]Zexin Cai, Ming Li:
The Duke Entry for 2020 Blizzard Challenge. Blizzard Challenge / Voice Conversion Challenge 2020 - [c85]Danwei Cai, Weicheng Cai, Ming Li:
Within-Sample Variability-Invariant Loss for Robust Speaker Recognition Under Noisy Environments. ICASSP 2020: 6469-6473 - [c84]Xiaoyi Qin, Hui Bu, Ming Li:
HI-MIA: A Far-Field Text-Dependent Speaker Verification Database and the Baselines. ICASSP 2020: 7609-7613 - [c83]Yueran Pan, Kunjing Cai, Ming Cheng, Xiaobing Zou, Ming Li:
Responsive Social Smile: A Machine Learning based Multimodal Behavior Assessment Framework towards Early Stage Autism Screening. ICPR 2020: 2240-2247 - [c82]Ming Cheng, Kunjing Cai, Ming Li:
RWF-2000: An Open Large Scale Video Database for Violence Detection. ICPR 2020: 4183-4190 - [c81]Qingjian Lin, Yu Hou, Ming Li:
Self-Attentive Similarity Measurement Strategies in Speaker Diarization. INTERSPEECH 2020: 284-288 - [c80]Tingle Li, Qingjian Lin, Yuanyuan Bao, Ming Li:
Atss-Net: Target Speaker Separation via Attention-Based Neural Network. INTERSPEECH 2020: 1411-1415 - [c79]Qingjian Lin, Tingle Li, Ming Li:
The DKU Speech Activity Detection and Speaker Identification Systems for Fearless Steps Challenge Phase-02. INTERSPEECH 2020: 2607-2611 - [c78]Xiaoyi Qin, Ming Li, Hui Bu, Wei Rao, Rohan Kumar Das, Shrikanth Narayanan, Haizhou Li:
The INTERSPEECH 2020 Far-Field Speaker Verification Challenge. INTERSPEECH 2020: 3456-3460 - [c77]Zexin Cai, Chuxiong Zhang, Ming Li:
From Speaker Verification to Multispeaker Speech Synthesis, Deep Transfer with Feedback Constraint. INTERSPEECH 2020: 3974-3978 - [c76]Qingjian Lin, Weicheng Cai, Lin Yang, Junjie Wang, Jun Zhang, Ming Li:
DIHARD II is Still Hard: Experimental Results and Discussions from the DKU-LENOVO Team. Odyssey 2020: 102-109 - [c75]Qingjian Lin, Tingle Li, Lin Yang, Junjie Wang, Ming Li:
Optimal Mapping Loss: A Faster Loss for End-to-End Speaker Diarization. Odyssey 2020: 125-131 - 2019
- [c74]Jianing Teng, Dong Zhang, Ming Li, Yudong Huang:
Facial Expression Recognition with Identity and Spatial-temporal Integrated Learning. ACII Workshops 2019: 100-104 - [c73]Weiqing Wang, Haiwei Wu, Ming Li:
Deep Neural Networks with Batch Speaker Normalization for Intoxicated Speech Detection. APSIPA 2019: 1323-1327 - [c72]Haiwei Wu, Weicheng Cai, Ming Li, Ji Gao, Shanshan Zhang, Zhiqiang Lyu, Shen Huang:
DKU-Tencent Submission to Oriental Language Recognition AP18-OLR Challenge. APSIPA 2019: 1646-1651 - [c71]Zexin Cai, Chuxiong Zhang, Yaogen Yang, Ming Li:
The DKU Speech Synthesis System for 2019 Blizzard Challenge. Blizzard Challenge 2019 - [c70]Weicheng Cai, Danwei Cai, Shen Huang, Ming Li:
Utterance-level End-to-end Language Identification Using Attention-based CNN-BLSTM. ICASSP 2019: 5991-5995 - [c69]Zexin Cai, Zhicheng Xu, Ming Li:
F0 Contour Estimation Using Phonetic Feature in Electrolaryngeal Speech Enhancement. ICASSP 2019: 6490-6494 - [c68]Sheng Sun, Shuangmei Li, Wenbo Liu, Xiaobing Zou, Ming Li:
Fixation Based Object Recognition in Autism Clinic Setting. ICIRA (4) 2019: 615-628 - [c67]Qingjian Lin, Ruiqing Yin, Ming Li, Hervé Bredin, Claude Barras:
LSTM Based Similarity Measurement with Spectral Clustering for Speaker Diarization. INTERSPEECH 2019: 366-370 - [c66]Weicheng Cai, Haiwei Wu, Danwei Cai, Ming Li:
The DKU Replay Detection System for the ASVspoof 2019 Challenge: On Data Augmentation, Feature Representation, Classification, and Fusion. INTERSPEECH 2019: 1023-1027 - [c65]Zexin Cai, Yaogen Yang, Chuxiong Zhang, Xiaoyi Qin, Ming Li:
Polyphone Disambiguation for Mandarin Chinese Using Conditional Neural Network with Multi-Level Embedding Features. INTERSPEECH 2019: 2110-2114 - [c64]Haiwei Wu, Weiqing Wang, Ming Li:
The DKU-LENOVO Systems for the INTERSPEECH 2019 Computational Paralinguistic Challenge. INTERSPEECH 2019: 2433-2437 - [c63]Danwei Cai, Xiaoyi Qin, Weicheng Cai, Ming Li:
The DKU System for the Speaker Recognition Task of the 2019 VOiCES from a Distance Challenge. INTERSPEECH 2019: 2493-2497 - [c62]Danwei Cai, Weicheng Cai, Ming Li:
The DKU-SMIIP System for NIST 2018 Speaker Recognition Evaluation. INTERSPEECH 2019: 4370-4374 - [c61]Ming Li, Weicheng Cai, Danwei Cai:
Survey Talk: End-to-End Deep Neural Network Based Speaker and Language Recognition. INTERSPEECH 2019 - [c60]Xiaoyi Qin, Danwei Cai, Ming Li:
Far-Field End-to-End Text-Dependent Speaker Verification Based on Mixed Training Data with Transfer Learning and Enrollment Data Augmentation. INTERSPEECH 2019: 4045-4049 - [c59]Danwei Cai, Xiaoyi Qin, Ming Li:
Multi-Channel Training for End-to-End Speaker Recognition Under Reverberant and Noisy Environment. INTERSPEECH 2019: 4365-4369 - 2018
- [c58]Danwei Cai, Zexin Cai, Ming Li:
Deep Speaker Embeddings with Convolutional Neural Network on Supervector for Text-Independent Speaker Recognition. APSIPA 2018: 1478-1482 - [c57]Weicheng Cai, Zexin Cai, Xiang Zhang, Xiaoqi Wang, Ming Li:
A Novel Learnable Dictionary Encoding Layer for End-to-End Language Identification. ICASSP 2018: 5189-5193 - [c56]Weicheng Cai, Zexin Cai, Wenbo Liu, Xiaoqi Wang, Ming Li:
Insights in-to-End Learning Scheme for Language Identification. ICASSP 2018: 5209-5213 - [c55]Weicheng Cai, Jinkun Chen, Ming Li:
Analysis of Length Normalization in End-to-End Speaker Verification System. INTERSPEECH 2018: 3618-3622 - [c54]Haiwei Wu, Ming Li, Zexin Cai, Haibin Zhong:
Unsupervised query by example spoken term detection using features concatenated with Self-Organizing Map distances. ISCSLP 2018: 1-5 - [c53]Zexin Cai, Xiaoyi Qin, Danwei Cai, Ming Li, Xinzhong Liu, Haibin Zhong:
The DKU-JNU-EMA Electromagnetic Articulography Database on Mandarin and Chinese Dialects with Tandem Feature based Acoustic-to-Articulatory Inversion. ISCSLP 2018: 235-239 - [c52]Jinkun Chen, Weicheng Cai, Danwei Cai, Zexin Cai, Haibin Zhong, Ming Li:
End-to-end Language Identification using NetFV and NetVLAD. ISCSLP 2018: 319-323 - [c51]Weicheng Cai, Jinkun Chen, Ming Li:
Exploring the Encoding Layer and Loss Function in End-to-End Speaker and Language Recognition System. Odyssey 2018: 74-81 - 2017
- [c50]Wenbo Liu, Tianyan Zhou, Chenghao Zhang, Xiaobing Zou, Ming Li:
Response to name: A dataset and a multimodal machine learning framework towards autism study. ACII 2017: 178-183 - [c49]Jinkun Chen, Cong Liu, Ming Li:
Automatic emotional spoken language text corpus construction from written dialogs in fictions. ACII 2017: 319-324 - [c48]Ming Li, Luting Wang, Zhicheng Xu, Danwei Cai:
Mandarin electrolaryngeal voice conversion with combination of Gaussian mixture model and non-negative matrix factorization. APSIPA 2017: 1360-1363 - [c47]Weiyang Liu, Yandong Wen, Zhiding Yu, Ming Li, Bhiksha Raj, Le Song:
SphereFace: Deep Hypersphere Embedding for Face Recognition. CVPR 2017: 6738-6746 - [c46]Weicheng Cai, Danwei Cai, Wenbo Liu, Gang Li, Ming Li:
Countermeasures for Automatic Speaker Verification Replay Spoofing Attack : On Data Augmentation, Feature Representation, Classification and Fusion. INTERSPEECH 2017: 17-21 - [c45]Danwei Cai, Zhidong Ni, Wenbo Liu, Weicheng Cai, Gang Li, Ming Li:
End-to-End Deep Learning Framework for Speech Paralinguistics Detection Based on Perception Aware Spectrum. INTERSPEECH 2017: 3452-3456 - 2016
- [c44]Zhiding Yu, Weiyang Liu, Wenbo Liu, Yingzhen Yang, Ming Li, B. V. K. Vijaya Kumar:
On Order-Constrained Transitive Distance Clustering. AAAI 2016: 2293-2299 - [c43]Danwei Cai, Weicheng Cai, Zhidong Ni, Ming Li:
Locality sensitive discriminant analysis for speaker verification. APSIPA 2016: 1-5 - [c42]Gaoyuan He, Jinkun Chen, Xuebo Liu, Ming Li:
The SYSU System for CCPR 2016 Multimodal Emotion Recognition Challenge. CCPR (2) 2016: 707-720 - [c41]Huadi Zheng, Weicheng Cai, Tianyan Zhou, Shilei Zhang, Ming Li:
Text-independent voice conversion using deep neural network based phonetic level features. ICPR 2016: 2872-2877 - [c40]Tianyan Zhou, Weicheng Cai, Xiaoyan Chen, Xiaobing Zou, Shilei Zhang, Ming Li:
Speaker diarization system for autism children's real-life audio data. ISCSLP 2016: 1-5 - 2015
- [c39]Wenbo Liu, Li Yi, Zhiding Yu, Xiaobing Zou, Bhiksha Raj, Ming Li:
Efficient autism spectrum disorder prediction with eye movement: A machine learning framework. ACII 2015: 649-655 - [c38]Shitao Weng, Shushan Chen, Lei Yu, Xuewei Wu, Weicheng Cai, Zhi Liu, Yiming Zhou, Ming Li:
The SYSU system for the interspeech 2015 automatic speaker verification spoofing and countermeasures challenge. APSIPA 2015: 152-155 - [c37]Weicheng Cai, Ming Li, Lin Li, Qingyang Hong:
Duration dependent covariance regularization in PLDA modeling for speaker verification. INTERSPEECH 2015: 1027-1031 - [c36]Qingyang Hong, Lin Li, Ming Li, Ling Huang, Lihong Wan, Jun Zhang:
Modified-prior PLDA and score calibration for duration mismatch compensation in speaker recognition system. INTERSPEECH 2015: 1037-1041 - [c35]Yingxue Wang, Shenghui Zhao, Wenbo Liu, Ming Li, Jingming Kuang:
Speech bandwidth expansion based on deep neural networks. INTERSPEECH 2015: 2593-2597 - [c34]Wenbo Liu, Zhiding Yu, Bhiksha Raj, Ming Li:
Locality constrained transitive distance clustering on speech data. INTERSPEECH 2015: 2917-2921 - 2014
- [c33]Ming Li, Xin Li:
Verification based ECG biometrics with cardiac irregular conditions using heartbeat level and segment level information fusion. ICASSP 2014: 3769-3773 - [c32]Prashanth Gurunath Shivakumar, Ming Li, Vedant Dhandhania, Shrikanth S. Narayanan:
Simplified and supervised i-vector modeling for speaker age regression. ICASSP 2014: 4833-4837 - [c31]Liming Song, Ming Li, Yonghong Yan:
Melody Extraction for Vocal Polyphonic Music Based on Bayesian Framework. IIH-MSP 2014: 570-573 - [c30]Ming Li, Wenbo Liu:
Speaker verification and spoken language identification using a generalized i-vector framework with phonetic tokenizations and tandem features. INTERSPEECH 2014: 1120-1124 - [c29]Wenbo Liu, Zhiding Yu, Ming Li:
An iterative framework for unsupervised learning in the PLDA based speaker verification. ISCSLP 2014: 78-82 - 2013
- [c28]Liming Song, Ming Li, Yonghong Yan:
Automatic Vocal Segments Detection in Popular Music. CIS 2013: 349-352 - [c27]Ming Li, Andreas Tsiartas, Maarten Van Segbroeck, Shrikanth S. Narayanan:
Speaker verification using simplified and supervised i-vector modeling. ICASSP 2013: 7199-7203 - [c26]Daniel Bone, Theodora Chaspari, Kartik Audhkhasi, James Gibson, Andreas Tsiartas, Maarten Van Segbroeck, Ming Li, Sungbok Lee, Shrikanth S. Narayanan:
Classifying language-related developmental disorders from speech cues: the promise and the potential confounds. INTERSPEECH 2013: 182-186 - [c25]Andreas Tsiartas, Theodora Chaspari, Nassos Katsamanis, Prasanta Kumar Ghosh, Ming Li, Maarten Van Segbroeck, Alexandros Potamianos, Shrikanth S. Narayanan:
Multi-band long-term signal variability features for robust voice activity detection. INTERSPEECH 2013: 718-722 - [c24]Kyu Jeong Han, Sriram Ganapathy, Ming Li, Mohamed Kamal Omar, Shrikanth S. Narayanan:
TRAP language identification system for RATS phase II evaluation. INTERSPEECH 2013: 1502-1506 - [c23]Ming Li, Jangwon Kim, Prasanta Kumar Ghosh, Vikram Ramanarayanan, Shrikanth S. Narayanan:
Speaker verification based on fusion of acoustic and articulatory information. INTERSPEECH 2013: 1614-1618 - 2012
- [c22]Ming Li, Charley Lu, Anne Wang, Shrikanth S. Narayanan:
Speaker verification using Lasso based sparse total variability supervector with PLDA modeling. APSIPA 2012: 1-4 - [c21]Ming Li, Angeliki Metallinou, Daniel Bone, Shrikanth S. Narayanan:
Speaker states recognition using latent factor analysis based Eigenchannel factor vector modeling. ICASSP 2012: 1937-1940 - [c20]Kartik Audhkhasi, Angeliki Metallinou, Ming Li, Shrikanth S. Narayanan:
Speaker Personality Classification Using Systems Based on Acoustic-Lexical Cues and an Optimal Tree-Structured Bayesian Network. INTERSPEECH 2012: 262-265 - [c19]Jangwon Kim, Naveen Kumar, Andreas Tsiartas, Ming Li, Shrikanth S. Narayanan:
Intelligibility classification of pathological speech using fusion of multiple high level descriptors. INTERSPEECH 2012: 534-537 - 2011
- [c18]Samuel Kim, Ming Li, Sangwon Lee, Urbashi Mitra, B. Adar Emken, Donna Spruijt-Metz, Murali Annavaram, Shrikanth S. Narayanan:
Modeling high-level descriptions of real-life physical activities using latent topic modeling of multimodal sensor signals. EMBC 2011: 6033-6036 - [c17]Ming Li, Shrikanth S. Narayanan:
Robust talking face video verification using joint factor analysis and sparse representation on GMM mean shifted supervectors. ICASSP 2011: 1481-1484 - [c16]Ming Li, Xiang Zhang, Yonghong Yan, Shrikanth S. Narayanan:
Speaker Verification Using Sparse Representations on Total Variability i-vectors. INTERSPEECH 2011: 2729-2732 - [c15]Daniel Bone, Matthew Black, Ming Li, Angeliki Metallinou, Sungbok Lee, Shrikanth S. Narayanan:
Intoxicated Speech Detection by Fusion of Speaker Normalized Hierarchical Features and GMM Supervectors. INTERSPEECH 2011: 3217-3220 - 2010
- [c14]Ming Li, Shrikanth S. Narayanan:
Robust ECG Biometrics by Fusing Temporal and Cepstral Information. ICPR 2010: 1326-1329 - [c13]Ming Li, Chi-Sang Jung, Kyu Jeong Han:
Combining five acoustic level modeling methods for automatic speaker age and gender recognition. INTERSPEECH 2010: 2826-2829 - 2009
- [c12]Gautam Thatte, Viktor Rozgic, Ming Li, Sabyasachi Ghosh, Urbashi Mitra, Shrikanth S. Narayanan, Murali Annavaram, Donna Spruijt-Metz:
Optimal time-resource allocation for activity-detection via multimodal sensing. BODYNETS 2009: 14 - [c11]Gautam Thatte, Viktor Rozgic, Ming Li, Sabyasachi Ghosh, Urbashi Mitra, Shrikanth S. Narayanan, Murali Annavaram, Donna Spruijt-Metz:
Optimal Allocation of Time-Resources for Multihypothesis Activity-Level Detection. DCOSS 2009: 273-286 - 2008
- [c10]Ming Li, Chuan Cao, Di Wang, Ping Lu, Qiang Fu, Yonghong Yan:
Cochannel speech separation using multi-pitch estimation and model based voiced sequential grouping. INTERSPEECH 2008: 151-154 - [c9]Chuan Cao, Ming Li, Jian Liu, Yonghong Yan:
An objective singing evaluation approach by relating acoustic measurements to perceptual ratings. INTERSPEECH 2008: 2058-2061 - 2007
- [c8]Hongbin Suo, Ming Li, Tantan Liu, Ping Lu, Yonghong Yan:
The Design of Backend Classifiers in PPRLM System for Language Identification. ICNC (1) 2007: 678-682 - [c7]Ming Li, Yun Lei, Xiang Zhang, Jian Liu, Yonghong Yan:
Authentication and Quality Monitoring based on Audio Watermark for Analog AM Shortwave Broadcasting. IIH-MSP 2007: 263-266 - [c6]Ming Li, Hongbin Suo, Xiao Wu, Ping Lu, Yonghong Yan:
Spoken language identification using score vector modeling and support vector machine. INTERSPEECH 2007: 350-353 - [c5]Chuan Cao, Ming Li, Jian Liu, Yonghong Yan:
Singing Melody Extraction in Polyphonic Music by Harmonic Tracking. ISMIR 2007: 373-374 - 2006
- [c4]Ming Li, Yun Lei, Jian Liu, Yonghong Yan:
A Novel Audio Watermarking in Wavelet Domain. IIH-MSP 2006: 27-32 - [c3]Ming Li, Jian Liu, Yonghong Yan:
An Efficient and Robust Approach to Audio ID Identification. ISCSLP 2006 - [c2]Xiao Wu, Ming Li, Jian Liu, Jun Yang, Yonghong Yan:
A Top-down Approach to Melody Match in Pitch Contour for Query by Humming. ISCSLP 2006 - 2000
- [c1]Ming Li, Tiecheng Yu:
Multi-group mixture weight HMM. INTERSPEECH 2000: 290-292
Informal and Other Publications
- 2024
- [i59]Danwei Cai, Zexin Cai, Ming Li:
Self-supervised Reflective Learning through Self-distillation and Online Clustering for Speaker Representation Learning. CoRR abs/2401.01473 (2024) - [i58]Haoxu Wang, Ming Cheng, Qiang Fu, Ming Li:
Robust Wake Word Spotting With Frame-Level Cross-Modal Attention Based Audio-Visual Conformer. CoRR abs/2403.01700 (2024) - 2023
- [i57]Haoxu Wang, Ming Cheng, Qiang Fu, Ming Li:
The DKU Post-Challenge Audio-Visual Wake Word Spotting System for the 2021 MISP Challenge: Deep Analysis. CoRR abs/2303.02348 (2023) - [i56]Yu Hou, Cong Tran, Ming Li, Won-Yong Shin:
Graph Neural Network-Aided Exploratory Learning for Community Detection with Unknown Topology. CoRR abs/2304.04497 (2023) - [i55]Yuke Lin, Xiaoyi Qin, Ming Cheng, Ning Jiang, Guoqing Zhao, Ming Li:
VoxBlink: X-Large Speaker Verification Dataset on Camera. CoRR abs/2308.07056 (2023) - [i54]Zexin Cai, Weiqing Wang, Yikang Wang, Ming Li:
The DKU-DUKEECE System for the Manipulation Region Location Task of ADD 2023. CoRR abs/2308.10281 (2023) - [i53]Haoxu Wang, Fan Yu, Xian Shi, Yuezhang Wang, Shiliang Zhang, Ming Li:
SlideSpeech: A Large-Scale Slide-Enriched Audio-Visual Corpus. CoRR abs/2309.05396 (2023) - [i52]Yucong Zhang, Hongbin Suo, Yulong Wan, Ming Li:
Outlier-aware Inlier Modeling and Multi-scale Scoring for Anomalous Sound Detection via Multitask Learning. CoRR abs/2309.07500 (2023) - [i51]Weiqing Wang, Ming Li:
End-to-end Online Speaker Diarization with Target Speaker Tracking. CoRR abs/2310.08696 (2023) - 2022
- [i50]Haoxu Wang, Yan Jia, Zeqing Zhao, Xuyang Wang, Junjie Wang, Ming Li:
Generating Adversarial Samples For Training Wake-up Word Detection Systems Against Confusing Words. CoRR abs/2201.00167 (2022) - [i49]Zexin Cai, Ming Li:
Invertible Voice Conversion. CoRR abs/2201.10687 (2022) - [i48]Weiqing Wang, Xiaoyi Qin, Ming Li:
Cross-Channel Attention-Based Target Speaker Voice Activity Detection: Experimental Results for M2MeT Challenge. CoRR abs/2202.02687 (2022) - [i47]Bang Zeng, Weiqing Wang, Yuanyuan Bao, Ming Li:
Simultaneous Speech Extraction for Multiple Target Speakers under the Meeting Scenarios(V1). CoRR abs/2206.08525 (2022) - [i46]Danwei Cai, Zexin Cai, Ming Li:
Identifying Source Speakers for Voice Conversion based Spoofing Attacks on Speaker Verification Systems. CoRR abs/2206.09103 (2022) - [i45]Weiqing Wang, Qingjian Lin, Ming Li:
Online Target Speaker Voice Activity Detection for Speaker Diarization. CoRR abs/2207.05920 (2022) - [i44]Xiaoyi Qin, Na Li, Chao Weng, Dan Su, Ming Li:
Cross-Age Speaker Verification: Learning Age-Invariant Speaker Embeddings. CoRR abs/2207.05929 (2022) - [i43]Xingming Wang, Xiaoyi Qin, Yikang Wang, Yunfei Xu, Ming Li:
The DKU-OPPO System for the 2022 Spoofing-Aware Speaker Verification Challenge. CoRR abs/2207.07510 (2022) - [i42]Xiaoyi Qin, Na Li, Yuke Lin, Yiwei Ding, Chao Weng, Dan Su, Ming Li:
The DKU-Tencent System for the VoxCeleb Speaker Recognition Challenge 2022. CoRR abs/2210.05092 (2022) - [i41]Ming Cheng, Weiqing Wang, Yucong Zhang, Xiaoyi Qin, Ming Li:
Target-Speaker Voice Activity Detection via Sequence-to-Sequence Prediction. CoRR abs/2210.16127 (2022) - [i40]Zexin Cai, Weiqing Wang, Ming Li:
Waveform Boundary Detection for Partially Spoofed Audio. CoRR abs/2211.00226 (2022) - [i39]Yikang Wang, Xingming Wang, Hiromitsu Nishizaki, Ming Li:
Low Pass Filtering and Bandwidth Extension for Robust Anti-spoofing Countermeasure Against Codec Variabilities. CoRR abs/2211.06546 (2022) - 2021
- [i38]Weiqing Wang, Qingjian Lin, Danwei Cai, Lin Yang, Ming Li:
The DKU-Duke-Lenovo System Description for the Third DIHARD Speech Diarization Challenge. CoRR abs/2102.03649 (2021) - [i37]Tinglong Zhu, Xiaoyi Qin, Ming Li:
Binary Neural Network for Speaker Verification. CoRR abs/2104.02306 (2021) - [i36]Ziang Zhou, Yanze Xu, Shilei Zhang, Ming Li:
Detecting Escalation Level from Speech with Transfer Learning and Acoustic-Lexical Information Fusion. CoRR abs/2104.06004 (2021) - [i35]Yaogen Yang, Haozhe Zhang, Xiaoyi Qin, Shanshan Liang, Huahua Cui, Mingyang Xu, Ming Li:
Building Bilingual and Code-Switched Voice Conversion with Limited Training Data Using Embedding Consistency Loss. CoRR abs/2104.10832 (2021) - [i34]Yuanyuan Bao, Yanze Xu, Na Xu, Wenjing Yang, Hongfeng Li, Shicong Li, Yongtao Jia, Fei Xiang, Jincheng He, Ming Li:
Lightweight Dual-channel Target Speaker Separation for Mobile Voice Communication. CoRR abs/2106.02934 (2021) - [i33]Weiqing Wang, Danwei Cai, Qingjian Lin, Lin Yang, Junjie Wang, Jin Wang, Ming Li:
The DKU-DukeECE-Lenovo System for the Diarization Task of the 2021 VoxCeleb Speaker Recognition Challenge. CoRR abs/2109.02002 (2021) - [i32]Danwei Cai, Ming Li:
The DKU-DukeECE System for the Self-Supervision Speaker Verification Task of the 2021 VoxCeleb Speaker Recognition Challenge. CoRR abs/2109.02853 (2021) - [i31]Qingjian Lin, Lin Yang, Xuyang Wang, Xiaoyi Qin, Junjie Wang, Ming Li:
Towards Lightweight Applications: Asymmetric Enroll-Verify Structure for Speaker Verification. CoRR abs/2110.04438 (2021) - [i30]Xiaoyi Qin, Na Li, Chao Weng, Dan Su, Ming Li:
Simple Attention Module based Speaker Verification with Iterative noisy label detection. CoRR abs/2110.06534 (2021) - [i29]Haozhe Zhang, Zexin Cai, Xiaoyi Qin, Ming Li:
SIG-VC: A Speaker Information Guided Zero-shot Voice Conversion System for Both Human Beings and Machines. CoRR abs/2111.03811 (2021) - [i28]Yucong Zhang, Qingjian Lin, Weiqing Wang, Lin Yang, Xuyang Wang, Junjie Wang, Ming Li:
Online Speaker Diarization with Graph-based Label Generation. CoRR abs/2111.13803 (2021) - 2020
- [i27]Xiaoyi Qin, Ming Li, Hui Bu, Rohan Kumar Das, Wei Rao, Shrikanth Narayanan, Haizhou Li:
The FFSVC 2020 Evaluation Plan. CoRR abs/2002.00387 (2020) - [i26]Danwei Cai, Weicheng Cai, Ming Li:
Within-sample variability-invariant loss for robust speaker recognition under noisy environments. CoRR abs/2002.00924 (2020) - [i25]Qingjian Lin, Weicheng Cai, Lin Yang, Junjie Wang, Jun Zhang, Ming Li:
DIHARD II is Still Hard: Experimental Results and Discussions from the DKU-LENOVO Team. CoRR abs/2002.12761 (2020) - [i24]Haiwei Wu, Yan Jia, Yuanfei Nie, Ming Li:
Mutli-task Learning with Alignment Loss for Far-field Small-Footprint Keyword Spotting. CoRR abs/2005.03633 (2020) - [i23]Zexin Cai, Chuxiong Zhang, Ming Li:
From Speaker Verification to Multispeaker Speech Synthesis, Deep Transfer with Feedback Constraint. CoRR abs/2005.04587 (2020) - [i22]Xiaoyi Qin, Ming Li, Hui Bu, Wei Rao, Rohan Kumar Das, Shrikanth Narayanan, Haizhou Li:
The INTERSPEECH 2020 Far-Field Speaker Verification Challenge. CoRR abs/2005.08046 (2020) - [i21]Tingle Li, Qingjian Lin, Yuanyuan Bao, Ming Li:
Atss-Net: Target Speaker Separation via Attention-based Neural Network. CoRR abs/2005.09200 (2020) - [i20]Zexin Cai, Yaogen Yang, Ming Li:
Cross-lingual Multispeaker Text-to-Speech under Limited-Data Scenario. CoRR abs/2005.10441 (2020) - [i19]Murong Ma, Haiwei Wu, Xuyang Wang, Lin Yang, Junjie Wang, Ming Li:
Acoustic Word Embedding System for Code-Switching Query-by-example Spoken Term Detection. CoRR abs/2005.11777 (2020) - [i18]Haiwei Wu, Lin Zhang, Lin Yang, Xuyang Wang, Junjie Wang, Dong Zhang, Ming Li:
Mask Detection and Breath Monitoring from Speech: on Data Augmentation, Feature Representation and Modeling. CoRR abs/2008.05175 (2020) - [i17]Yao Shi, Hui Bu, Xin Xu, Shaoji Zhang, Ming Li:
AISHELL-3: A Multi-speaker Mandarin TTS Corpus and the Baselines. CoRR abs/2010.11567 (2020) - [i16]Danwei Cai, Weiqing Wang, Ming Li:
An iterative framework for self-supervised deep speaker representation learning. CoRR abs/2010.14751 (2020) - [i15]Yan Jia, Zexin Cai, Murong Ma, Zeqing Zhao, Xuyang Wang, Junjie Wang, Ming Li:
Training Wake Word Detection with Synthesized Speech Data on Confusion Words. CoRR abs/2011.01460 (2020) - [i14]Xiaoyi Qin, Yaogen Yang, Lin Yang, Xuyang Wang, Junjie Wang, Ming Li:
Exploring Voice Conversion based Data Augmentation in Text-Dependent Speaker Verification. CoRR abs/2011.10710 (2020) - 2019
- [i13]Weicheng Cai, Danwei Cai, Shen Huang, Ming Li:
Utterance-level end-to-end language identification using attention-based CNN-BLSTM. CoRR abs/1902.07374 (2019) - [i12]Zexin Cai, Yaogen Yang, Chuxiong Zhang, Xiaoyi Qin, Ming Li:
Polyphone Disambiguation for Mandarin Chinese Using Conditional Neural Network with Multi-level Embedding Features. CoRR abs/1907.01749 (2019) - [i11]Weicheng Cai, Haiwei Wu, Danwei Cai, Ming Li:
The DKU Replay Detection System for the ASVspoof 2019 Challenge: On Data Augmentation, Feature Representation, Classification, and Fusion. CoRR abs/1907.02663 (2019) - [i10]Qingjian Lin, Ruiqing Yin, Ming Li, Hervé Bredin, Claude Barras:
LSTM based Similarity Measurement with Spectral Clustering for Speaker Diarization. CoRR abs/1907.10393 (2019) - [i9]Ming Cheng, Kunjing Cai, Ming Li:
RWF-2000: An Open Large Scale Video Database for Violence Detection. CoRR abs/1911.05913 (2019) - [i8]Xiaoyi Qin, Hui Bu, Ming Li:
HI-MIA : A Far-field Text-Dependent Speaker Verification Database and the Baselines. CoRR abs/1912.01231 (2019) - 2018
- [i7]Weicheng Cai, Zexin Cai, Wenbo Liu, Xiaoqi Wang, Ming Li:
Insights into End-to-End Learning Scheme for Language Identification. CoRR abs/1804.00381 (2018) - [i6]Weicheng Cai, Zexin Cai, Xiang Zhang, Xiaoqi Wang, Ming Li:
A Novel Learnable Dictionary Encoding Layer for End-to-End Language Identification. CoRR abs/1804.00385 (2018) - [i5]Weicheng Cai, Jinkun Chen, Ming Li:
Exploring the Encoding Layer and Loss Function in End-to-End Speaker and Language Recognition System. CoRR abs/1804.05160 (2018) - [i4]Weicheng Cai, Jinkun Chen, Ming Li:
Analysis of Length Normalization in End-to-End Speaker Verification System. CoRR abs/1806.03209 (2018) - [i3]Jinkun Chen, Weicheng Cai, Danwei Cai, Zexin Cai, Haibin Zhong, Ming Li:
End-to-end Language Identification using NetFV and NetVLAD. CoRR abs/1809.02906 (2018) - 2017
- [i2]Weiyang Liu, Yandong Wen, Zhiding Yu, Ming Li, Bhiksha Raj, Le Song:
SphereFace: Deep Hypersphere Embedding for Face Recognition. CoRR abs/1704.08063 (2017) - 2015
- [i1]Shitao Weng, Shushan Chen, Lei Yu, Xuewei Wu, Weicheng Cai, Zhi Liu, Ming Li:
The SYSU System for the Interspeech 2015 Automatic Speaker Verification Spoofing and Countermeasures Challenge. CoRR abs/1507.06711 (2015)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-12-19 23:11 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint