Ming Li 0026

Person information

affiliation: Duke Kunshan University, Data Science Research Center, China
affiliation (former): Sun Yat-sen University-Carnegie Mellon University Joint Institute of Engineering, China
affiliation (former): University of Southern California, Los Angeles, CA, USA
affiliation (former): Chinese Academy of Sciences, Institute of Acoustics, China

Other persons with the same name

Ming Li — disambiguation page
Ming Li 0001 — University of Waterloo, ON, Canada (and 3 more)
Ming Li 0002 — East China Normal University, Shanghai, China (and 1 more)
Ming Li 0003 (aka: Ming (Fred) Li) — University of Arizona, Tucson, AZ, USA (and 2 more)
Ming Li 0004 — Xidian University, National Key Lab of Radar Signal Processing, Xi'an, China
Ming Li 0005 — Nanjing University, National Key Laboratory for Novel Software Technology, China
Ming Li 0006 — University of Texas at Arlington, TX, USA (and 2 more)
Ming Li 0007 — California State University, Fresno, CA, USA (and 1 more)
Ming Li 0008 — Worcester Polytechnic Institute, MA, USA
Ming Li 0009 — IBM T. J. Watson Research Center, Yorktown Heights, NY, USA (and 1 more)

Ming Li 0010 — Deakin University, VIC, Australia
Ming Li 0011 — Dalian University of Technology, School of Information and Communication Engineering, China (and 2 more)
Ming Li 0012 — Taiyuan University of Technology, College of Mathematics, China (and 1 more)
Ming Li 0013 — China University of Mining & Technology, Xuzhou, China
Ming Li 0014 — Unilever Corporate Research, Sharnbrook, Bedford, UK
Ming Li 0015 — Lanzhou University of Technology, China
Ming Li 0016 — RWTH Aachen University, Germany
Ming Li 0017 — Zhejiang University, State Key Laboratory of CAD&CG, China
Ming Li 0018 — Max-Planck-Institut für Informatik, Saarbrücken, Germany
Ming Li 0019 — Carleton University, Ottawa, ON, Canada
Ming Li 0020 — Google (and 1 more)
Ming Li 0021 — Oracle (and 1 more)
Ming Li 0022 — Vanderbilt University, Department of Biostatistics, Nashville, TN, USA
Ming Li 0023 — Simon Fraser University, Burnaby, BC, Canada
Ming Li 0024 — Concordia University, Department of Economics, Montreal, QC, Canada
Ming Li 0025 — Chinese Academy of Sciences, Institute of Semiconductors, China (and 1 more)
Ming Li 0028 — National University of Defense Technology, College of Mechatronic Engineering and Automation, Changsha, China
Ming Li 0029 — Beihang University, School of Automation Science and Electrical Engineering, Beijing, China (and 2 more)
Ming Li 0030 — Auburn University MRI Research Center, Auburn, USA
Ming Li 0031 — China University of Mining and Technology, School of Computer Science and Technology, Xuzhou, China
Ming Li 0032 — Heidelberg University, Institute of Geography, Germany
Ming Li 0033 — Chinese Academy of Sciences, Institute of Information Engineering, State Key Laboratory of Information Security, Beijing, China
Ming Li 0034 — Beihang University, Institute of Solid Mechanics, Beijing, China
Ming Li 0035 — Aalto University, Department of Computer Science, Espoo, Finland
Ming Li 0036 — Honghe University, Department of Mathematics, Mengzi, Yunnan, China
Ming Li 0037 — Wuhan University, State Key Laboratory of Information Engineering in Surveying Mapping and Remote Sensing, China
Ming Li 0038 — Second Military Medical University, Changhai Hospital, Department of Orthopaedics, Shanghai, China
Ming Li 0039 — China Jiliang University, Department of Mathematics, Hangzhou, China
Ming Li 0040 — Nanchang University, ISST, China (and 1 more)
Ming Li 0041 — Tianjin Normal University, Tianjin Key Laboratory of Wireless Mobile Communications and Power Transmission, China (and 1 more)
Ming Li 0042 — Hamburg University of Technology, Germany
Ming Li 0043 — Unilever China (and 1 more)
Ming Li 0044 — Colorado School of Mines, Department of Electrical Engineering and Computer Science, Golden, CO, USA
Ming Li 0045 — Shanghai Jiao Tong University, Institute of Image Processing and Pattern Recognition, China
Ming Li 0046 — Yanshan University, College of Electrical Engineering, Qinhuangdao, China
Ming Li 0047 — Beijing Jiaotong University, School of Electronic and Information Engineering, China
Ming Li 0048 — Sun Yat-sen University, School of Geography and Planning, Guangzhou, China
Ming Li 0049 — Jinan University, College of Information Science and Technology, China
Ming Li 0051 — China University of Petroleum, School of Economics and Management, Beijing, China
Ming Li 0052 — National Institutes of Health, Center for Interventional Oncology / National Heart, Lung, and Blood Institute, Bethesda, MD, USA (and 1 more)
Ming Li 0053 — Macquarie University, Sydney, NSW, Australia (and 3 more)
Ming Li 0054 — Beihang University, School of Transportation Science and Engineering / Beijing Advanced Innovation Center for Big Data and Brain Computing, Beijing, China
Ming Li 0055 — Hong Kong Polytechnic University, Department of Industrial and Systems Engineering, Hong Kong (and 2 more)
Ming Li 0056 — Nanchang Hangkong University, MOE Key Laboratory of Nondestructive Testing, China (and 1 more)
Ming Li 0057 — Ocean University of China, College of Engineering, Department of Automation, Qingdao, China (and 1 more)
Ming Li 0058 — Shenyang University of Technology, School of Electrical Engineering, China
Ming Li 0059 — National University of Defense Technology, College of Meteorology and Oceanography, Nanjing, China
Ming Li 0060 — Harbin Engineering University, College of Computer Science and Technology, China
Ming Li 0061 — Rizhao People's Hospital, Department of Nuclear Medicine, China
Ming Li 0062 — CRRC Tangshan Company, Ltd., Tangshan, China
Ming Li 0063 — Harbin Institute of Technology, Communication Research Center, China
Ming Li 0064 — Beijing Institute of Technology, State Key Laboratory of Explosion Science and Technology, China
Ming Li 0065 — Zhejiang Normal University, Department of Computer Science, Jinhua, China (and 2 more)
Ming Li 0066 — National University of Defense Technology, College of Electronic Science and Technology, State Key Laboratory of Complex Electromagnetic Environment Effects on Electronics and Information System, Changsha, China
Ming Li 0067 — Lappeenranta University of Technology, LUT, Laboratory of Intelligent Machines, Department of Mechanical Engineering, Finland
Ming Li 0068 — University of Amsterdam, IRLab, Netherlands (and 1 more)
Ming Li 0069 — Nanjing University, School of Electronic Science and Engineering, China
Ming Li 0070 — China University of Petroleum, School of Science, Qingdao, China (and 1 more)
Ming Li 0071 — Jiangsu Ocean University, Department of Computer Science and Technology, China (and 1 more)
Ming Li 0072 — Wuhan University of Technology, China (and 1 more)
Ming Li 0073 — National University of Singapore, Institute of Data Science, Singapore (and 2 more)

2020 – today

2024
[j38]
  dblp key: journals/csl/CaiL24
  persistent URL: https://dblp.org/rec/journals/csl/CaiL24
Zexin Cai, Ming Li:
Integrating frame-level boundary detection and deepfake detection for locating manipulated regions in partially spoofed audio forgery attacks. Comput. Speech Lang. 85: 101597 (2024)
[j37]
  dblp key: journals/taffco/YuZZL24
  persistent URL: https://dblp.org/rec/journals/taffco/YuZZL24
Chengyan Yu, Dong Zhang, Wei Zou, Ming Li:
Joint Training on Multiple Datasets With Inconsistent Labeling Criteria for Facial Expression Recognition. IEEE Trans. Affect. Comput. 15(3): 1812-1825 (2024)
[j36]
  dblp key: journals/taslp/QinLDL24
  persistent URL: https://dblp.org/rec/journals/taslp/QinLDL24
Xiaoyi Qin, Na Li, Shufei Duan, Ming Li:
Investigating Long-Term and Short-Term Time-Varying Speaker Verification. IEEE ACM Trans. Audio Speech Lang. Process. 32: 3408-3423 (2024)
[j35]
  dblp key: journals/taslp/CaiL24
  persistent URL: https://dblp.org/rec/journals/taslp/CaiL24
Danwei Cai, Ming Li:
Leveraging ASR Pretrained Conformers for Speaker Verification Through Transfer Learning and Knowledge Distillation. IEEE ACM Trans. Audio Speech Lang. Process. 32: 3532-3545 (2024)
[j34]
  dblp key: journals/taslp/WangL24
  persistent URL: https://dblp.org/rec/journals/taslp/WangL24
Weiqing Wang, Ming Li:
Online Neural Speaker Diarization With Target Speaker Tracking. IEEE ACM Trans. Audio Speech Lang. Process. 32: 5078-5091 (2024)
[j33]
  dblp key: journals/tlt/YuWZZCYZDL24
  persistent URL: https://dblp.org/rec/journals/tlt/YuWZZCYZDL24
Chengyan Yu, Shihuan Wang, Dong Zhang, Yingying Zhang, Chaoqun Cen, Zhixiang You, Xiaobing Zou, Hongzhu Deng, Ming Li:
HSVRS: A Virtual Reality System of the Hide-and-Seek Game to Enhance Gaze Fixation Ability for Autistic Children. IEEE Trans. Learn. Technol. 17: 2065-2078 (2024)
[c134]
  dblp key: conf/chi/BeiLWHL0024
  persistent URL: https://dblp.org/rec/conf/chi/BeiLWHL0024
Rongqi Bei, Yajie Liu, Yihe Wang, Yuxuan Huang, Ming Li, Yuhang Zhao, Xin Tong:
StarRescue: the Design and Evaluation of A Turn-Taking Collaborative Game for Facilitating Autistic Children's Social Skills. CHI 2024: 67:1-67:19
[c133]
  dblp key: conf/icassp/Cai024a
  persistent URL: https://dblp.org/rec/conf/icassp/Cai024a
Zexin Cai, Ming Li:
Invertible Voice Conversion with Parallel Data. ICASSP 2024: 10041-10045
[c132]
  dblp key: conf/icassp/LinQZCJWL24
  persistent URL: https://dblp.org/rec/conf/icassp/LinQZCJWL24
Yuke Lin, Xiaoyi Qin, Guoqing Zhao, Ming Cheng, Ning Jiang, Haiying Wu, Ming Li:
Voxblink: A Large Scale Speaker Verification Dataset on Camera. ICASSP 2024: 10271-10275
[c131]
  dblp key: conf/icassp/WangCC024
  persistent URL: https://dblp.org/rec/conf/icassp/WangCC024
Weiqing Wang, Danwei Cai, Ming Cheng, Ming Li:
Joint Inference of Speaker Diarization and ASR with Multi-Stage Information Sharing. ICASSP 2024: 11011-11015
[c130]
  dblp key: conf/icassp/WangCFL24a
  persistent URL: https://dblp.org/rec/conf/icassp/WangCFL24a
Haoxu Wang, Ming Cheng, Qiang Fu, Ming Li:
Robust Wake Word Spotting With Frame-Level Cross-Modal Attention Based Audio-Visual Conformer. ICASSP 2024: 11556-11560
[c129]
  dblp key: conf/icassp/ZengCTLL24
  persistent URL: https://dblp.org/rec/conf/icassp/ZengCTLL24
Bang Zeng, Ming Cheng, Yao Tian, Haifeng Liu, Ming Li:
Efficient Personal Voice Activity Detection with Wake Word Reference Speech. ICASSP 2024: 12241-12245
[c128]
  dblp key: conf/slt/LiLYSZRCNL24
  persistent URL: https://dblp.org/rec/conf/slt/LiLYSZRCNL24
Ze Li, Yuke Lin, Tian Yao, Hongbin Suo, Pengyuan Zhang, Yanzhen Ren, Zexin Cai, Hiromitsu Nishizaki, Ming Li:
The Database and Benchmark For the Source Speaker Tracing Challenge 2024. SLT 2024: 1254-1261
[i59]
  dblp key: journals/corr/abs-2401-01473
  persistent URL: https://dblp.org/rec/journals/corr/abs-2401-01473
Danwei Cai, Zexin Cai, Ming Li:
Self-supervised Reflective Learning through Self-distillation and Online Clustering for Speaker Representation Learning. CoRR abs/2401.01473 (2024)
[i58]
  dblp key: journals/corr/abs-2403-01700
  persistent URL: https://dblp.org/rec/journals/corr/abs-2403-01700
Haoxu Wang, Ming Cheng, Qiang Fu, Ming Li:
Robust Wake Word Spotting With Frame-Level Cross-Modal Attention Based Audio-Visual Conformer. CoRR abs/2403.01700 (2024)
2023
[j32]
  dblp key: journals/bspc/YangZCSLZDDW23
  persistent URL: https://dblp.org/rec/journals/bspc/YangZCSLZDDW23
Yaogen Yang, Haozhe Zhang, Zexin Cai, Yao Shi, Ming Li, Dong Zhang, Xiaojun Ding, Jianhua Deng, Jie Wang:
Electrolaryngeal speech enhancement based on a two stage framework with bottleneck feature refinement and voice conversion. Biomed. Signal Process. Control. 80(Part): 104279 (2023)
[j31]
  dblp key: journals/csl/CaiYL23
  persistent URL: https://dblp.org/rec/journals/csl/CaiYL23
Zexin Cai, Yaogen Yang, Ming Li:
Cross-lingual multi-speaker speech synthesis with limited bilingual training data. Comput. Speech Lang. 77: 101427 (2023)
[j30]
  dblp key: journals/taffco/ChenZLL23
  persistent URL: https://dblp.org/rec/journals/taffco/ChenZLL23
Weicong Chen, Dong Zhang, Ming Li, Dah-Jye Lee:
STCAM: Spatial-Temporal and Channel Attention Module for Dynamic Facial Expression Recognition. IEEE Trans. Affect. Comput. 14(1): 800-810 (2023)
[j29]
  dblp key: journals/taffco/TengZZLL23
  persistent URL: https://dblp.org/rec/journals/taffco/TengZZLL23
Jianing Teng, Dong Zhang, Wei Zou, Ming Li, Dah-Jye Lee:
Typical Facial Expression Network Using a Facial Feature Decoupler and Spatial-Temporal Learning. IEEE Trans. Affect. Comput. 14(2): 1125-1137 (2023)
[j28]
  dblp key: journals/taffco/ChengZXPLLYZXHWYZLLZTDZL23
  persistent URL: https://dblp.org/rec/journals/taffco/ChengZXPLLYZXHWYZLLZTDZL23
Ming Cheng, Yingying Zhang, Yixiang Xie, Yueran Pan, Xiao Li, Wenxing Liu, Chengyan Yu, Dong Zhang, Yu Xing, Xiaoqian Huang, Fang Wang, Cong You, Yuanyuan Zou, Yuchong Liu, Fengjing Liang, Huilin Zhu, Chun Tang, Hongzhu Deng, Xiaobing Zou, Ming Li:
Computer-Aided Autism Spectrum Disorder Diagnosis With Behavior Signal Processing. IEEE Trans. Affect. Comput. 14(4): 2982-3000 (2023)
[j27]
  dblp key: journals/tamd/ZhuZCLL23
  persistent URL: https://dblp.org/rec/journals/tamd/ZhuZCLL23
Zhesi Zhu, Dong Zhang, Cailong Chi, Ming Li, Dah-Jye Lee:
A Complementary Dual-Branch Network for Appearance-Based Gaze Estimation From Low-Resolution Facial Image. IEEE Trans. Cogn. Dev. Syst. 15(3): 1323-1334 (2023)
[j26]
  dblp key: journals/taslp/QinCL23
  persistent URL: https://dblp.org/rec/journals/taslp/QinCL23
Xiaoyi Qin, Danwei Cai, Ming Li:
Robust Multi-Channel Far-Field Speaker Verification Under Different In-Domain Data Availability Scenarios. IEEE ACM Trans. Audio Speech Lang. Process. 31: 71-85 (2023)
[j25]
  dblp key: journals/tmm/Li00L23
  persistent URL: https://dblp.org/rec/journals/tmm/Li00L23
Xiao Li, Dong Zhang, Ming Li, Dah-Jye Lee:
Accurate Head Pose Estimation Using Image Rectification and a Lightweight Convolutional Neural Network. IEEE Trans. Multim. 25: 2239-2251 (2023)
[c127]
  dblp key: conf/apsipa/ZengSWL23
  persistent URL: https://dblp.org/rec/conf/apsipa/ZengSWL23
Bang Zeng, Hongbin Suo, Yulong Wan, Ming Li:
Low-complexity Multi-Channel Speaker Extraction with Pure Speech Cues. APSIPA ASC 2023: 114-118
[c126]
  dblp key: conf/dada/QinWCM023
  persistent URL: https://dblp.org/rec/conf/dada/QinWCM023
Xiaoyi Qin, Xingming Wang, Yanli Chen, Qinglin Meng, Ming Li:
From Speaker Verification to Deepfake Algorithm Recognition: Our Learned Lessons from ADD2023 Track 3. DADA@IJCAI 2023: 107-112
[c125]
  dblp key: conf/icassp/CaiCL23
  persistent URL: https://dblp.org/rec/conf/icassp/CaiCL23
Danwei Cai, Zexin Cai, Ming Li:
Identifying Source Speakers for Voice Conversion Based Spoofing Attacks on Speaker Verification Systems. ICASSP 2023: 1-5
[c124]
  dblp key: conf/icassp/CaiWL23
  persistent URL: https://dblp.org/rec/conf/icassp/CaiWL23
Zexin Cai, Weiqing Wang, Ming Li:
Waveform Boundary Detection for Partially Spoofed Audio. ICASSP 2023: 1-5
[c123]
  dblp key: conf/icassp/CaiWLXH23
  persistent URL: https://dblp.org/rec/conf/icassp/CaiWLXH23
Danwei Cai, Weiqing Wang, Ming Li, Rui Xia, Chuanzeng Huang:
Pretraining Conformer with ASR for Speaker Verification. ICASSP 2023: 1-5
[c122]
  dblp key: conf/icassp/ChengWWFL23
  persistent URL: https://dblp.org/rec/conf/icassp/ChengWWFL23
Ming Cheng, Haoxu Wang, Ziteng Wang, Qiang Fu, Ming Li:
The WHU-Alibaba Audio-Visual Speaker Diarization System for the MISP 2022 Challenge. ICASSP 2023: 1-2
[c121]
  dblp key: conf/icassp/ChengWZQL23
  persistent URL: https://dblp.org/rec/conf/icassp/ChengWZQL23
Ming Cheng, Weiqing Wang, Yucong Zhang, Xiaoyi Qin, Ming Li:
Target-Speaker Voice Activity Detection Via Sequence-to-Sequence Prediction. ICASSP 2023: 1-5
[c120]
  dblp key: conf/icassp/WangCFL23
  persistent URL: https://dblp.org/rec/conf/icassp/WangCFL23
Haoxu Wang, Ming Cheng, Qiang Fu, Ming Li:
The DKU Post-Challenge Audio-Visual Wake Word Spotting System for the 2021 MISP Challenge: Deep Analysis. ICASSP 2023: 1-5
[c119]
  dblp key: conf/icassp/WangWDHL23
  persistent URL: https://dblp.org/rec/conf/icassp/WangWDHL23
Xingming Wang, Hao Wu, Chen Ding, Chuanzeng Huang, Ming Li:
Exploring Universal Singing Speech Language Identification Using Self-Supervised Learning Based Front-End Features. ICASSP 2023: 1-5
[c118]
  dblp key: conf/interspeech/ZengSW023
  persistent URL: https://dblp.org/rec/conf/interspeech/ZengSW023
Bang Zeng, Hongbin Suo, Yulong Wan, Ming Li:
SEF-Net: Speaker Embedding Free Target Speaker Extraction Network. INTERSPEECH 2023: 3452-3456
[c117]
  dblp key: conf/interspeech/WangZSW023
  persistent URL: https://dblp.org/rec/conf/interspeech/WangZSW023
Xingming Wang, Bang Zeng, Hongbin Suo, Yulong Wan, Ming Li:
Robust Audio Anti-spoofing Countermeasure with Joint Training of Front-end and Back-end Models. INTERSPEECH 2023: 4004-4008
[c116]
  dblp key: conf/interspeech/ZhangSW023
  persistent URL: https://dblp.org/rec/conf/interspeech/ZhangSW023
Yucong Zhang, Hongbin Suo, Yulong Wan, Ming Li:
Outlier-aware Inlier Modeling and Multi-scale Scoring for Anomalous Sound Detection via Multitask Learning. INTERSPEECH 2023: 5381-5385
[c115]
  dblp key: conf/prcv/LiuCPYHLZ23
  persistent URL: https://dblp.org/rec/conf/prcv/LiuCPYHLZ23
Wenxing Liu, Ming Cheng, Yueran Pan, Lynn Yuan, Suxiu Hu, Ming Li, Songtian Zeng:
Assessing the Social Skills of Children with Autism Spectrum Disorder via Language-Image Pre-training Models. PRCV (13) 2023: 260-271
[i57]
  dblp key: journals/corr/abs-2303-02348
  persistent URL: https://dblp.org/rec/journals/corr/abs-2303-02348
Haoxu Wang, Ming Cheng, Qiang Fu, Ming Li:
The DKU Post-Challenge Audio-Visual Wake Word Spotting System for the 2021 MISP Challenge: Deep Analysis. CoRR abs/2303.02348 (2023)
[i56]
  dblp key: journals/corr/abs-2304-04497
  persistent URL: https://dblp.org/rec/journals/corr/abs-2304-04497
Yu Hou, Cong Tran, Ming Li, Won-Yong Shin:
Graph Neural Network-Aided Exploratory Learning for Community Detection with Unknown Topology. CoRR abs/2304.04497 (2023)
[i55]
  dblp key: journals/corr/abs-2308-07056
  persistent URL: https://dblp.org/rec/journals/corr/abs-2308-07056
Yuke Lin, Xiaoyi Qin, Ming Cheng, Ning Jiang, Guoqing Zhao, Ming Li:
VoxBlink: X-Large Speaker Verification Dataset on Camera. CoRR abs/2308.07056 (2023)
[i54]
  dblp key: journals/corr/abs-2308-10281
  persistent URL: https://dblp.org/rec/journals/corr/abs-2308-10281
Zexin Cai, Weiqing Wang, Yikang Wang, Ming Li:
The DKU-DUKEECE System for the Manipulation Region Location Task of ADD 2023. CoRR abs/2308.10281 (2023)
[i53]
  dblp key: journals/corr/abs-2309-05396
  persistent URL: https://dblp.org/rec/journals/corr/abs-2309-05396
Haoxu Wang, Fan Yu, Xian Shi, Yuezhang Wang, Shiliang Zhang, Ming Li:
SlideSpeech: A Large-Scale Slide-Enriched Audio-Visual Corpus. CoRR abs/2309.05396 (2023)
[i52]
  dblp key: journals/corr/abs-2309-07500
  persistent URL: https://dblp.org/rec/journals/corr/abs-2309-07500
Yucong Zhang, Hongbin Suo, Yulong Wan, Ming Li:
Outlier-aware Inlier Modeling and Multi-scale Scoring for Anomalous Sound Detection via Multitask Learning. CoRR abs/2309.07500 (2023)
[i51]
  dblp key: journals/corr/abs-2310-08696
  persistent URL: https://dblp.org/rec/journals/corr/abs-2310-08696
Weiqing Wang, Ming Li:
End-to-end Online Speaker Diarization with Target Speaker Tracking. CoRR abs/2310.08696 (2023)
2022
[j24]
  dblp key: journals/ejasmp/XuWCXL22
  persistent URL: https://dblp.org/rec/journals/ejasmp/XuWCXL22
Yanze Xu, Weiqing Wang, Huahua Cui, Mingyang Xu, Ming Li:
Paralinguistic singing attribute recognition using supervised machine learning for describing the classical tenor solo singing voice in vocal pedagogy. EURASIP J. Audio Speech Music. Process. 2022(1): 8 (2022)
[j23]
  dblp key: journals/taslp/CaiWL22
  persistent URL: https://dblp.org/rec/journals/taslp/CaiWL22
Danwei Cai, Weiqing Wang, Ming Li:
Incorporating Visual Information in Audio Based Self-Supervised Speaker Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 30: 1422-1435 (2022)
[j22]
  dblp key: journals/taslp/WangLCL22
  persistent URL: https://dblp.org/rec/journals/taslp/WangLCL22
Weiqing Wang, Qingjian Lin, Danwei Cai, Ming Li:
Similarity Measurement of Segment-Level Speaker Embeddings in Speaker Diarization. IEEE ACM Trans. Audio Speech Lang. Process. 30: 2645-2658 (2022)
[c114]
  dblp key: conf/icassp/ZhangCQL22
  persistent URL: https://dblp.org/rec/conf/icassp/ZhangCQL22
Haozhe Zhang, Zexin Cai, Xiaoyi Qin, Ming Li:
SIG-VC: A Speaker Information Guided Zero-Shot Voice Conversion System for Both Human Beings and Machines. ICASSP 2022: 6567-6571
[c113]
  dblp key: conf/icassp/QinLWSL22
  persistent URL: https://dblp.org/rec/conf/icassp/QinLWSL22
Xiaoyi Qin, Na Li, Chao Weng, Dan Su, Ming Li:
Simple Attention Module Based Speaker Verification with Iterative Noisy Label Detection. ICASSP 2022: 6722-6726
[c112]
  dblp key: conf/icassp/LiYWQWL22
  persistent URL: https://dblp.org/rec/conf/icassp/LiYWQWL22
Qingjian Li, Lin Yang, Xuyang Wang, Xiaoyi Qin, Junjie Wang, Ming Li:
Towards Lightweight Applications: Asymmetric Enroll-Verify Structure for Speaker Verification. ICASSP 2022: 7067-7071
[c111]
  dblp key: conf/icassp/WangL22b
  persistent URL: https://dblp.org/rec/conf/icassp/WangL22b
Weiqing Wang, Ming Li:
Incorporating End-to-End Framework Into Target-Speaker Voice Activity Detection. ICASSP 2022: 8362-8366
[c110]
  dblp key: conf/icassp/WangQL22
  persistent URL: https://dblp.org/rec/conf/icassp/WangQL22
Weiqing Wang, Xiaoyi Qin, Ming Li:
Cross-Channel Attention-Based Target Speaker Voice Activity Detection: Experimental Results for the M2met Challenge. ICASSP 2022: 9171-9175
[c109]
  dblp key: conf/icassp/ChengWWL22
  persistent URL: https://dblp.org/rec/conf/icassp/ChengWWL22
Ming Cheng, Haoxu Wang, Yechen Wang, Ming Li:
The DKU Audio-Visual Wake Word Spotting System for the 2021 MISP Challenge. ICASSP 2022: 9256-9260
[c108]
  dblp key: conf/icpr/PanWJZGZYL22
  persistent URL: https://dblp.org/rec/conf/icpr/PanWJZGZYL22
Yueran Pan, Jiaxin Wu, Ran Ju, Ziang Zhou, Jiayue Gu, Songtian Zeng, Lynn Yuan, Ming Li:
A Multimodal Framework for Automated Teaching Quality Assessment of One-to-many Online Instruction Videos. ICPR 2022: 1777-1783
[c107]
  dblp key: conf/interspeech/Qin0W0022
  persistent URL: https://dblp.org/rec/conf/interspeech/Qin0W0022
Xiaoyi Qin, Na Li, Chao Weng, Dan Su, Ming Li:
Cross-Age Speaker Verification: Learning Age-Invariant Speaker Embeddings. INTERSPEECH 2022: 1436-1440
[c106]
  dblp key: conf/interspeech/WangLL22
  persistent URL: https://dblp.org/rec/conf/interspeech/WangLL22
Weiqing Wang, Ming Li, Qingjian Lin:
Online Target Speaker Voice Activity Detection for Speaker Diarization. INTERSPEECH 2022: 1441-1445
[c105]
  dblp key: conf/interspeech/WangQWXL22
  persistent URL: https://dblp.org/rec/conf/interspeech/WangQWXL22
Xingming Wang, Xiaoyi Qin, Yikang Wang, Yunfei Xu, Ming Li:
The DKU-OPPO System for the 2022 Spoofing-Aware Speaker Verification Challenge. INTERSPEECH 2022: 4396-4400
[c104]
  dblp key: conf/iscslp/WangWNL22
  persistent URL: https://dblp.org/rec/conf/iscslp/WangWNL22
Yikang Wang, Xingming Wang, Hiromitsu Nishizaki, Ming Li:
Low Pass Filtering and Bandwidth Extension for Robust Anti-spoofing Countermeasure Against Codec Variabilities. ISCSLP 2022: 438-442
[c103]
  dblp key: conf/mm/ZhangLWLXWLZ22
  persistent URL: https://dblp.org/rec/conf/mm/ZhangLWLXWLZ22
Yuxiang Zhang, Jingze Lu, Xingming Wang, Zhuo Li, Runqiu Xiao, Wenchao Wang, Ming Li, Pengyuan Zhang:
Deepfake Detection System for the ADD Challenge Track 3.2 Based on Score Fusion. DDAM@MM 2022: 43-52
[c102]
  dblp key: conf/mm/HuaCZLZ22
  persistent URL: https://dblp.org/rec/conf/mm/HuaCZLZ22
Hua Hua, Ziyi Chen, Yuxiang Zhang, Ming Li, Pengyuan Zhang:
Improving Spoofing Capability for End-to-end Any-to-many Voice Conversion. DDAM@MM 2022: 93-100
[c101]
  dblp key: conf/odyssey/ZhangLWYWWL22
  persistent URL: https://dblp.org/rec/conf/odyssey/ZhangLWYWWL22
Yucong Zhang, Qingjian Lin, Weiqing Wang, Lin Yang, Xuyang Wang, Junjie Wang, Ming Li:
Low-Latency Online Speaker Diarization with Graph-Based Label Generation. Odyssey 2022: 162-169
[c100]
  dblp key: conf/odyssey/HeBXLLWXL22
  persistent URL: https://dblp.org/rec/conf/odyssey/HeBXLLWXL22
Jincheng He, Yuanyuan Bao, Na Xu, Hongfeng Li, Shicong Li, Linzhang Wang, Fei Xiang, Ming Li:
Single-Channel Target Speaker Separation Using Joint Training with Target Speaker's Pitch Information. Odyssey 2022: 301-305
[c99]
  dblp key: conf/odyssey/WangJZWWL22
  persistent URL: https://dblp.org/rec/conf/odyssey/WangJZWWL22
Haoxu Wang, Yan Jia, Zeqing Zhao, Xuyang Wang, Junjie Wang, Ming Li:
Generating TTS Based Adversarial Samples for Training Wake-Up Word Detection Systems Against Confusing Words. Odyssey 2022: 402-406
[i50]
  dblp key: journals/corr/abs-2201-00167
  persistent URL: https://dblp.org/rec/journals/corr/abs-2201-00167
Haoxu Wang, Yan Jia, Zeqing Zhao, Xuyang Wang, Junjie Wang, Ming Li:
Generating Adversarial Samples For Training Wake-up Word Detection Systems Against Confusing Words. CoRR abs/2201.00167 (2022)
[i49]
  dblp key: journals/corr/abs-2201-10687
  persistent URL: https://dblp.org/rec/journals/corr/abs-2201-10687
Zexin Cai, Ming Li:
Invertible Voice Conversion. CoRR abs/2201.10687 (2022)
[i48]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-02687
Weiqing Wang, Xiaoyi Qin, Ming Li:
Cross-Channel Attention-Based Target Speaker Voice Activity Detection: Experimental Results for M2MeT Challenge. CoRR abs/2202.02687 (2022)
[i47]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-08525
Bang Zeng, Weiqing Wang, Yuanyuan Bao, Ming Li:
Simultaneous Speech Extraction for Multiple Target Speakers under the Meeting Scenarios(V1). CoRR abs/2206.08525 (2022)
[i46]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-09103
Danwei Cai, Zexin Cai, Ming Li:
Identifying Source Speakers for Voice Conversion based Spoofing Attacks on Speaker Verification Systems. CoRR abs/2206.09103 (2022)
[i45]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2207-05920
Weiqing Wang, Qingjian Lin, Ming Li:
Online Target Speaker Voice Activity Detection for Speaker Diarization. CoRR abs/2207.05920 (2022)
[i44]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2207-05929
Xiaoyi Qin, Na Li, Chao Weng, Dan Su, Ming Li:
Cross-Age Speaker Verification: Learning Age-Invariant Speaker Embeddings. CoRR abs/2207.05929 (2022)
[i43]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2207-07510
Xingming Wang, Xiaoyi Qin, Yikang Wang, Yunfei Xu, Ming Li:
The DKU-OPPO System for the 2022 Spoofing-Aware Speaker Verification Challenge. CoRR abs/2207.07510 (2022)
[i42]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-05092
Xiaoyi Qin, Na Li, Yuke Lin, Yiwei Ding, Chao Weng, Dan Su, Ming Li:
The DKU-Tencent System for the VoxCeleb Speaker Recognition Challenge 2022. CoRR abs/2210.05092 (2022)
[i41]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-16127
Ming Cheng, Weiqing Wang, Yucong Zhang, Xiaoyi Qin, Ming Li:
Target-Speaker Voice Activity Detection via Sequence-to-Sequence Prediction. CoRR abs/2210.16127 (2022)
[i40]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-00226
Zexin Cai, Weiqing Wang, Ming Li:
Waveform Boundary Detection for Partially Spoofed Audio. CoRR abs/2211.00226 (2022)
[i39]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-06546
Yikang Wang, Xingming Wang, Hiromitsu Nishizaki, Ming Li:
Low Pass Filtering and Bandwidth Extension for Robust Anti-spoofing Countermeasure Against Codec Variabilities. CoRR abs/2211.06546 (2022)
2021
[j21]
  persistent URL:
  - https://dblp.org/rec/journals/ficn/LiuLZR21
Wenbo Liu, Ming Li, Xiaobing Zou, Bhiksha Raj:
Discriminative Dictionary Learning for Autism Spectrum Disorder Identification. Frontiers Comput. Neurosci. 15: 662401 (2021)
[j20]
  persistent URL:
  - https://dblp.org/rec/journals/taffco/LiXHSLL21
Ming Li, Hao Xu, Xingchang Huang, Zhanmei Song, Xiaolin Liu, Xin Li:
Facial Expression Recognition with Identity and Emotion Joint Learning. IEEE Trans. Affect. Comput. 12(2): 544-550 (2021)
[j19]
  persistent URL:
  - https://dblp.org/rec/journals/taslp/WangPYSL21
Weiqing Wang, Jin Pan, Hua Yi, Zhanmei Song, Ming Li:
Audio-Based Piano Performance Evaluation for Beginners With Convolutional Neural Network and Attention Mechanism. IEEE ACM Trans. Audio Speech Lang. Process. 29: 1119-1133 (2021)
[c98]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/CaiWL21
Danwei Cai, Weiqing Wang, Ming Li:
An Iterative Framework for Self-Supervised Deep Speaker Representation Learning. ICASSP 2021: 6728-6732
[c97]
  persistent URL:
  - https://dblp.org/rec/conf/icmi/ChuWJJWLD21
Huangrui Chu, Yechen Wang, Ran Ju, Yan Jia, Haoxu Wang, Ming Li, Qi Deng:
Call For Help Detection In Emergent Situations Using Keyword Spotting And Paralinguistic Analysis. ICMI Companion 2021: 104-111
[c96]
  persistent URL:
  - https://dblp.org/rec/conf/icmi/JuCWDCL21
Ran Ju, Huangrui Chu, Yechen Wang, Qi Deng, Ming Cheng, Ming Li:
A Multimodal Dynamic Neural Network for Call for Help Recognition in Elevators. ICMI Companion 2021: 112-120
[c95]
  persistent URL:
  - https://dblp.org/rec/conf/icmi/ChenGCDL21
Xinmeng Chen, Xuchen Gong, Ming Cheng, Qi Deng, Ming Li:
Cross-modal Assisted Training for Abnormal Event Recognition in Elevators. ICMI 2021: 530-538
[c94]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhuQL21
Tinglong Zhu, Xiaoyi Qin, Ming Li:
Binary Neural Network for Speaker Verification. Interspeech 2021: 86-90
[c93]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangCWLWHL21
Weiqing Wang, Danwei Cai, Jin Wang, Qingjian Lin, Xuyang Wang, Mi Hong, Ming Li:
The DKU-Duke-Lenovo System Description for the Fearless Steps Challenge Phase III. Interspeech 2021: 1044-1048
[c92]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/QinWMLZL21
Xiaoyi Qin, Chao Wang, Yong Ma, Min Liu, Shilei Zhang, Ming Li:
Our Learned Lessons from Cross-Lingual Speaker Verification: The CRMI-DKU System Description for the Short-Duration Speaker Verification Challenge 2021. Interspeech 2021: 2317-2321
[c91]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ShiBXZL21
Yao Shi, Hui Bu, Xin Xu, Shaoji Zhang, Ming Li:
AISHELL-3: A Multi-Speaker Mandarin TTS Corpus. Interspeech 2021: 2756-2760
[c90]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/JiaWQZWWZL21
Yan Jia, Xingming Wang, Xiaoyi Qin, Yinping Zhang, Xuyang Wang, Junjie Wang, Dong Zhang, Ming Li:
The 2020 Personalized Voice Trigger Challenge: Open Datasets, Evaluation Metrics, Baseline System and Results. Interspeech 2021: 4239-4243
[c89]
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/LiCHL21
Tingle Li, Jiawei Chen, Haowen Hou, Ming Li:
Sams-Net: A Sliced Attention-based Neural Network for Music Source Separation. ISCSLP 2021: 1-5
[c88]
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/MaWWYWL21
Murong Ma, Haiwei Wu, Xuyang Wang, Lin Yang, Junjie Wang, Ming Li:
Acoustic Word Embedding System for Code-Switching Query-by-example Spoken Term Detection. ISCSLP 2021: 1-5
[c87]
  persistent URL:
  - https://dblp.org/rec/conf/slt/CaiL21
Danwei Cai, Ming Li:
Embedding Aggregation for Far-Field Speaker Verification with Distributed Microphone Arrays. SLT 2021: 308-315
[i38]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2102-03649
Weiqing Wang, Qingjian Lin, Danwei Cai, Lin Yang, Ming Li:
The DKU-Duke-Lenovo System Description for the Third DIHARD Speech Diarization Challenge. CoRR abs/2102.03649 (2021)
[i37]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-02306
Tinglong Zhu, Xiaoyi Qin, Ming Li:
Binary Neural Network for Speaker Verification. CoRR abs/2104.02306 (2021)
[i36]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-06004
Ziang Zhou, Yanze Xu, Shilei Zhang, Ming Li:
Detecting Escalation Level from Speech with Transfer Learning and Acoustic-Lexical Information Fusion. CoRR abs/2104.06004 (2021)
[i35]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-10832
Yaogen Yang, Haozhe Zhang, Xiaoyi Qin, Shanshan Liang, Huahua Cui, Mingyang Xu, Ming Li:
Building Bilingual and Code-Switched Voice Conversion with Limited Training Data Using Embedding Consistency Loss. CoRR abs/2104.10832 (2021)
[i34]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-02934
Yuanyuan Bao, Yanze Xu, Na Xu, Wenjing Yang, Hongfeng Li, Shicong Li, Yongtao Jia, Fei Xiang, Jincheng He, Ming Li:
Lightweight Dual-channel Target Speaker Separation for Mobile Voice Communication. CoRR abs/2106.02934 (2021)
[i33]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2109-02002
Weiqing Wang, Danwei Cai, Qingjian Lin, Lin Yang, Junjie Wang, Jin Wang, Ming Li:
The DKU-DukeECE-Lenovo System for the Diarization Task of the 2021 VoxCeleb Speaker Recognition Challenge. CoRR abs/2109.02002 (2021)
[i32]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2109-02853
Danwei Cai, Ming Li:
The DKU-DukeECE System for the Self-Supervision Speaker Verification Task of the 2021 VoxCeleb Speaker Recognition Challenge. CoRR abs/2109.02853 (2021)
[i31]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-04438
Qingjian Lin, Lin Yang, Xuyang Wang, Xiaoyi Qin, Junjie Wang, Ming Li:
Towards Lightweight Applications: Asymmetric Enroll-Verify Structure for Speaker Verification. CoRR abs/2110.04438 (2021)
[i30]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-06534
Xiaoyi Qin, Na Li, Chao Weng, Dan Su, Ming Li:
Simple Attention Module based Speaker Verification with Iterative noisy label detection. CoRR abs/2110.06534 (2021)
[i29]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2111-03811
Haozhe Zhang, Zexin Cai, Xiaoyi Qin, Ming Li:
SIG-VC: A Speaker Information Guided Zero-shot Voice Conversion System for Both Human Beings and Machines. CoRR abs/2111.03811 (2021)
[i28]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2111-13803
Yucong Zhang, Qingjian Lin, Weiqing Wang, Lin Yang, Xuyang Wang, Junjie Wang, Ming Li:
Online Speaker Diarization with Graph-based Label Generation. CoRR abs/2111.13803 (2021)
2020
[j18]
  persistent URL:
  - https://dblp.org/rec/journals/taslp/CaiCZL20
Weicheng Cai, Jinkun Chen, Jun Zhang, Ming Li:
On-the-Fly Data Loader and Utterance-Level Aggregation for Speaker and Language Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 28: 1038-1051 (2020)
[c86]
  persistent URL:
  - https://dblp.org/rec/conf/blizzard/Cai020
Zexin Cai, Ming Li:
The Duke Entry for 2020 Blizzard Challenge. Blizzard Challenge / Voice Conversion Challenge 2020
[c85]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/CaiCL20
Danwei Cai, Weicheng Cai, Ming Li:
Within-Sample Variability-Invariant Loss for Robust Speaker Recognition Under Noisy Environments. ICASSP 2020: 6469-6473
[c84]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/QinBL20
Xiaoyi Qin, Hui Bu, Ming Li:
HI-MIA: A Far-Field Text-Dependent Speaker Verification Database and the Baselines. ICASSP 2020: 7609-7613
[c83]
  persistent URL:
  - https://dblp.org/rec/conf/icpr/PanCCZL20
Yueran Pan, Kunjing Cai, Ming Cheng, Xiaobing Zou, Ming Li:
Responsive Social Smile: A Machine Learning based Multimodal Behavior Assessment Framework towards Early Stage Autism Screening. ICPR 2020: 2240-2247
[c82]
  persistent URL:
  - https://dblp.org/rec/conf/icpr/ChengCL20
Ming Cheng, Kunjing Cai, Ming Li:
RWF-2000: An Open Large Scale Video Database for Violence Detection. ICPR 2020: 4183-4190
[c81]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LinHL20
Qingjian Lin, Yu Hou, Ming Li:
Self-Attentive Similarity Measurement Strategies in Speaker Diarization. INTERSPEECH 2020: 284-288
[c80]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiLBL20
Tingle Li, Qingjian Lin, Yuanyuan Bao, Ming Li:
Atss-Net: Target Speaker Separation via Attention-Based Neural Network. INTERSPEECH 2020: 1411-1415
[c79]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LinLL20
Qingjian Lin, Tingle Li, Ming Li:
The DKU Speech Activity Detection and Speaker Identification Systems for Fearless Steps Challenge Phase-02. INTERSPEECH 2020: 2607-2611
[c78]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/QinLBRDN020
Xiaoyi Qin, Ming Li, Hui Bu, Wei Rao, Rohan Kumar Das, Shrikanth Narayanan, Haizhou Li:
The INTERSPEECH 2020 Far-Field Speaker Verification Challenge. INTERSPEECH 2020: 3456-3460
[c77]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/CaiZL20
Zexin Cai, Chuxiong Zhang, Ming Li:
From Speaker Verification to Multispeaker Speech Synthesis, Deep Transfer with Feedback Constraint. INTERSPEECH 2020: 3974-3978
[c76]
  persistent URL:
  - https://dblp.org/rec/conf/odyssey/LinCYWZL20
Qingjian Lin, Weicheng Cai, Lin Yang, Junjie Wang, Jun Zhang, Ming Li:
DIHARD II is Still Hard: Experimental Results and Discussions from the DKU-LENOVO Team. Odyssey 2020: 102-109
[c75]
  persistent URL:
  - https://dblp.org/rec/conf/odyssey/LinLYWL20
Qingjian Lin, Tingle Li, Lin Yang, Junjie Wang, Ming Li:
Optimal Mapping Loss: A Faster Loss for End-to-End Speaker Diarization. Odyssey 2020: 125-131
[i27]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2002-00387
Xiaoyi Qin, Ming Li, Hui Bu, Rohan Kumar Das, Wei Rao, Shrikanth Narayanan, Haizhou Li:
The FFSVC 2020 Evaluation Plan. CoRR abs/2002.00387 (2020)
[i26]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2002-00924
Danwei Cai, Weicheng Cai, Ming Li:
Within-sample variability-invariant loss for robust speaker recognition under noisy environments. CoRR abs/2002.00924 (2020)
[i25]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2002-12761
Qingjian Lin, Weicheng Cai, Lin Yang, Junjie Wang, Jun Zhang, Ming Li:
DIHARD II is Still Hard: Experimental Results and Discussions from the DKU-LENOVO Team. CoRR abs/2002.12761 (2020)
[i24]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2005-03633
Haiwei Wu, Yan Jia, Yuanfei Nie, Ming Li:
Mutli-task Learning with Alignment Loss for Far-field Small-Footprint Keyword Spotting. CoRR abs/2005.03633 (2020)
[i23]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2005-04587
Zexin Cai, Chuxiong Zhang, Ming Li:
From Speaker Verification to Multispeaker Speech Synthesis, Deep Transfer with Feedback Constraint. CoRR abs/2005.04587 (2020)
[i22]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2005-08046
Xiaoyi Qin, Ming Li, Hui Bu, Wei Rao, Rohan Kumar Das, Shrikanth Narayanan, Haizhou Li:
The INTERSPEECH 2020 Far-Field Speaker Verification Challenge. CoRR abs/2005.08046 (2020)
[i21]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2005-09200
Tingle Li, Qingjian Lin, Yuanyuan Bao, Ming Li:
Atss-Net: Target Speaker Separation via Attention-based Neural Network. CoRR abs/2005.09200 (2020)
[i20]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2005-10441
Zexin Cai, Yaogen Yang, Ming Li:
Cross-lingual Multispeaker Text-to-Speech under Limited-Data Scenario. CoRR abs/2005.10441 (2020)
[i19]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2005-11777
Murong Ma, Haiwei Wu, Xuyang Wang, Lin Yang, Junjie Wang, Ming Li:
Acoustic Word Embedding System for Code-Switching Query-by-example Spoken Term Detection. CoRR abs/2005.11777 (2020)
[i18]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2008-05175
Haiwei Wu, Lin Zhang, Lin Yang, Xuyang Wang, Junjie Wang, Dong Zhang, Ming Li:
Mask Detection and Breath Monitoring from Speech: on Data Augmentation, Feature Representation and Modeling. CoRR abs/2008.05175 (2020)
[i17]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-11567
Yao Shi, Hui Bu, Xin Xu, Shaoji Zhang, Ming Li:
AISHELL-3: A Multi-speaker Mandarin TTS Corpus and the Baselines. CoRR abs/2010.11567 (2020)
[i16]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-14751
Danwei Cai, Weiqing Wang, Ming Li:
An iterative framework for self-supervised deep speaker representation learning. CoRR abs/2010.14751 (2020)
[i15]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2011-01460
Yan Jia, Zexin Cai, Murong Ma, Zeqing Zhao, Xuyang Wang, Junjie Wang, Ming Li:
Training Wake Word Detection with Synthesized Speech Data on Confusion Words. CoRR abs/2011.01460 (2020)
[i14]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2011-10710
Xiaoyi Qin, Yaogen Yang, Lin Yang, Xuyang Wang, Junjie Wang, Ming Li:
Exploring Voice Conversion based Data Augmentation in Text-Dependent Speaker Verification. CoRR abs/2011.10710 (2020)

2010 – 2019

2019
[j17]
  persistent URL:
  - https://dblp.org/rec/journals/csl/LiTZZZCZ19
Ming Li, Dengke Tang, Junlin Zeng, Tianyan Zhou, Huilin Zhu, Biyuan Chen, Xiaobing Zou:
An automated assessment framework for atypical prosody and stereotyped idiosyncratic phrases related to autism spectrum disorder. Comput. Speech Lang. 56: 80-94 (2019)
[j16]
  persistent URL:
  - https://dblp.org/rec/journals/tvt/LiHLL19
Zhicheng Li, Bin Hu, Ming Li, Gengnan Luo:
String Stability Analysis for Vehicle Platooning Under Unreliable Communication Links With Event-Triggered Strategy. IEEE Trans. Veh. Technol. 68(3): 2152-2164 (2019)
[c74]
  persistent URL:
  - https://dblp.org/rec/conf/acii/TengZLH19
Jianing Teng, Dong Zhang, Ming Li, Yudong Huang:
Facial Expression Recognition with Identity and Spatial-temporal Integrated Learning. ACII Workshops 2019: 100-104
[c73]
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/WangW019
Weiqing Wang, Haiwei Wu, Ming Li:
Deep Neural Networks with Batch Speaker Normalization for Intoxicated Speech Detection. APSIPA 2019: 1323-1327
[c72]
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/WuCLGZLH19
Haiwei Wu, Weicheng Cai, Ming Li, Ji Gao, Shanshan Zhang, Zhiqiang Lyu, Shen Huang:
DKU-Tencent Submission to Oriental Language Recognition AP18-OLR Challenge. APSIPA 2019: 1646-1651
[c71]
  persistent URL:
  - https://dblp.org/rec/conf/blizzard/CaiZY019
Zexin Cai, Chuxiong Zhang, Yaogen Yang, Ming Li:
The DKU Speech Synthesis System for 2019 Blizzard Challenge. Blizzard Challenge 2019
[c70]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/CaiCHL19
Weicheng Cai, Danwei Cai, Shen Huang, Ming Li:
Utterance-level End-to-end Language Identification Using Attention-based CNN-BLSTM. ICASSP 2019: 5991-5995
[c69]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/CaiXL19
Zexin Cai, Zhicheng Xu, Ming Li:
F0 Contour Estimation Using Phonetic Feature in Electrolaryngeal Speech Enhancement. ICASSP 2019: 6490-6494
[c68]
  persistent URL:
  - https://dblp.org/rec/conf/icira/SunLLZL19
Sheng Sun, Shuangmei Li, Wenbo Liu, Xiaobing Zou, Ming Li:
Fixation Based Object Recognition in Autism Clinic Setting. ICIRA (4) 2019: 615-628
[c67]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LinYLBB19
Qingjian Lin, Ruiqing Yin, Ming Li, Hervé Bredin, Claude Barras:
LSTM Based Similarity Measurement with Spectral Clustering for Speaker Diarization. INTERSPEECH 2019: 366-370
[c66]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/CaiWC019
Weicheng Cai, Haiwei Wu, Danwei Cai, Ming Li:
The DKU Replay Detection System for the ASVspoof 2019 Challenge: On Data Augmentation, Feature Representation, Classification, and Fusion. INTERSPEECH 2019: 1023-1027
[c65]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/CaiYZQL19
Zexin Cai, Yaogen Yang, Chuxiong Zhang, Xiaoyi Qin, Ming Li:
Polyphone Disambiguation for Mandarin Chinese Using Conditional Neural Network with Multi-Level Embedding Features. INTERSPEECH 2019: 2110-2114
[c64]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WuWL19
Haiwei Wu, Weiqing Wang, Ming Li:
The DKU-LENOVO Systems for the INTERSPEECH 2019 Computational Paralinguistic Challenge. INTERSPEECH 2019: 2433-2437
[c63]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/CaiQCL19
Danwei Cai, Xiaoyi Qin, Weicheng Cai, Ming Li:
The DKU System for the Speaker Recognition Task of the 2019 VOiCES from a Distance Challenge. INTERSPEECH 2019: 2493-2497
[c62]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/CaiCL19
Danwei Cai, Weicheng Cai, Ming Li:
The DKU-SMIIP System for NIST 2018 Speaker Recognition Evaluation. INTERSPEECH 2019: 4370-4374
[c61]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiCC19
Ming Li, Weicheng Cai, Danwei Cai:
Survey Talk: End-to-End Deep Neural Network Based Speaker and Language Recognition. INTERSPEECH 2019
[c60]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/QinCL19
Xiaoyi Qin, Danwei Cai, Ming Li:
Far-Field End-to-End Text-Dependent Speaker Verification Based on Mixed Training Data with Transfer Learning and Enrollment Data Augmentation. INTERSPEECH 2019: 4045-4049
[c59]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/CaiQL19
Danwei Cai, Xiaoyi Qin, Ming Li:
Multi-Channel Training for End-to-End Speaker Recognition Under Reverberant and Noisy Environment. INTERSPEECH 2019: 4365-4369
[i13]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1902-07374
Weicheng Cai, Danwei Cai, Shen Huang, Ming Li:
Utterance-level end-to-end language identification using attention-based CNN-BLSTM. CoRR abs/1902.07374 (2019)
[i12]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1907-01749
Zexin Cai, Yaogen Yang, Chuxiong Zhang, Xiaoyi Qin, Ming Li:
Polyphone Disambiguation for Mandarin Chinese Using Conditional Neural Network with Multi-level Embedding Features. CoRR abs/1907.01749 (2019)
[i11]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1907-02663
Weicheng Cai, Haiwei Wu, Danwei Cai, Ming Li:
The DKU Replay Detection System for the ASVspoof 2019 Challenge: On Data Augmentation, Feature Representation, Classification, and Fusion. CoRR abs/1907.02663 (2019)
[i10]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1907-10393
Qingjian Lin, Ruiqing Yin, Ming Li, Hervé Bredin, Claude Barras:
LSTM based Similarity Measurement with Spectral Clustering for Speaker Diarization. CoRR abs/1907.10393 (2019)
[i9]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1911-05913
Ming Cheng, Kunjing Cai, Ming Li:
RWF-2000: An Open Large Scale Video Database for Violence Detection. CoRR abs/1911.05913 (2019)
[i8]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1912-01231
Xiaoyi Qin, Hui Bu, Ming Li:
HI-MIA: A Far-field Text-Dependent Speaker Verification Database and the Baselines. CoRR abs/1912.01231 (2019)
2018
[j15]
  persistent URL:
  - https://dblp.org/rec/journals/pr/CheeJCLYLG18
Kong-Yik Chee, Zhe Jin, Danwei Cai, Ming Li, Wun-She Yap, Yen-Lung Lai, Bok-Min Goi:
Cancellable speech template via random binary orthogonal matrices projection hashing. Pattern Recognit. 76: 273-287 (2018)
[c58]
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/CaiCL18
Danwei Cai, Zexin Cai, Ming Li:
Deep Speaker Embeddings with Convolutional Neural Network on Supervector for Text-Independent Speaker Recognition. APSIPA 2018: 1478-1482
[c57]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/CaiCZWL18
Weicheng Cai, Zexin Cai, Xiang Zhang, Xiaoqi Wang, Ming Li:
A Novel Learnable Dictionary Encoding Layer for End-to-End Language Identification. ICASSP 2018: 5189-5193
[c56]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/CaiCLWL18
Weicheng Cai, Zexin Cai, Wenbo Liu, Xiaoqi Wang, Ming Li:
Insights into End-to-End Learning Scheme for Language Identification. ICASSP 2018: 5209-5213
[c55]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/CaiCL18
Weicheng Cai, Jinkun Chen, Ming Li:
Analysis of Length Normalization in End-to-End Speaker Verification System. INTERSPEECH 2018: 3618-3622
[c54]
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/WuLCZ18
Haiwei Wu, Ming Li, Zexin Cai, Haibin Zhong:
Unsupervised query by example spoken term detection using features concatenated with Self-Organizing Map distances. ISCSLP 2018: 1-5
[c53]
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/CaiQCLLZ18
Zexin Cai, Xiaoyi Qin, Danwei Cai, Ming Li, Xinzhong Liu, Haibin Zhong:
The DKU-JNU-EMA Electromagnetic Articulography Database on Mandarin and Chinese Dialects with Tandem Feature based Acoustic-to-Articulatory Inversion. ISCSLP 2018: 235-239
[c52]
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/ChenCCCZL18
Jinkun Chen, Weicheng Cai, Danwei Cai, Zexin Cai, Haibin Zhong, Ming Li:
End-to-end Language Identification using NetFV and NetVLAD. ISCSLP 2018: 319-323
[c51]
  persistent URL:
  - https://dblp.org/rec/conf/odyssey/CaiCL18
Weicheng Cai, Jinkun Chen, Ming Li:
Exploring the Encoding Layer and Loss Function in End-to-End Speaker and Language Recognition System. Odyssey 2018: 74-81
[i7]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1804-00381
Weicheng Cai, Zexin Cai, Wenbo Liu, Xiaoqi Wang, Ming Li:
Insights into End-to-End Learning Scheme for Language Identification. CoRR abs/1804.00381 (2018)
[i6]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1804-00385
Weicheng Cai, Zexin Cai, Xiang Zhang, Xiaoqi Wang, Ming Li:
A Novel Learnable Dictionary Encoding Layer for End-to-End Language Identification. CoRR abs/1804.00385 (2018)
[i5]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1804-05160
Weicheng Cai, Jinkun Chen, Ming Li:
Exploring the Encoding Layer and Loss Function in End-to-End Speaker and Language Recognition System. CoRR abs/1804.05160 (2018)
[i4]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1806-03209
Weicheng Cai, Jinkun Chen, Ming Li:
Analysis of Length Normalization in End-to-End Speaker Verification System. CoRR abs/1806.03209 (2018)
[i3]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1809-02906
Jinkun Chen, Weicheng Cai, Danwei Cai, Zexin Cai, Haibin Zhong, Ming Li:
End-to-end Language Identification using NetFV and NetVLAD. CoRR abs/1809.02906 (2018)
2017
[j14]
  persistent URL:
  - https://dblp.org/rec/journals/tsg/XuYGLD17
Yinliang Xu, Zaiyue Yang, Wei Gu, Ming Li, Zicong Deng:
Robust Real-Time Distributed Optimal Control Based Energy Management in a Smart Grid. IEEE Trans. Smart Grid 8(4): 1568-1579 (2017)
[c50]
  persistent URL:
  - https://dblp.org/rec/conf/acii/LiuZZZL17
Wenbo Liu, Tianyan Zhou, Chenghao Zhang, Xiaobing Zou, Ming Li:
Response to name: A dataset and a multimodal machine learning framework towards autism study. ACII 2017: 178-183
[c49]
  persistent URL:
  - https://dblp.org/rec/conf/acii/ChenLL17
Jinkun Chen, Cong Liu, Ming Li:
Automatic emotional spoken language text corpus construction from written dialogs in fictions. ACII 2017: 319-324
[c48]
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/LiWXC17
Ming Li, Luting Wang, Zhicheng Xu, Danwei Cai:
Mandarin electrolaryngeal voice conversion with combination of Gaussian mixture model and non-negative matrix factorization. APSIPA 2017: 1360-1363
[c47]
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/LiuWYLRS17
Weiyang Liu, Yandong Wen, Zhiding Yu, Ming Li, Bhiksha Raj, Le Song:
SphereFace: Deep Hypersphere Embedding for Face Recognition. CVPR 2017: 6738-6746
[c46]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/CaiCLLL17
Weicheng Cai, Danwei Cai, Wenbo Liu, Gang Li, Ming Li:
Countermeasures for Automatic Speaker Verification Replay Spoofing Attack: On Data Augmentation, Feature Representation, Classification and Fusion. INTERSPEECH 2017: 17-21
[c45]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/CaiNLCLL17
Danwei Cai, Zhidong Ni, Wenbo Liu, Weicheng Cai, Gang Li, Ming Li:
End-to-End Deep Learning Framework for Speech Paralinguistics Detection Based on Perception Aware Spectrum. INTERSPEECH 2017: 3452-3456
[i2]
  persistent URL:
  - https://dblp.org/rec/journals/corr/LiuWYLRS17
Weiyang Liu, Yandong Wen, Zhiding Yu, Ming Li, Bhiksha Raj, Le Song:
SphereFace: Deep Hypersphere Embedding for Face Recognition. CoRR abs/1704.08063 (2017)
2016
[j13]
  persistent URL:
  - https://dblp.org/rec/journals/csl/LiKLGRN16
Ming Li, Jangwon Kim, Adam C. Lammert, Prasanta Kumar Ghosh, Vikram Ramanarayanan, Shrikanth S. Narayanan:
Speaker verification based on the fusion of speech acoustics and inverted articulatory signals. Comput. Speech Lang. 36: 196-211 (2016)
[j12]
  persistent URL:
  - https://dblp.org/rec/journals/vlsisp/LiLCL16
Ming Li, Lun Liu, Weicheng Cai, Wenbo Liu:
Generalized I-vector Representation with Phonetic Tokenizations and Tandem Features for both Text Independent and Text Dependent Speaker Verification. J. Signal Process. Syst. 82(2): 207-215 (2016)
[c44]
  persistent URL:
  - https://dblp.org/rec/conf/aaai/YuLLYLK16
Zhiding Yu, Weiyang Liu, Wenbo Liu, Yingzhen Yang, Ming Li, B. V. K. Vijaya Kumar:
On Order-Constrained Transitive Distance Clustering. AAAI 2016: 2293-2299
[c43]
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/CaiCNL16
Danwei Cai, Weicheng Cai, Zhidong Ni, Ming Li:
Locality sensitive discriminant analysis for speaker verification. APSIPA 2016: 1-5
[c42]
  persistent URL:
  - https://dblp.org/rec/conf/ccpr/HeCLL16
Gaoyuan He, Jinkun Chen, Xuebo Liu, Ming Li:
The SYSU System for CCPR 2016 Multimodal Emotion Recognition Challenge. CCPR (2) 2016: 707-720
[c41]
  persistent URL:
  - https://dblp.org/rec/conf/icpr/ZhengCZZL16
Huadi Zheng, Weicheng Cai, Tianyan Zhou, Shilei Zhang, Ming Li:
Text-independent voice conversion using deep neural network based phonetic level features. ICPR 2016: 2872-2877
[c40]
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/ZhouCCZZL16
Tianyan Zhou, Weicheng Cai, Xiaoyan Chen, Xiaobing Zou, Shilei Zhang, Ming Li:
Speaker diarization system for autism children's real-life audio data. ISCSLP 2016: 1-5
2015
[j11]
  persistent URL:
  - https://dblp.org/rec/journals/csl/Kim0TLN15
Jangwon Kim, Naveen Kumar, Andreas Tsiartas, Ming Li, Shrikanth S. Narayanan:
Automatic intelligibility classification of sentence-level pathological speech. Comput. Speech Lang. 29(1): 132-144 (2015)
[c39]
  persistent URL:
  - https://dblp.org/rec/conf/acii/LiuYYZRL15
Wenbo Liu, Li Yi, Zhiding Yu, Xiaobing Zou, Bhiksha Raj, Ming Li:
Efficient autism spectrum disorder prediction with eye movement: A machine learning framework. ACII 2015: 649-655
[c38]
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/WengCYWCLZL15
Shitao Weng, Shushan Chen, Lei Yu, Xuewei Wu, Weicheng Cai, Zhi Liu, Yiming Zhou, Ming Li:
The SYSU system for the interspeech 2015 automatic speaker verification spoofing and countermeasures challenge. APSIPA 2015: 152-155
[c37]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/CaiLLH15
Weicheng Cai, Ming Li, Lin Li, Qingyang Hong:
Duration dependent covariance regularization in PLDA modeling for speaker verification. INTERSPEECH 2015: 1027-1031
[c36]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HongLLHWZ15
Qingyang Hong, Lin Li, Ming Li, Ling Huang, Lihong Wan, Jun Zhang:
Modified-prior PLDA and score calibration for duration mismatch compensation in speaker recognition system. INTERSPEECH 2015: 1037-1041
[c35]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangZLLK15
Yingxue Wang, Shenghui Zhao, Wenbo Liu, Ming Li, Jingming Kuang:
Speech bandwidth expansion based on deep neural networks. INTERSPEECH 2015: 2593-2597
[c34]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiuYRL15
Wenbo Liu, Zhiding Yu, Bhiksha Raj, Ming Li:
Locality constrained transitive distance clustering on speech data. INTERSPEECH 2015: 2917-2921
[i1]
  persistent URL:
  - https://dblp.org/rec/journals/corr/WengCYWCLL15
Shitao Weng, Shushan Chen, Lei Yu, Xuewei Wu, Weicheng Cai, Zhi Liu, Ming Li:
The SYSU System for the Interspeech 2015 Automatic Speaker Verification Spoofing and Countermeasures Challenge. CoRR abs/1507.06711 (2015)
2014
[j10]
  persistent URL:
  - https://dblp.org/rec/journals/csl/BoneLBN14
Daniel Bone, Ming Li, Matthew P. Black, Shrikanth S. Narayanan:
Intoxicated speech detection: A fusion framework with speaker-normalized hierarchical functionals and GMM supervectors. Comput. Speech Lang. 28(2): 375-391 (2014)
[j9]
  persistent URL:
  - https://dblp.org/rec/journals/csl/LiN14
Ming Li, Shrikanth S. Narayanan:
Simplified supervised i-vector modeling with application to robust and efficient language identification and speaker verification. Comput. Speech Lang. 28(4): 940-958 (2014)
[c33]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiL14
Ming Li, Xin Li:
Verification based ECG biometrics with cardiac irregular conditions using heartbeat level and segment level information fusion. ICASSP 2014: 3769-3773
[c32]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ShivakumarLDN14
Prashanth Gurunath Shivakumar, Ming Li, Vedant Dhandhania, Shrikanth S. Narayanan:
Simplified and supervised i-vector modeling for speaker age regression. ICASSP 2014: 4833-4837
[c31]
  persistent URL:
  - https://dblp.org/rec/conf/iih-msp/Song0014
Liming Song, Ming Li, Yonghong Yan:
Melody Extraction for Vocal Polyphonic Music Based on Bayesian Framework. IIH-MSP 2014: 570-573
[c30]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiL14
Ming Li, Wenbo Liu:
Speaker verification and spoken language identification using a generalized i-vector framework with phonetic tokenizations and tandem features. INTERSPEECH 2014: 1120-1124
[c29]
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/LiuYL14
Wenbo Liu, Zhiding Yu, Ming Li:
An iterative framework for unsupervised learning in the PLDA based speaker verification. ISCSLP 2014: 78-82
2013
[j8]
  persistent URL:
  - https://dblp.org/rec/journals/csl/LiHN13
Ming Li, Kyu Jeong Han, Shrikanth S. Narayanan:
Automatic speaker age and gender recognition using acoustic and prosodic level information fusion. Comput. Speech Lang. 27(1): 151-167 (2013)
[c28]
  persistent URL:
  - https://dblp.org/rec/conf/cis/SongLY13
Liming Song, Ming Li, Yonghong Yan:
Automatic Vocal Segments Detection in Popular Music. CIS 2013: 349-352
[c27]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiTSN13
Ming Li, Andreas Tsiartas, Maarten Van Segbroeck, Shrikanth S. Narayanan:
Speaker verification using simplified and supervised i-vector modeling. ICASSP 2013: 7199-7203
[c26]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BoneCAGTSLLN13
Daniel Bone, Theodora Chaspari, Kartik Audhkhasi, James Gibson, Andreas Tsiartas, Maarten Van Segbroeck, Ming Li, Sungbok Lee, Shrikanth S. Narayanan:
Classifying language-related developmental disorders from speech cues: the promise and the potential confounds. INTERSPEECH 2013: 182-186
[c25]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TsiartasCKGLSPN13
Andreas Tsiartas, Theodora Chaspari, Nassos Katsamanis, Prasanta Kumar Ghosh, Ming Li, Maarten Van Segbroeck, Alexandros Potamianos, Shrikanth S. Narayanan:
Multi-band long-term signal variability features for robust voice activity detection. INTERSPEECH 2013: 718-722
[c24]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HanGLON13
Kyu Jeong Han, Sriram Ganapathy, Ming Li, Mohamed Kamal Omar, Shrikanth S. Narayanan:
TRAP language identification system for RATS phase II evaluation. INTERSPEECH 2013: 1502-1506
[c23]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiKGRN13
Ming Li, Jangwon Kim, Prasanta Kumar Ghosh, Vikram Ramanarayanan, Shrikanth S. Narayanan:
Speaker verification based on fusion of acoustic and articulatory information. INTERSPEECH 2013: 1614-1618
2012
[j7]
  persistent URL:
  - https://dblp.org/rec/journals/cm/MitraELLRTVZANLSS12
Urbashi Mitra, B. Adar Emken, Sangwon Lee, Ming Li, Viktor Rozgic, Gautam Thatte, Harshvardhan Vathsangam, Daphney-Stavroula Zois, Murali Annavaram, Shrikanth S. Narayanan, Marco Levorato, Donna Spruijt-Metz, Gaurav S. Sukhatme:
KNOWME: a case study in wireless body area sensor network design. IEEE Commun. Mag. 50(5): 116-125 (2012)
[j6]
  persistent URL:
  - https://dblp.org/rec/journals/tecs/ThatteLLENMSA12
Gautam Thatte, Ming Li, Sangwon Lee, B. Adar Emken, Shrikanth S. Narayanan, Urbashi Mitra, Donna Spruijt-Metz, Murali Annavaram:
KNOWME: An Energy-Efficient Multimodal Body Area Network for Physical Activity Monitoring. ACM Trans. Embed. Comput. Syst. 11(S2): 48:1-48:24 (2012)
[c22]
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/LiLWN12
Ming Li, Charley Lu, Anne Wang, Shrikanth S. Narayanan:
Speaker verification using Lasso based sparse total variability supervector with PLDA modeling. APSIPA 2012: 1-4
[c21]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiMBN12
Ming Li, Angeliki Metallinou, Daniel Bone, Shrikanth S. Narayanan:
Speaker states recognition using latent factor analysis based Eigenchannel factor vector modeling. ICASSP 2012: 1937-1940
[c20]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/AudhkhasiMLN12
Kartik Audhkhasi, Angeliki Metallinou, Ming Li, Shrikanth S. Narayanan:
Speaker Personality Classification Using Systems Based on Acoustic-Lexical Cues and an Optimal Tree-Structured Bayesian Network. INTERSPEECH 2012: 262-265
[c19]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KimKTLN12
Jangwon Kim, Naveen Kumar, Andreas Tsiartas, Ming Li, Shrikanth S. Narayanan:
Intelligibility classification of pathological speech using fusion of multiple high level descriptors. INTERSPEECH 2012: 534-537
2011
[j5]
  persistent URL:
  - https://dblp.org/rec/journals/tsp/ThatteLLEANSM11
Gautam Thatte, Ming Li, Sangwon Lee, B. Adar Emken, Murali Annavaram, Shrikanth S. Narayanan, Donna Spruijt-Metz, Urbashi Mitra:
Optimal Time-Resource Allocation for Energy-Efficient Physical Activity Detection. IEEE Trans. Signal Process. 59(4): 1843-1857 (2011)
[c18]
  persistent URL:
  - https://dblp.org/rec/conf/embc/Kim0LMESAN11
Samuel Kim, Ming Li, Sangwon Lee, Urbashi Mitra, B. Adar Emken, Donna Spruijt-Metz, Murali Annavaram, Shrikanth S. Narayanan:
Modeling high-level descriptions of real-life physical activities using latent topic modeling of multimodal sensor signals. EMBC 2011: 6033-6036
[c17]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiN11
Ming Li, Shrikanth S. Narayanan:
Robust talking face video verification using joint factor analysis and sparse representation on GMM mean shifted supervectors. ICASSP 2011: 1481-1484
[c16]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiZYN11
Ming Li, Xiang Zhang, Yonghong Yan, Shrikanth S. Narayanan:
Speaker Verification Using Sparse Representations on Total Variability i-vectors. INTERSPEECH 2011: 2729-2732
[c15]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BoneBLMLN11
Daniel Bone, Matthew Black, Ming Li, Angeliki Metallinou, Sungbok Lee, Shrikanth S. Narayanan:
Intoxicated Speech Detection by Fusion of Speaker Normalized Hierarchical Features and GMM Supervectors. INTERSPEECH 2011: 3217-3220
2010
[c14]
  persistent URL:
  - https://dblp.org/rec/conf/icpr/LiN10
Ming Li, Shrikanth S. Narayanan:
Robust ECG Biometrics by Fusing Temporal and Cepstral Information. ICPR 2010: 1326-1329
[c13]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiJH10
Ming Li, Chi-Sang Jung, Kyu Jeong Han:
Combining five acoustic level modeling methods for automatic speaker age and gender recognition. INTERSPEECH 2010: 2826-2829

2000 – 2009

2009
[j4]
  persistent URL:
  - https://dblp.org/rec/journals/ieicet/CaoLWSLY09
Chuan Cao, Ming Li, Xiao Wu, Hongbin Suo, Jian Liu, Yonghong Yan:
Automatic Singing Performance Evaluation for Untrained Singers. IEICE Trans. Inf. Syst. 92-D(8): 1596-1600 (2009)
[c12]
  persistent URL:
  - https://dblp.org/rec/conf/bodynets/ThatteR0GMNAS09
Gautam Thatte, Viktor Rozgic, Ming Li, Sabyasachi Ghosh, Urbashi Mitra, Shrikanth S. Narayanan, Murali Annavaram, Donna Spruijt-Metz:
Optimal time-resource allocation for activity-detection via multimodal sensing. BODYNETS 2009: 14
[c11]
  persistent URL:
  - https://dblp.org/rec/conf/dcoss/ThatteRLGMNAS09
Gautam Thatte, Viktor Rozgic, Ming Li, Sabyasachi Ghosh, Urbashi Mitra, Shrikanth S. Narayanan, Murali Annavaram, Donna Spruijt-Metz:
Optimal Allocation of Time-Resources for Multihypothesis Activity-Level Detection. DCOSS 2009: 273-286
2008
[j3]
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/SuoLLY08
Hongbin Suo, Ming Li, Ping Lu, Yonghong Yan:
Using SVM as Back-End Classifier for Language Identification. EURASIP J. Audio Speech Music. Process. 2008 (2008)
[j2]
  persistent URL:
  - https://dblp.org/rec/journals/ieicet/SuoLLY08
Hongbin Suo, Ming Li, Ping Lu, Yonghong Yan:
Automatic Language Identification with Discriminative Language Characterization Based on SVM. IEICE Trans. Inf. Syst. 91-D(3): 567-575 (2008)
[j1]
  persistent URL:
  - https://dblp.org/rec/journals/ieicet/WuLSY08
Xiao Wu, Ming Li, Hongbin Suo, Yonghong Yan:
Melody Track Selection Using Discriminative Language Model. IEICE Trans. Inf. Syst. 91-D(6): 1838-1840 (2008)
[c10]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiCWLFY08
Ming Li, Chuan Cao, Di Wang, Ping Lu, Qiang Fu, Yonghong Yan:
Cochannel speech separation using multi-pitch estimation and model based voiced sequential grouping. INTERSPEECH 2008: 151-154
[c9]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/CaoLLY08
Chuan Cao, Ming Li, Jian Liu, Yonghong Yan:
An objective singing evaluation approach by relating acoustic measurements to perceptual ratings. INTERSPEECH 2008: 2058-2061
2007
[c8]
Hongbin Suo, Ming Li, Tantan Liu, Ping Lu, Yonghong Yan:
The Design of Backend Classifiers in PPRLM System for Language Identification. ICNC (1) 2007: 678-682
[c7]
Ming Li, Yun Lei, Xiang Zhang, Jian Liu, Yonghong Yan:
Authentication and Quality Monitoring based on Audio Watermark for Analog AM Shortwave Broadcasting. IIH-MSP 2007: 263-266
[c6]
Ming Li, Hongbin Suo, Xiao Wu, Ping Lu, Yonghong Yan:
Spoken language identification using score vector modeling and support vector machine. INTERSPEECH 2007: 350-353
[c5]
Chuan Cao, Ming Li, Jian Liu, Yonghong Yan:
Singing Melody Extraction in Polyphonic Music by Harmonic Tracking. ISMIR 2007: 373-374
2006
[c4]
Ming Li, Yun Lei, Jian Liu, Yonghong Yan:
A Novel Audio Watermarking in Wavelet Domain. IIH-MSP 2006: 27-32
[c3]
Ming Li, Jian Liu, Yonghong Yan:
An Efficient and Robust Approach to Audio ID Identification. ISCSLP 2006
[c2]
Xiao Wu, Ming Li, Jian Liu, Jun Yang, Yonghong Yan:
A Top-down Approach to Melody Match in Pitch Contour for Query by Humming. ISCSLP 2006
2000
[c1]
Ming Li, Tiecheng Yu:
Multi-group mixture weight HMM. INTERSPEECH 2000: 290-292

