Md. Sahidullah
2020 – today
- 2024
- [j35]Xuechen Liu, Md. Sahidullah, Kong Aik Lee, Tomi Kinnunen:
Generalizing Speaker Verification for Spoof Awareness in the Embedding Space. IEEE ACM Trans. Audio Speech Lang. Process. 32: 1261-1273 (2024) - [j34]Spandan Dey, Md. Sahidullah, Goutam Saha:
Towards Cross-Corpora Generalization for Low-Resource Spoken Language Identification. IEEE ACM Trans. Audio Speech Lang. Process. 32: 5040-5050 (2024) - [c50]Subhajit Saha, Md. Sahidullah, Swagatam Das:
Exploring Green AI for Audio Deepfake Detection. EUSIPCO 2024: 186-190 - [i72]Xuechen Liu, Md. Sahidullah, Kong Aik Lee, Tomi Kinnunen:
Generalizing Speaker Verification for Spoof Awareness in the Embedding Space. CoRR abs/2401.11156 (2024) - [i71]Vishwanath Pratap Singh, Md. Sahidullah, Tomi Kinnunen:
ChildAugment: Data Augmentation Methods for Zero-Resource Children's Speaker Verification. CoRR abs/2402.15214 (2024) - [i70]Nikhil Raghav, Md. Sahidullah:
Assessing the Robustness of Spectral Clustering for Deep Speaker Diarization. CoRR abs/2403.14286 (2024) - [i69]Subhajit Saha, Md. Sahidullah, Swagatam Das:
Exploring Green AI for Audio Deepfake Detection. CoRR abs/2403.14290 (2024) - [i68]Hye-jin Shim, Md. Sahidullah, Jee-weon Jung, Shinji Watanabe, Tomi Kinnunen:
Beyond Silence: Bias Analysis through Loss and Asymmetric Approach in Audio Anti-Spoofing. CoRR abs/2406.17246 (2024) - [i67]Xin Wang, Héctor Delgado, Hemlata Tak, Jee-weon Jung, Hye-jin Shim, Massimiliano Todisco, Ivan Kukanov, Xuechen Liu, Md. Sahidullah, Tomi Kinnunen, Nicholas W. D. Evans, Kong Aik Lee, Junichi Yamagishi:
ASVspoof 5: Crowdsourced Speech Data, Deepfakes, and Adversarial Attacks at Scale. CoRR abs/2408.08739 (2024) - [i66]Shakeel A. Sheikh, Yacouba Kaloga, Md. Sahidullah, Ina Kodrasi:
Graph Neural Networks for Parkinsons Disease Detection. CoRR abs/2409.07884 (2024) - [i65]Nikhil Raghav, Subhajit Saha, Md. Sahidullah, Swagatam Das:
TCG CREST System Description for the Second DISPLACE Challenge. CoRR abs/2409.15356 (2024) - [i64]Nikhil Raghav, Avisek Gupta, Md. Sahidullah, Swagatam Das:
Self-Tuning Spectral Clustering for Speaker Diarization. CoRR abs/2410.00023 (2024) - 2023
- [j33]Spandan Dey, Md. Sahidullah, Goutam Saha:
Cross-corpora spoken language identification with domain diversification and generalization. Comput. Speech Lang. 81: 101489 (2023) - [j32]Shakeel A. Sheikh, Md. Sahidullah, Fabrice Hirsch, Slim Ouni:
Stuttering detection using speaker representations and self-supervised contextual embeddings. Int. J. Speech Technol. 26(on): 521-530 (2023) - [j31]Mohammad Mobarak Hossain, Mohammod Abul Kashem, Md. Monirul Islam, Md. Sahidullah, Sumona Hoque Mumu, Jia Uddin, Daniel Gavilanes Aray, Isabel de la Torre Díez, Imran Ashraf, Md Abdus Samad:
Internet of Things in Pregnancy Care Coordination and Management: A Systematic Review. Sensors 23(23): 9367 (2023) - [j30]Premjeet Singh, Md. Sahidullah, Goutam Saha:
Modulation spectral features for speech emotion recognition using deep neural networks. Speech Commun. 146: 53-69 (2023) - [j29]Xuechen Liu, Xin Wang, Md. Sahidullah, Jose Patino, Héctor Delgado, Tomi Kinnunen, Massimiliano Todisco, Junichi Yamagishi, Nicholas W. D. Evans, Andreas Nautsch, Kong Aik Lee:
ASVspoof 2021: Towards Spoofed and Deepfake Speech Detection in the Wild. IEEE ACM Trans. Audio Speech Lang. Process. 31: 2507-2522 (2023) - [j28]Shakeel A. Sheikh, Md. Sahidullah, Fabrice Hirsch, Slim Ouni:
Advancing Stuttering Detection via Data Augmentation, Class-Balanced Loss and Multi-Contextual Deep Learning. IEEE J. Biomed. Health Informatics 27(5): 2553-2564 (2023) - [c49]Hye-jin Shim, Rosa González Hautamäki, Md. Sahidullah, Tomi Kinnunen:
How to Construct Perfect and Worse-than-Coin-Flip Spoofing Countermeasures: A Word of Warning on Shortcut Learning. INTERSPEECH 2023: 785-789 - [c48]Vishwanath Pratap Singh, Md. Sahidullah, Tomi Kinnunen:
Speaker Verification Across Ages: Investigating Deep Speaker Embedding Sensitivity to Age Mismatch in Enrollment and Test Speech. INTERSPEECH 2023: 1948-1952 - [c47]Xuechen Liu, Md. Sahidullah, Kong Aik Lee, Tomi Kinnunen:
Speaker-Aware Anti-spoofing. INTERSPEECH 2023: 2498-2502 - [c46]Sung Hwan Mun, Hye-jin Shim, Hemlata Tak, Xin Wang, Xuechen Liu, Md. Sahidullah, Myeonghun Jeong, Min Hyun Han, Massimiliano Todisco, Kong Aik Lee, Junichi Yamagishi, Nicholas W. D. Evans, Tomi Kinnunen, Nam Soo Kim, Jee-weon Jung:
Towards Single Integrated Spoofing-aware Speaker Verification Embeddings. INTERSPEECH 2023: 3989-3993 - [i63]Premjeet Singh, Md. Sahidullah, Goutam Saha:
Modulation spectral features for speech emotion recognition using deep neural networks. CoRR abs/2301.05868 (2023) - [i62]Spandan Dey, Md. Sahidullah, Goutam Saha:
Cross-Corpora Spoken Language Identification with Domain Diversification and Generalization. CoRR abs/2302.05110 (2023) - [i61]Shakeel A. Sheikh, Md. Sahidullah, Fabrice Hirsch, Slim Ouni:
Advancing Stuttering Detection via Data Augmentation, Class-Balanced Loss and Multi-Contextual Deep Learning. CoRR abs/2302.11343 (2023) - [i60]Xuechen Liu, Md. Sahidullah, Tomi Kinnunen:
Distilling Multi-Level X-vector Knowledge for Small-footprint Speaker Verification. CoRR abs/2303.01125 (2023) - [i59]Xuechen Liu, Md. Sahidullah, Kong Aik Lee, Tomi Kinnunen:
Speaker-Aware Anti-Spoofing. CoRR abs/2303.01126 (2023) - [i58]Sung Hwan Mun, Hye-jin Shim, Hemlata Tak, Xin Wang, Xuechen Liu, Md. Sahidullah, Myeonghun Jeong, Min Hyun Han, Massimiliano Todisco, Kong Aik Lee, Junichi Yamagishi, Nicholas W. D. Evans, Tomi Kinnunen, Nam Soo Kim, Jee-weon Jung:
Towards single integrated spoofing-aware speaker verification embeddings. CoRR abs/2305.19051 (2023) - [i57]Hye-jin Shim, Rosa González Hautamäki, Md. Sahidullah, Tomi Kinnunen:
How to Construct Perfect and Worse-than-Coin-Flip Spoofing Countermeasures: A Word of Warning on Shortcut Learning. CoRR abs/2306.00044 (2023) - [i56]Shakeel A. Sheikh, Md. Sahidullah, Fabrice Hirsch, Slim Ouni:
Stuttering Detection Using Speaker Representations and Self-supervised Contextual Embeddings. CoRR abs/2306.00689 (2023) - [i55]Vishwanath Pratap Singh, Md. Sahidullah, Tomi Kinnunen:
Speaker Verification Across Ages: Investigating Deep Speaker Embedding Sensitivity to Age Mismatch in Enrollment and Test Speech. CoRR abs/2306.07501 (2023) - 2022
- [j27]Premjeet Singh, Shefali Waldekar, Md. Sahidullah, Goutam Saha:
Analysis of constant-Q filterbank based representations for speech emotion recognition. Digit. Signal Process. 130: 103712 (2022) - [j26]Shakeel A. Sheikh, Md. Sahidullah, Fabrice Hirsch, Slim Ouni:
Machine learning for stuttering identification: Review, challenges and future directions. Neurocomputing 514: 385-402 (2022) - [j25]A. Kishore Kumar, Shefali Waldekar, Md. Sahidullah, Goutam Kumar Saha:
Robust acoustic domain identification with its application to speaker diarization. Int. J. Speech Technol. 25(4): 933-945 (2022) - [j24]Spandan Dey, Md. Sahidullah, Goutam Saha:
An Overview of Indian Spoken Language Recognition from Machine Learning Perspective. ACM Trans. Asian Low Resour. Lang. Inf. Process. 21(6): 1-45 (2022) - [j23]Brij Mohan Lal Srivastava, Mohamed Maouche, Md. Sahidullah, Emmanuel Vincent, Aurélien Bellet, Marc Tommasi, Natalia A. Tomashenko, Xin Wang, Junichi Yamagishi:
Privacy and Utility of X-Vector Based Speaker Anonymization. IEEE ACM Trans. Audio Speech Lang. Process. 30: 2383-2395 (2022) - [c45]Shakeel A. Sheikh, Md. Sahidullah, Fabrice Hirsch, Slim Ouni:
Robust Stuttering Detection via Multi-task and Adversarial Learning. EUSIPCO 2022: 190-194 - [c44]Xuechen Liu, Md. Sahidullah, Tomi Kinnunen:
Learnable Nonlinear Compression for Robust Speaker Verification. ICASSP 2022: 7962-7966 - [c43]Shakeel A. Sheikh, Md. Sahidullah, Slim Ouni, Fabrice Hirsch:
End-to-End and Self-Supervised Learning for ComParE 2022 Stuttering Sub-Challenge. ACM Multimedia 2022: 7104-7108 - [c42]Xuechen Liu, Md. Sahidullah, Tomi Kinnunen:
Spoofing-Aware Speaker Verification with Unsupervised Domain Adaptation. Odyssey 2022: 85-91 - [c41]Alexey Sholokhov, Xuechen Liu, Md. Sahidullah, Tomi Kinnunen:
Baselines and Protocols for Household Speaker Recognition. Odyssey 2022: 185-192 - [c40]Hye-jin Shim, Hemlata Tak, Xuechen Liu, Hee-Soo Heo, Jee-weon Jung, Joon Son Chung, Soo-Whan Chung, Ha-Jin Yu, Bong-Jin Lee, Massimiliano Todisco, Héctor Delgado, Kong Aik Lee, Md. Sahidullah, Tomi Kinnunen, Nicholas W. D. Evans:
Baseline Systems for the First Spoofing-Aware Speaker Verification Challenge: Score and Embedding Fusion. Odyssey 2022: 330-337 - [i54]Xuechen Liu, Md. Sahidullah, Tomi Kinnunen:
Learnable Nonlinear Compression for Robust Speaker Verification. CoRR abs/2202.05236 (2022) - [i53]Xuechen Liu, Md. Sahidullah, Tomi Kinnunen:
Spoofing-Aware Speaker Verification with Unsupervised Domain Adaptation. CoRR abs/2203.10992 (2022) - [i52]Shakeel Ahmad Sheikh, Md. Sahidullah, Fabrice Hirsch, Slim Ouni:
Introducing ECAPA-TDNN and Wav2Vec2.0 Embeddings to Stuttering Detection. CoRR abs/2204.01564 (2022) - [i51]Shakeel Ahmad Sheikh, Md. Sahidullah, Fabrice Hirsch, Slim Ouni:
Robust Stuttering Detection via Multi-task and Adversarial Learning. CoRR abs/2204.01735 (2022) - [i50]Hye-jin Shim, Hemlata Tak, Xuechen Liu, Hee-Soo Heo, Jee-weon Jung, Joon Son Chung, Soo-Whan Chung, Ha-Jin Yu, Bong-Jin Lee, Massimiliano Todisco, Héctor Delgado, Kong Aik Lee, Md. Sahidullah, Tomi Kinnunen, Nicholas W. D. Evans:
Baseline Systems for the First Spoofing-Aware Speaker Verification Challenge: Score and Embedding Fusion. CoRR abs/2204.09976 (2022) - [i49]Alexey Sholokhov, Xuechen Liu, Md. Sahidullah, Tomi Kinnunen:
Baselines and Protocols for Household Speaker Recognition. CoRR abs/2205.00288 (2022) - [i48]Shakeel Ahmad Sheikh, Md. Sahidullah, Fabrice Hirsch, Slim Ouni:
End-to-End and Self-Supervised Learning for ComParE 2022 Stuttering Sub-Challenge. CoRR abs/2207.10817 (2022) - [i47]A. Kishore Kumar, Shefali Waldekar, Md. Sahidullah, Goutam Saha:
Robust Acoustic Domain Identification with its Application to Speaker Diarization. CoRR abs/2208.03162 (2022) - [i46]Xuechen Liu, Xin Wang, Md. Sahidullah, Jose Patino, Héctor Delgado, Tomi Kinnunen, Massimiliano Todisco, Junichi Yamagishi, Nicholas W. D. Evans, Andreas Nautsch, Kong Aik Lee:
ASVspoof 2021: Towards Spoofed and Deepfake Speech Detection in the Wild. CoRR abs/2210.02437 (2022) - [i45]Kong Aik Lee, Tomi Kinnunen, Daniele Colibro, Claudio Vair, Andreas Nautsch, Hanwu Sun, Liang He, Tianyu Liang, Qiongqiong Wang, Mickael Rouvier, Pierre-Michel Bousquet, Rohan Kumar Das, Ignacio Viñals Bailo, Meng Liu, Héctor Delgado, Xuechen Liu, Md. Sahidullah, Sandro Cumani, Boning Zhang, Koji Okabe, Hitoshi Yamamoto, Ruijie Tao, Haizhou Li, Alfonso Ortega Giménez, Longbiao Wang, Luis Buera:
I4U System Description for NIST SRE'20 CTS Challenge. CoRR abs/2211.01091 (2022) - [i44]Premjeet Singh, Shefali Waldekar, Md. Sahidullah, Goutam Saha:
Analysis of constant-Q filterbank based representations for speech emotion recognition. CoRR abs/2211.16363 (2022) - [i43]Spandan Dey, Md. Sahidullah, Goutam Saha:
An Overview of Indian Spoken Language Recognition from Machine Learning Perspective. CoRR abs/2212.03812 (2022) - 2021
- [j22]A. Kishore Kumar, Dipjyoti Paul, Monisankha Pal, Md. Sahidullah, Goutam Saha:
Speech frame selection for spoofing detection with an application to partially spoofed audio-data. Int. J. Speech Technol. 24(1): 193-203 (2021) - [j21]Nirmalya Sen, Md. Sahidullah, Hemant A. Patil, Shyamal Kumar Das Mandal, Krothapalli Sreenivasa Rao, Tapan Kumar Basu:
Utterance partitioning for speaker recognition: an experimental review and analysis with new findings under GMM-SVM framework. Int. J. Speech Technol. 24(4): 1067-1088 (2021) - [j20]Xuechen Liu, Md. Sahidullah, Tomi Kinnunen:
Optimizing Multi-Taper Features for Deep Speaker Verification. IEEE Signal Process. Lett. 28: 2187-2191 (2021) - [j19]Andreas Nautsch, Xin Wang, Nicholas W. D. Evans, Tomi H. Kinnunen, Ville Vestman, Massimiliano Todisco, Héctor Delgado, Md. Sahidullah, Junichi Yamagishi, Kong Aik Lee:
ASVspoof 2019: Spoofing Countermeasures for the Detection of Synthesized, Converted and Replayed Speech. IEEE Trans. Biom. Behav. Identity Sci. 3(2): 252-265 (2021) - [c39]Xuechen Liu, Md. Sahidullah, Tomi Kinnunen:
Optimized Power Normalized Cepstral Coefficients Towards Robust Deep Speaker Verification. ASRU 2021: 185-190 - [c38]Xuechen Liu, Md. Sahidullah, Tomi Kinnunen:
Parameterized Channel Normalization for Far-Field Deep Speaker Verification. ASRU 2021: 1132-1138 - [c37]Premjeet Singh, Goutam Saha, Md. Sahidullah:
Deep scattering network for speech emotion recognition. EUSIPCO 2021: 131-135 - [c36]Shakeel A. Sheikh, Md. Sahidullah, Fabrice Hirsch, Slim Ouni:
StutterNet: Stuttering Detection Using Time Delay Neural Network. EUSIPCO 2021: 426-430 - [c35]Spandan Dey, Goutam Saha, Md. Sahidullah:
Cross-Corpora Language Recognition: A Preliminary Investigation with Indian Languages. EUSIPCO 2021: 546-550 - [c34]Raphaël Duroselle, Md. Sahidullah, Denis Jouvet, Irina Illina:
Modeling and Training Strategies for Language Recognition Systems. Interspeech 2021: 1494-1498 - [c33]Bhusan Chettri, Rosa González Hautamäki, Md. Sahidullah, Tomi Kinnunen:
Data Quality as Predictor of Voice Anti-Spoofing Generalization. Interspeech 2021: 1659-1663 - [c32]Raphaël Duroselle, Md. Sahidullah, Denis Jouvet, Irina Illina:
Language Recognition on Unknown Conditions: The LORIA-Inria-MULTISPEECH System for AP20-OLR Challenge. Interspeech 2021: 3256-3260 - [c31]Tomi Kinnunen, Andreas Nautsch, Md. Sahidullah, Nicholas W. D. Evans, Xin Wang, Massimiliano Todisco, Héctor Delgado, Junichi Yamagishi, Kong Aik Lee:
Visualizing Classifier Adjacency Relations: A Case Study in Speaker Verification and Voice Anti-Spoofing. Interspeech 2021: 4299-4303 - [c30]Xuechen Liu, Md. Sahidullah, Tomi Kinnunen:
Learnable MFCCs for Speaker Verification. ISCAS 2021: 1-5 - [c29]Md. Sahidullah, Achintya Kumar Sarkar, Ville Vestman, Xuechen Liu, Romain Serizel, Tomi Kinnunen, Zheng-Hua Tan, Emmanuel Vincent:
UIAI System for Short-Duration Speaker Verification Challenge 2020. SLT 2021: 323-329 - [i42]A. Kishore Kumar, Shefali Waldekar, Goutam Saha, Md. Sahidullah:
Domain-Dependent Speaker Diarization for the Third DIHARD Challenge. CoRR abs/2101.09884 (2021) - [i41]Achintya Kumar Sarkar, Md. Sahidullah, Zheng-Hua Tan:
Data Generation Using Pass-phrase-dependent Deep Auto-encoders for Text-Dependent Speaker Verification. CoRR abs/2102.02074 (2021) - [i40]Premjeet Singh, Goutam Saha, Md. Sahidullah:
Non-linear frequency warping using constant-Q transformation for speech emotion recognition. CoRR abs/2102.04029 (2021) - [i39]Andreas Nautsch, Xin Wang, Nicholas W. D. Evans, Tomi Kinnunen, Ville Vestman, Massimiliano Todisco, Héctor Delgado, Md. Sahidullah, Junichi Yamagishi, Kong Aik Lee:
ASVspoof 2019: spoofing countermeasures for the detection of synthesized, converted and replayed speech. CoRR abs/2102.05889 (2021) - [i38]A. Kishore Kumar, Shefali Waldekar, Goutam Saha, Md. Sahidullah:
ABSP System for The Third DIHARD Challenge. CoRR abs/2102.09939 (2021) - [i37]Xuechen Liu, Md. Sahidullah, Tomi Kinnunen:
Learnable MFCCs for Speaker Verification. CoRR abs/2102.10322 (2021) - [i36]Bhusan Chettri, Rosa González Hautamäki, Md. Sahidullah, Tomi Kinnunen:
Data Quality as Predictor of Voice Anti-Spoofing Generalization. CoRR abs/2103.14602 (2021) - [i35]Spandan Dey, Goutam Saha, Md. Sahidullah:
Cross-Corpora Language Recognition: A Preliminary Investigation with Indian Languages. CoRR abs/2105.04639 (2021) - [i34]Premjeet Singh, Goutam Saha, Md. Sahidullah:
Deep scattering network for speech emotion recognition. CoRR abs/2105.04806 (2021) - [i33]Shakeel A. Sheikh, Md. Sahidullah, Fabrice Hirsch, Slim Ouni:
StutterNet: Stuttering Detection Using Time Delay Neural Network. CoRR abs/2105.05599 (2021) - [i32]Nirmalya Sen, Md. Sahidullah, Hemant A. Patil, Shyamal Kumar Das Mandal, Krothapalli Sreenivasa Rao, Tapan Kumar Basu:
Utterance partitioning for speaker recognition: an experimental review and analysis with new findings under GMM-SVM framework. CoRR abs/2105.11728 (2021) - [i31]Tomi Kinnunen, Andreas Nautsch, Md. Sahidullah, Nicholas W. D. Evans, Xin Wang, Massimiliano Todisco, Héctor Delgado, Junichi Yamagishi, Kong Aik Lee:
Visualizing Classifier Adjacency Relations: A Case Study in Speaker Verification and Voice Anti-Spoofing. CoRR abs/2106.06362 (2021) - [i30]Shakeel Ahmad Sheikh, Md. Sahidullah, Fabrice Hirsch, Slim Ouni:
Machine Learning for Stuttering Identification: Review, Challenges & Future Directions. CoRR abs/2107.04057 (2021) - [i29]Jean-François Bonastre, Héctor Delgado, Nicholas W. D. Evans, Tomi Kinnunen, Kong Aik Lee, Xuechen Liu, Andreas Nautsch, Paul-Gauthier Noé, Jose Patino, Md. Sahidullah, Brij Mohan Lal Srivastava, Massimiliano Todisco, Natalia A. Tomashenko, Emmanuel Vincent, Xin Wang, Junichi Yamagishi:
Benchmarking and challenges in security and privacy for voice biometrics. CoRR abs/2109.00281 (2021) - [i28]Héctor Delgado, Nicholas W. D. Evans, Tomi Kinnunen, Kong Aik Lee, Xuechen Liu, Andreas Nautsch, Jose Patino, Md. Sahidullah, Massimiliano Todisco, Xin Wang, Junichi Yamagishi:
ASVspoof 2021: Automatic Speaker Verification Spoofing and Countermeasures Challenge Evaluation Plan. CoRR abs/2109.00535 (2021) - [i27]Junichi Yamagishi, Xin Wang, Massimiliano Todisco, Md. Sahidullah, Jose Patino, Andreas Nautsch, Xuechen Liu, Kong Aik Lee, Tomi Kinnunen, Nicholas W. D. Evans, Héctor Delgado:
ASVspoof 2021: accelerating progress in spoofed and deepfake speech detection. CoRR abs/2109.00537 (2021) - [i26]Xuechen Liu, Md. Sahidullah, Tomi Kinnunen:
Parameterized Channel Normalization for Far-field Deep Speaker Verification. CoRR abs/2109.12056 (2021) - [i25]Xuechen Liu, Md. Sahidullah, Tomi Kinnunen:
Optimized Power Normalized Cepstral Coefficients towards Robust Deep Speaker Verification. CoRR abs/2109.12058 (2021) - [i24]Xuechen Liu, Md. Sahidullah, Tomi Kinnunen:
Optimizing Multi-Taper Features for Deep Speaker Verification. CoRR abs/2110.10983 (2021) - 2020
- [j18]Ville Vestman, Tomi Kinnunen, Rosa González Hautamäki, Md. Sahidullah:
Voice Mimicry Attacks Assisted by Automatic Speaker Verification. Comput. Speech Lang. 59: 36-54 (2020) - [j17]Xin Wang, Junichi Yamagishi, Massimiliano Todisco, Héctor Delgado, Andreas Nautsch, Nicholas W. D. Evans, Md. Sahidullah, Ville Vestman, Tomi Kinnunen, Kong Aik Lee, Lauri Juvela, Paavo Alku, Yu-Huai Peng, Hsin-Te Hwang, Yu Tsao, Hsin-Min Wang, Sébastien Le Maguer, Markus Becker, Zhen-Hua Ling:
ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech. Comput. Speech Lang. 64: 101114 (2020) - [j16]Susanta Kumar Sarangi, Md. Sahidullah, Goutam Saha:
Optimization of data-driven filterbank for automatic speaker verification. Digit. Signal Process. 104: 102795 (2020) - [j15]Tomi Kinnunen, Héctor Delgado, Nicholas W. D. Evans, Kong Aik Lee, Ville Vestman, Andreas Nautsch, Massimiliano Todisco, Xin Wang, Md. Sahidullah, Junichi Yamagishi, Douglas A. Reynolds:
Tandem Assessment of Spoofing Countermeasures and Automatic Speaker Verification: Fundamentals. IEEE ACM Trans. Audio Speech Lang. Process. 28: 2195-2210 (2020) - [c28]Brij Mohan Lal Srivastava, Nathalie Vauquier, Md. Sahidullah, Aurélien Bellet, Marc Tommasi, Emmanuel Vincent:
Evaluating Voice Conversion-Based Privacy Protection against Informed Attackers. ICASSP 2020: 2802-2806 - [c27]Xuechen Liu, Md. Sahidullah, Tomi Kinnunen:
A Comparative Re-Assessment of Feature Extractors for Deep Speaker Embeddings. INTERSPEECH 2020: 3221-3225 - [i23]Tomi Kinnunen, Héctor Delgado, Nicholas W. D. Evans, Kong Aik Lee, Ville Vestman, Andreas Nautsch, Massimiliano Todisco, Xin Wang, Md. Sahidullah, Junichi Yamagishi, Douglas A. Reynolds:
Tandem Assessment of Spoofing Countermeasures and Automatic Speaker Verification: Fundamentals. CoRR abs/2007.05979 (2020) - [i22]Susanta Kumar Sarangi, Md. Sahidullah, Goutam Saha:
Optimization of data-driven filterbank for automatic speaker verification. CoRR abs/2007.10729 (2020) - [i21]Md. Sahidullah, Achintya Kumar Sarkar, Ville Vestman, Xuechen Liu, Romain Serizel, Tomi Kinnunen, Zheng-Hua Tan, Emmanuel Vincent:
UIAI System for Short-Duration Speaker Verification Challenge 2020. CoRR abs/2007.13118 (2020) - [i20]Xuechen Liu, Md. Sahidullah, Tomi Kinnunen:
A Comparative Re-Assessment of Feature Extractors for Deep Speaker Embeddings. CoRR abs/2007.15283 (2020)
2010 – 2019
- 2019
- [j14]Arnab Poddar, Md. Sahidullah, Goutam Saha:
Quality measures for speaker verification with short utterances. Digit. Signal Process. 88: 66-79 (2019) - [c26]Tomi Kinnunen, Rosa González Hautamäki, Ville Vestman, Md. Sahidullah:
Can We Use Speaker Recognition Technology to Attack Itself? Enhancing Mimicry Attacks Using Automatic Target Speaker Selection. ICASSP 2019: 6146-6150 - [c25]Massimiliano Todisco, Xin Wang, Ville Vestman, Md. Sahidullah, Héctor Delgado, Andreas Nautsch, Junichi Yamagishi, Nicholas W. D. Evans, Tomi H. Kinnunen, Kong Aik Lee:
ASVspoof 2019: Future Horizons in Spoofed and Fake Audio Detection. INTERSPEECH 2019: 1008-1012 - [p1]Md. Sahidullah, Héctor Delgado, Massimiliano Todisco, Tomi Kinnunen, Nicholas W. D. Evans, Junichi Yamagishi, Kong-Aik Lee:
Introduction to Voice Presentation Attack Detection and Recent Advances. Handbook of Biometric Anti-Spoofing, 2nd Ed. 2019: 321-361 - [i19]Md. Sahidullah, Héctor Delgado, Massimiliano Todisco, Tomi Kinnunen, Nicholas W. D. Evans, Junichi Yamagishi, Kong-Aik Lee:
Introduction to Voice Presentation Attack Detection and Recent Advances. CoRR abs/1901.01085 (2019) - [i18]Dipjyoti Paul, Md. Sahidullah, Goutam Saha:
Generalization of Spoofing Countermeasures: a Case Study with ASVspoof 2015 and BTAS 2016 Corpora. CoRR abs/1901.08025 (2019) - [i17]Arnab Poddar, Md. Sahidullah, Goutam Saha:
Quality Measures for Speaker Verification with Short Utterances. CoRR abs/1901.10345 (2019) - [i16]Massimiliano Todisco, Xin Wang, Ville Vestman, Md. Sahidullah, Héctor Delgado, Andreas Nautsch, Junichi Yamagishi, Nicholas W. D. Evans, Tomi Kinnunen, Kong Aik Lee:
ASVspoof 2019: Future Horizons in Spoofed and Fake Audio Detection. CoRR abs/1904.05441 (2019) - [i15]Kong Aik Lee, Ville Hautamäki, Tomi Kinnunen, Hitoshi Yamamoto, Koji Okabe, Ville Vestman, Jing Huang, Guohong Ding, Hanwu Sun, Anthony Larcher, Rohan Kumar Das, Haizhou Li, Mickael Rouvier, Pierre-Michel Bousquet, Wei Rao, Qing Wang, Chunlei Zhang, Fahimeh Bahmaninezhad, Héctor Delgado, Jose Patino, Qiongqiong Wang, Ling Guo, Takafumi Koshinaka, Jiacen Zhang, Koichi Shinoda, Trung Ngo Trong, Md. Sahidullah, Fan Lu, Yun Tang, Ming Tu, Kah Kuan Teh, Tran Huy Dat, Kuruvachan K. George, Ivan Kukanov, Florent Desnous, Jichen Yang, Emre Yilmaz, Longting Xu, Jean-François Bonastre, Chenglin Xu, Zhi Hao Lim, Eng Siong Chng, Shivesh Ranjan, John H. L. Hansen, Massimiliano Todisco, Nicholas W. D. Evans:
I4U Submission to NIST SRE 2018: Leveraging from a Decade of Shared Experiences. CoRR abs/1904.07386 (2019) - [i14]Ville Vestman, Tomi Kinnunen, Rosa González Hautamäki, Md. Sahidullah:
Voice Mimicry Attacks Assisted by Automatic Speaker Verification. CoRR abs/1906.01454 (2019) - [i13]Xin Wang, Junichi Yamagishi, Massimiliano Todisco, Héctor Delgado, Andreas Nautsch, Nicholas W. D. Evans, Md. Sahidullah, Ville Vestman, Tomi Kinnunen, Kong Aik Lee, Lauri Juvela, Paavo Alku, Yu-Huai Peng, Hsin-Te Hwang, Yu Tsao, Hsin-Min Wang, Sébastien Le Maguer, Markus Becker, Fergus Henderson, Rob Clark, Yu Zhang, Quan Wang, Ye Jia, Kai Onuma, Koji Mushika, Takashi Kaneda, Yuan Jiang, Li-Juan Liu, Yi-Chiao Wu, Wen-Chin Huang, Tomoki Toda, Kou Tanaka, Hirokazu Kameoka, Ingmar Steiner, Driss Matrouf, Jean-François Bonastre, Avashna Govender, Srikanth Ronanki, Jing-Xuan Zhang, Zhen-Hua Ling:
The ASVspoof 2019 database. CoRR abs/1911.01601 (2019) - [i12]Md. Sahidullah, Jose Patino, Samuele Cornell, Ruiqing Yin, Sunit Sivasankaran, Hervé Bredin, Pavel Korshunov, Alessio Brutti, Romain Serizel, Emmanuel Vincent, Nicholas W. D. Evans, Sébastien Marcel, Stefano Squartini, Claude Barras:
The Speed Submission to DIHARD II: Contributions & Lessons Learned. CoRR abs/1911.02388 (2019) - [i11]Brij Mohan Lal Srivastava, Nathalie Vauquier, Md. Sahidullah, Aurélien Bellet, Marc Tommasi, Emmanuel Vincent:
Evaluating Voice Conversion-based Privacy Protection against Informed Attackers. CoRR abs/1911.03934 (2019) - 2018
- [j13]Alexey Sholokhov, Md. Sahidullah, Tomi Kinnunen:
Semi-supervised speech activity detection with an application to automatic speaker verification. Comput. Speech Lang. 47: 132-156 (2018) - [j12]Arnab Poddar, Md. Sahidullah, Goutam Saha:
Speaker verification with short utterances: a review of challenges, trends and opportunities. IET Biom. 7(2): 91-101 (2018) - [j11]Arnab Poddar, Md. Sahidullah, Goutam Saha:
Improved i-vector extraction technique for speaker verification with short utterances. Int. J. Speech Technol. 21(3): 473-488 (2018) - [j10]Ville Vestman, Dhananjaya N. Gowda, Md. Sahidullah, Paavo Alku, Tomi Kinnunen:
Speaker recognition from whispered speech: A tutorial survey and an application of time-varying linear prediction. Speech Commun. 99: 62-79 (2018) - [j9]Md. Sahidullah, Dennis Alexander Lehmann Thomsen, Rosa González Hautamäki, Tomi Kinnunen, Zheng-Hua Tan, Robert Parts, Martti Pitkänen:
Robust Voice Liveness Detection and Speaker Verification Using Throat Microphones. IEEE ACM Trans. Audio Speech Lang. Process. 26(1): 44-56 (2018) - [c24]Massimiliano Todisco, Héctor Delgado, Kong-Aik Lee, Md. Sahidullah, Nicholas W. D. Evans, Tomi Kinnunen, Junichi Yamagishi:
Integrated Presentation Attack Detection and Automatic Speaker Verification: Common Features and Gaussian Back-end Fusion. INTERSPEECH 2018: 77-81 - [c23]Héctor Delgado, Massimiliano Todisco, Md. Sahidullah, Nicholas W. D. Evans, Tomi Kinnunen, Kong-Aik Lee, Junichi Yamagishi:
ASVspoof 2017 Version 2.0: meta-data analysis and baseline enhancements. Odyssey 2018: 296-303 - [c22]Tomi Kinnunen, Kong-Aik Lee, Héctor Delgado, Nicholas W. D. Evans, Massimiliano Todisco, Md. Sahidullah, Junichi Yamagishi, Douglas A. Reynolds:
t-DCF: a Detection Cost Function for the Tandem Assessment of Spoofing Countermeasures and Automatic Speaker Verification. Odyssey 2018: 312-319 - [c21]Fuming Fang, Junichi Yamagishi, Isao Echizen, Md. Sahidullah, Tomi Kinnunen:
Transforming acoustic characteristics to deceive playback spoofing countermeasures of speaker verification systems. WIFS 2018: 1-9 - [i10]Tomi Kinnunen, Kong-Aik Lee, Héctor Delgado, Nicholas W. D. Evans, Massimiliano Todisco, Md. Sahidullah, Junichi Yamagishi, Douglas A. Reynolds:
t-DCF: a Detection Cost Function for the Tandem Assessment of Spoofing Countermeasures and Automatic Speaker Verification. CoRR abs/1804.09618 (2018) - [i9]Fuming Fang, Junichi Yamagishi, Isao Echizen, Md. Sahidullah, Tomi Kinnunen:
Transforming acoustic characteristics to deceive playback spoofing countermeasures of speaker verification systems. CoRR abs/1809.04274 (2018) - [i8]Tomi Kinnunen, Rosa González Hautamäki, Ville Vestman, Md. Sahidullah:
Can We Use Speaker Recognition Technology to Attack Itself? Enhancing Mimicry Attacks Using Automatic Target Speaker Selection. CoRR abs/1811.03790 (2018) - [i7]Arnab Poddar, Md. Sahidullah, Goutam Saha:
Novel Quality Metric for Duration Variability Compensation in Speaker Verification using i-Vectors. CoRR abs/1812.00828 (2018) - 2017
- [j8]Zhizheng Wu, Junichi Yamagishi, Tomi Kinnunen, Cemal Hanilçi, Md. Sahidullah, Aleksandr Sizov, Nicholas W. D. Evans, Massimiliano Todisco:
ASVspoof: The Automatic Speaker Verification Spoofing and Countermeasures Challenge. IEEE J. Sel. Top. Signal Process. 11(4): 588-604 (2017) - [j7]Rosa González Hautamäki, Md. Sahidullah, Ville Hautamäki, Tomi Kinnunen:
Acoustical and perceptual study of voice disguise by age modification in speaker verification. Speech Commun. 95: 1-15 (2017) - [c20]Héctor Delgado, Massimiliano Todisco, Nicholas W. D. Evans, Md. Sahidullah, Wei Ming Liu, Federico Alegre, Tomi Kinnunen, Benoit G. B. Fauve:
Impact of Bandwidth and Channel Variation on Presentation Attack Detection for Speaker Verification. BIOSIG 2017: 173-183 - [c19]Arnab Poddar, Md. Sahidullah, Goutam Saha:
Novel Quality Metric for Duration Variability Compensation in Speaker Verification using i-Vectors. ICAPR 2017: 1-6 - [c18]Dipjyoti Paul, Md. Sahidullah, Goutam Saha:
Generalization of spoofing countermeasures: A case study with ASVspoof 2015 and BTAS 2016 corpora. ICASSP 2017: 2047-2051 - [c17]Anssi Kanervisto, Ville Vestman, Md. Sahidullah, Ville Hautamäki, Tomi Kinnunen:
Effects of gender information in text-independent and text-dependent speaker verification. ICASSP 2017: 5360-5364 - [c16]Tomi Kinnunen, Md. Sahidullah, Mauro Falcone, Luca Costantini, Rosa González Hautamäki, Dennis Alexander Lehmann Thomsen, Achintya Kumar Sarkar, Zheng-Hua Tan, Héctor Delgado, Massimiliano Todisco, Nicholas W. D. Evans, Ville Hautamäki, Kong-Aik Lee:
RedDots replayed: A new replay spoofing attack corpus for text-dependent speaker verification research. ICASSP 2017: 5395-5399 - [c15]Tomi Kinnunen, Md. Sahidullah, Héctor Delgado, Massimiliano Todisco, Nicholas W. D. Evans, Junichi Yamagishi, Kong-Aik Lee:
The ASVspoof 2017 Challenge: Assessing the Limits of Replay Spoofing Attack Detection. INTERSPEECH 2017: 2-6 - [c14]Kong-Aik Lee, Ville Hautamäki, Tomi Kinnunen, Anthony Larcher, Chunlei Zhang, Andreas Nautsch, Themos Stafylakis, Gang Liu, Mickaël Rouvier, Wei Rao, Federico Alegre, J. Ma, Man-Wai Mak, Achintya Kumar Sarkar, Héctor Delgado, Rahim Saeidi, Hagai Aronowitz, Aleksandr Sizov, Hanwu Sun, Trung Hieu Nguyen, Guangsen Wang, Bin Ma, Ville Vestman, Md. Sahidullah, M. Halonen, Anssi Kanervisto, Gaël Le Lan, Fahimeh Bahmaninezhad, Sergey Isadskiy, Christian Rathgeb, Christoph Busch, Georgios Tzimiropoulos, Q. Qian, Z. Wang, Q. Zhao, T. Wang, H. Li, J. Xue, S. Zhu, R. Jin, T. Zhao, Pierre-Michel Bousquet, Moez Ajili, Waad Ben Kheder, Driss Matrouf, Zhi Hao Lim, Chenglin Xu, Haihua Xu, Xiong Xiao, Eng Siong Chng, Benoit G. B. Fauve, Kaavya Sriskandaraja, Vidhyasaharan Sethu, W. W. Lin, Dennis Alexander Lehmann Thomsen, Zheng-Hua Tan, Massimiliano Todisco, Nicholas W. D. Evans, Haizhou Li, John H. L. Hansen, Jean-François Bonastre, Eliathamby Ambikairajah:
The I4U Mega Fusion and Collaboration for NIST Speaker Recognition Evaluation 2016. INTERSPEECH 2017: 1328-1332 - [c13]Ville Vestman, Dhananjaya N. Gowda, Md. Sahidullah, Paavo Alku, Tomi Kinnunen:
Time-Varying Autoregressions for Speaker Verification in Reverberant Conditions. INTERSPEECH 2017: 1512-1516 - [c12]Achintya Kumar Sarkar, Md. Sahidullah, Zheng-Hua Tan, Tomi Kinnunen:
Improving Speaker Verification Performance in Presence of Spoofing Attacks Using Out-of-Domain Spoofed Data. INTERSPEECH 2017: 2611-2615 - [c11]Arnab Poddar, Md. Sahidullah, Goutam Saha:
An Adaptive i-Vector Extraction for Speaker Verification with Short Utterance. PReMI 2017: 326-332 - 2016
- [j6]Nandini Sengupta, Md. Sahidullah, Goutam Saha:
Lung sound classification using cepstral-based statistical features. Comput. Biol. Medicine 75: 118-129 (2016) - [j5]Md. Sahidullah, Tomi Kinnunen:
Local spectral variability features for speaker verification. Digit. Signal Process. 50: 1-11 (2016) - [j4]Cemal Hanilçi, Tomi Kinnunen, Md. Sahidullah, Aleksandr Sizov:
Spoofing detection goes noisy: An analysis of synthetic speech detection in the presence of additive noise. Speech Commun. 85: 83-97 (2016) - [c10]Pavel Korshunov, Sébastien Marcel, Hannah Muckenhirn, André R. Gonçalves, A. G. Souza Mello, Ricardo Paranhos Velloso Violato, Flávio Olmos Simões, Mário Uliani Neto, Marcus de Assis Angeloni, José Augusto Stuchi, Heinrich Dinkel, Nanxin Chen, Yanmin Qian, Dipjyoti Paul, Goutam Saha, Md. Sahidullah:
Overview of BTAS 2016 speaker anti-spoofing competition. BTAS 2016: 1-6 - [c9]Tomi Kinnunen, Md. Sahidullah, Ivan Kukanov, Héctor Delgado, Massimiliano Todisco, Achintya Kumar Sarkar, Nicolai Bæk Thomsen, Ville Hautamäki, Nicholas W. D. Evans, Zheng-Hua Tan:
Utterance Verification for Text-Dependent Speaker Recognition: A Comparative Assessment Using the RedDots Corpus. INTERSPEECH 2016: 430-434 - [c8]Md. Sahidullah, Héctor Delgado, Massimiliano Todisco, Hong Yu, Tomi Kinnunen, Nicholas W. D. Evans, Zheng-Hua Tan:
Integrated Spoofing Countermeasures and Automatic Speaker Verification: An Evaluation on ASVspoof 2015. INTERSPEECH 2016: 1700-1704 - [c7]Md. Sahidullah, Rosa González Hautamäki, Dennis Alexander Lehmann Thomsen, Tomi Kinnunen, Zheng-Hua Tan, Ville Hautamäki, Robert Parts, Martti Pitkänen:
Robust Speaker Recognition with Combined Use of Acoustic and Throat Microphone Speech. INTERSPEECH 2016: 1720-1724 - [c6]Tomi Kinnunen, Alexey Sholokhov, Elie Khoury, Dennis Alexander Lehmann Thomsen, Md. Sahidullah, Zheng-Hua Tan:
HAPPY Team Entry to NIST OpenSAD Challenge: A Fusion of Short-Term Unsupervised and Segment i-Vector Based Speech Activity Detectors. INTERSPEECH 2016: 2992-2996 - [c5]Rosa González Hautamäki, Md. Sahidullah, Tomi Kinnunen, Ville Hautamäki:
Age-Related Voice Disguise and its Impact on Speaker Verification Accuracy. Odyssey 2016: 277-282 - [c4]Héctor Delgado, Massimiliano Todisco, Md. Sahidullah, Achintya Kumar Sarkar, Nicholas W. D. Evans, Tomi Kinnunen, Zheng-Hua Tan:
Further optimisations of constant Q cepstral processing for integrated utterance and text-dependent speaker verification. SLT 2016: 179-185 - [i6]Cemal Hanilçi, Tomi Kinnunen, Md. Sahidullah, Aleksandr Sizov:
Spoofing Detection Goes Noisy: An Analysis of Synthetic Speech Detection in the Presence of Additive Noise. CoRR abs/1603.03947 (2016) - [i5]Monisankha Pal, Dipjyoti Paul, Md. Sahidullah, Goutam Saha:
Robustness of Voice Conversion Techniques Under Mismatched Conditions. CoRR abs/1612.07523 (2016) - 2015
- [c3]Zhizheng Wu, Tomi Kinnunen, Nicholas W. D. Evans, Junichi Yamagishi, Cemal Hanilçi, Md. Sahidullah, Aleksandr Sizov:
ASVspoof 2015: the first automatic speaker verification spoofing and countermeasures challenge. INTERSPEECH 2015: 2037-2041 - [c2]Cemal Hanilçi, Tomi Kinnunen, Md. Sahidullah, Aleksandr Sizov:
Classifiers for synthetic speech detection: a comparison. INTERSPEECH 2015: 2057-2061 - [c1]Md. Sahidullah, Tomi Kinnunen, Cemal Hanilçi:
A comparison of features for synthetic speech detection. INTERSPEECH 2015: 2087-2091 - 2013
- [j3]Md. Sahidullah, Goutam Saha:
A Novel Windowing Technique for Efficient Computation of MFCC for Speaker Recognition. IEEE Signal Process. Lett. 20(2): 149-152 (2013) - 2012
- [j2]Md. Sahidullah, Goutam Saha:
Design, analysis and experimental evaluation of block based transformation in MFCC computation for speaker recognition. Speech Commun. 54(4): 543-565 (2012) - [i4]Md. Sahidullah, Goutam Saha:
A Novel Windowing Technique for Efficient Computation of MFCC for Speaker Recognition. CoRR abs/1206.2437 (2012) - [i3]Md. Sahidullah, Goutam Saha:
Comparison of Speech Activity Detection Techniques for Speaker Recognition. CoRR abs/1210.0297 (2012) - 2011
- [i2]Md. Sahidullah, Goutam Saha:
In Search of Autocorrelation Based Vocal Cord Cues for Speaker Identification. CoRR abs/1105.2095 (2011) - [i1]Md. Sahidullah, Sandipan Chakroborty, Goutam Saha:
Improving Performance of Speaker Identification System Using Complementary Information Fusion. CoRR abs/1105.2770 (2011) - 2010
- [j1]Md. Sahidullah, Sandipan Chakroborty, Goutam Saha:
On the use of perceptual Line Spectral pairs Frequencies and higher-order residual moments for Speaker Identification. Int. J. Biom. 2(4): 358-378 (2010)
last updated on 2024-12-23 20:33 CET by the dblp team
all metadata released as open data under CC0 1.0 license