Najim Dehak
Person information
- affiliation: MIT, Cambridge, USA
2020 – today
- 2024
- [j28]Deming Li, Ankur A. Butala, Laureano Moro-Velázquez, Trevor Meyer, Esther S. Oh, Chelsey Motley, Jesús Villalba, Najim Dehak:
Automating the analysis of eye movement for different neurodegenerative disorders. Comput. Biol. Medicine 170: 107951 (2024) - [j27]Saurabh Kataria, Jesús Villalba, Laureano Moro-Velázquez, Piotr Zelasko, Najim Dehak:
Time-Domain Speech Super-Resolution With GAN Based Modeling for Telephony Speaker Verification. IEEE ACM Trans. Audio Speech Lang. Process. 32: 1736-1749 (2024) - [j26]Magdalena Rybicka, Jesús Villalba, Thomas Thebaud, Najim Dehak, Konrad Kowalczyk:
End-to-End Neural Speaker Diarization With Non-Autoregressive Attractors. IEEE ACM Trans. Audio Speech Lang. Process. 32: 3960-3973 (2024) - [j25]Saurabhchand Bhati, Jesús Villalba, Piotr Zelasko, Laureano Moro-Velázquez, Najim Dehak:
Slowness Regularized Contrastive Predictive Coding for Acoustic Unit Discovery. IEEE ACM Trans. Audio Speech Lang. Process. 32: 4277-4287 (2024) - [c137]Maliha Jahan, Helin Wang, Thomas Thebaud, Yinglun Sun, Giang Ha Le, Zsuzsanna Fagyal, Odette Scharenborg, Mark Hasegawa-Johnson, Laureano Moro-Velázquez, Najim Dehak:
Finding Spoken Identifications: Using GPT-4 Annotation for an Efficient and Fast Dataset Creation Pipeline. LREC/COLING 2024: 7296-7306 - [c136]Jiarui Hai, Helin Wang, Dongchao Yang, Karan Thakkar, Najim Dehak, Mounya Elhilali:
DPM-TSE: A Diffusion Probabilistic Model for Target Sound Extraction. ICASSP 2024: 1196-1200 - [c135]Sonal Joshi, Thomas Thebaud, Jesús Villalba, Najim Dehak:
Unraveling Adversarial Examples against Speaker Identification - Techniques for Attack Detection and Victim Model Classification. Odyssey 2024: 165-171 - [c134]Anna Favaro, Najim Dehak, Thomas Thebaud, Jesús Villalba, Esther S. Oh, Laureano Moro-Velázquez:
Discovering Invariant Patterns of Cognitive Decline Via an Automated Analysis of the Cookie Thief Picture Description Task. Odyssey 2024: 201-208 - [c133]Lucas Goncalves, Ali N. Salman, Abinay Reddy Naini, Laureano Moro-Velázquez, Thomas Thebaud, Paola García, Najim Dehak, Berrak Sisman, Carlos Busso:
Odyssey 2024 - Speech Emotion Recognition Challenge: Dataset, Baseline Framework, and Results. Odyssey 2024: 247-254 - [e1]Najim Dehak, Patrick Cardinal:
Odyssey 2024: The Speaker and Language Recognition Workshop, Quebec City, Canada, June 18-21, 2024. ISCA 2024 - [i55]Sonal Joshi, Thomas Thebaud, Jesús Villalba, Najim Dehak:
Unraveling Adversarial Examples against Speaker Identification - Techniques for Attack Detection and Victim Model Classification. CoRR abs/2402.19355 (2024) - [i54]Helin Wang, Meng Yu, Jiarui Hai, Chen Chen, Yuchen Hu, Rilin Chen, Najim Dehak, Dong Yu:
SSR-Speech: Towards Stable, Safe and Robust Zero-shot Text-based Speech Editing and Synthesis. CoRR abs/2409.07556 (2024) - [i53]Thomas Thebaud, Anna Favaro, Casey Chen, Gabriel Chávez, Laureano Moro-Velázquez, Ankur A. Butala, Najim Dehak:
Explainable Metrics for the Assessment of Neurodegenerative Diseases through Handwriting Analysis. CoRR abs/2409.08303 (2024) - [i52]Helin Wang, Jiarui Hai, Yen-Ju Lu, Karan Thakkar, Mounya Elhilali, Najim Dehak:
SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer. CoRR abs/2409.08425 (2024) - [i51]Henry Li Xinyuan, Sonal Joshi, Thomas Thebaud, Jesús Villalba, Najim Dehak, Sanjeev Khudanpur:
Clean Label Attacks against SLU Systems. CoRR abs/2409.08985 (2024)
- 2023
- [j24]Anna Favaro, Yi-Ting Tsai, Ankur A. Butala, Thomas Thebaud, Jesús Villalba, Najim Dehak, Laureano Moro-Velázquez:
Interpretable speech features vs. DNN embeddings: What to use in the automatic assessment of Parkinson's disease in multi-lingual scenarios. Comput. Biol. Medicine 166: 107559 (2023) - [c132]Maliha Jahan, Laureano Moro-Velázquez, Thomas Thebaud, Najim Dehak, Jesús Villalba:
Model-Based Fairness Metric for Speaker Verification. ASRU 2023: 1-7 - [c131]Martin Sustek, Sonal Joshi, Henry Li, Thomas Thebaud, Jesús Villalba, Sanjeev Khudanpur, Najim Dehak:
Joint Energy-Based Model for Robust Speech Classification System Against Dirty-Label Backdoor Poisoning Attacks. ASRU 2023: 1-8 - [c130]Thomas Thebaud, Sonal Joshi, Henry Li, Martin Sustek, Jesús Villalba, Sanjeev Khudanpur, Najim Dehak:
Clustering Unsupervised Representations as Defense Against Poisoning Attacks on Speech Commands Classification System. ASRU 2023: 1-8 - [c129]Saurabhchand Bhati, Jesús Villalba, Laureano Moro-Velázquez, Thomas Thebaud, Najim Dehak:
Segmental SpeechCLIP: Utilizing Pretrained Image-text Models for Audio-Visual Learning. INTERSPEECH 2023: 431-435 - [c128]Jesús Villalba, Jonas Borgstrom, Maliha Jahan, Saurabh Kataria, Leibny Paola García, Pedro A. Torres-Carrasquillo, Najim Dehak:
Advances in Language Recognition in Low Resource African Languages: The JHU-MIT Submission for NIST LRE22. INTERSPEECH 2023: 521-525 - [c127]Helin Wang, Thomas Thebaud, Jesús Villalba, Myra Sydnor, Becky Lammers, Najim Dehak, Laureano Moro-Velázquez:
DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion Probabilistic Model. INTERSPEECH 2023: 1548-1552 - [c126]Anna Favaro, Tianyu Cao, Thomas Thebaud, Jesús Villalba, Ankur A. Butala, Najim Dehak, Laureano Moro-Velázquez:
Do Phonatory Features Display Robustness to Characterize Parkinsonian Speech Across Corpora? INTERSPEECH 2023: 2388-2392 - [c125]Saurabh Kataria, Jesús Villalba, Laureano Moro-Velázquez, Thomas Thebaud, Najim Dehak:
Self-FiLM: Conditioning GANs with self-supervised representations for bandwidth extension based speaker recognition. INTERSPEECH 2023: 4688-4692 - [i50]Martin Sustek, Samik Sadhu, Lukás Burget, Hynek Hermansky, Jesús Villalba, Laureano Moro-Velázquez, Najim Dehak:
Stabilized training of joint energy-based models and their practical applications. CoRR abs/2303.04187 (2023) - [i49]Saurabhchand Bhati, Jesús Villalba, Laureano Moro-Velázquez, Thomas Thebaud, Najim Dehak:
Leveraging Pretrained Image-text Models for Improving Audio-Visual Learning. CoRR abs/2309.04628 (2023) - [i48]Jiarui Hai, Helin Wang, Dongchao Yang, Karan Thakkar, Najim Dehak, Mounya Elhilali:
DPM-TSE: A Diffusion Probabilistic Model for Target Sound Extraction. CoRR abs/2310.04567 (2023) - [i47]Trevor Meyer, Camden Shultz, Najim Dehak, Laureano Moro-Velázquez, Pedro P. Irazoqui:
Time Scale Network: A Shallow Neural Network For Time Series Data. CoRR abs/2311.06170 (2023)
- 2022
- [j23]Piotr Zelasko, Siyuan Feng, Laureano Moro-Velázquez, Ali Abavisani, Saurabhchand Bhati, Odette Scharenborg, Mark Hasegawa-Johnson, Najim Dehak:
Discovering phonetic inventories with crosslingual automatic speech recognition. Comput. Speech Lang. 74: 101358 (2022) - [j22]Jaejin Cho, Jesús Villalba, Laureano Moro-Velázquez, Najim Dehak:
Non-Contrastive Self-Supervised Learning for Utterance-Level Information Extraction From Speech. IEEE J. Sel. Top. Signal Process. 16(6): 1284-1295 (2022) - [j21]Saurabhchand Bhati, Jesús Villalba, Piotr Zelasko, Laureano Moro-Velázquez, Najim Dehak:
Unsupervised Speech Segmentation and Variable Rate Representation Learning Using Segmental Contrastive Predictive Coding. IEEE ACM Trans. Audio Speech Lang. Process. 30: 2002-2014 (2022) - [c124]Saurabh Kataria, Jesús Villalba, Laureano Moro-Velázquez, Najim Dehak:
Joint domain adaptation and speech bandwidth extension using time-domain GANs for speaker verification. INTERSPEECH 2022: 615-619 - [c123]Jaejin Cho, Raghavendra Pappagari, Piotr Zelasko, Laureano Moro-Velázquez, Jesús Villalba, Najim Dehak:
Non-contrastive self-supervised learning of utterance-level speech representations. INTERSPEECH 2022: 4028-4032 - [c122]Sonal Joshi, Saurabh Kataria, Yiwen Shao, Piotr Zelasko, Jesús Villalba, Sanjeev Khudanpur, Najim Dehak:
Defense against Adversarial Attacks on Hybrid Speech Recognition System using Adversarial Fine-tuning with Denoiser. INTERSPEECH 2022: 5035-5039 - [c121]Yiwen Shao, Jesús Villalba, Sonal Joshi, Saurabh Kataria, Sanjeev Khudanpur, Najim Dehak:
Chunking Defense for Adversarial Attacks on ASR. INTERSPEECH 2022: 5045-5049 - [c120]Sonal Joshi, Saurabh Kataria, Jesús Villalba, Najim Dehak:
AdvEst: Adversarial Perturbation Estimation to Classify and Detect Adversarial Attacks against Speaker Identification. INTERSPEECH 2022: 5060-5064 - [c119]Magdalena Rybicka, Jesús Villalba, Najim Dehak, Konrad Kowalczyk:
End-to-End Neural Speaker Diarization with an Iterative Refinement of Non-Autoregressive Attention-based Attractors. INTERSPEECH 2022: 5090-5094 - [c118]Jesús Villalba, Bengt J. Borgstrom, Saurabh Kataria, Magdalena Rybicka, Carlos D. Castillo, Jaejin Cho, L. Paola García-Perera, Pedro A. Torres-Carrasquillo, Najim Dehak:
Advances in Cross-Lingual and Cross-Source Audio-Visual Speaker Recognition: The JHU-MIT System for NIST SRE21. Odyssey 2022: 213-220 - [c117]Jesús Villalba, Bengt J. Borgstrom, Saurabh Kataria, Jaejin Cho, Pedro A. Torres-Carrasquillo, Najim Dehak:
Advances in Speaker Recognition for Multilingual Conversational Telephone Speech: The JHU-MIT System for NIST SRE20 CTS Challenge. Odyssey 2022: 338-345 - [c116]Tianyu Cao, Laureano Moro-Velázquez, Piotr Zelasko, Jesús Villalba, Najim Dehak:
Vsameter: Evaluation of a New Open-Source Tool to Measure Vowel Space Area and Related Metrics. SLT 2022: 517-524 - [c115]Anna Favaro, Chelsie Motley, Tianyu Cao, Miguel Iglesias, Ankur A. Butala, Esther S. Oh, Robert D. Stevens, Jesús Villalba, Najim Dehak, Laureano Moro-Velázquez:
A Multi-Modal Array of Interpretable Features to Evaluate Language and Speech Patterns in Different Neurological Disorders. SLT 2022: 532-539 - [c114]Amir Hussein, Shammur Absar Chowdhury, Ahmed Abdelali, Najim Dehak, Ahmed Ali, Sanjeev Khudanpur:
Textual Data Augmentation for Arabic-English Code-Switching Speech Recognition. SLT 2022: 777-784 - [i46]Amir Hussein, Shammur Absar Chowdhury, Ahmed Abdelali, Najim Dehak, Ahmed Ali:
Code-Switching Text Augmentation for Multilingual Speech Processing. CoRR abs/2201.02550 (2022) - [i45]Piotr Zelasko, Siyuan Feng, Laureano Moro-Velázquez, Ali Abavisani, Saurabhchand Bhati, Odette Scharenborg, Mark Hasegawa-Johnson, Najim Dehak:
Discovering Phonetic Inventories with Crosslingual Automatic Speech Recognition. CoRR abs/2201.11207 (2022) - [i44]Saurabh Kataria, Jesús Villalba, Laureano Moro-Velázquez, Najim Dehak:
Joint domain adaptation and speech bandwidth extension using time-domain GANs for speaker verification. CoRR abs/2203.16614 (2022) - [i43]Sonal Joshi, Saurabh Kataria, Jesús Villalba, Najim Dehak:
AdvEst: Adversarial Perturbation Estimation to Classify and Detect Adversarial Attacks against Speaker Identification. CoRR abs/2204.03848 (2022) - [i42]Sonal Joshi, Saurabh Kataria, Yiwen Shao, Piotr Zelasko, Jesús Villalba, Sanjeev Khudanpur, Najim Dehak:
Defense against Adversarial Attacks on Hybrid Speech Recognition using Joint Adversarial Fine-tuning with Denoiser. CoRR abs/2204.03851 (2022) - [i41]Jaejin Cho, Raghavendra Pappagari, Piotr Zelasko, Laureano Moro-Velázquez, Jesús Villalba, Najim Dehak:
Non-Contrastive Self-Supervised Learning of Utterance-Level Speech Representations. CoRR abs/2208.05413 (2022) - [i40]Jaejin Cho, Jesús Villalba, Laureano Moro-Velázquez, Najim Dehak:
Non-Contrastive Self-supervised Learning for Utterance-Level Information Extraction from Speech. CoRR abs/2208.05445 (2022)
- 2021
- [j20]Laureano Moro-Velázquez, Jorge Andrés Gómez García, Julián D. Arias-Londoño, Najim Dehak, Juan Ignacio Godino-Llorente:
Advances in Parkinson's Disease detection and assessment using voice and speech: A review of the articulatory and phonatory aspects. Biomed. Signal Process. Control. 66: 102418 (2021) - [j19]Nanxin Chen, Shinji Watanabe, Jesús Villalba, Piotr Zelasko, Najim Dehak:
Non-Autoregressive Transformer for Speech Recognition. IEEE Signal Process. Lett. 28: 121-125 (2021) - [j18]Piotr Zelasko, Raghavendra Pappagari, Najim Dehak:
What Helps Transformers Recognize Conversational Structure? Importance of Context, Punctuation, and Labels in Dialog Act Recognition. Trans. Assoc. Comput. Linguistics 9: 1163-1179 (2021) - [j17]Sonal Joshi, Jesús Villalba, Piotr Zelasko, Laureano Moro-Velázquez, Najim Dehak:
Study of Pre-Processing Defenses Against Adversarial Attacks on State-of-the-Art Speaker Recognition Systems. IEEE Trans. Inf. Forensics Secur. 16: 4811-4826 (2021) - [c113]Raghavendra Pappagari, Piotr Zelasko, Jesús Villalba, Laureano Moro-Velázquez, Najim Dehak:
Beyond Isolated Utterances: Conversational Emotion Recognition. ASRU 2021: 39-46 - [c112]Raghavendra Pappagari, Piotr Zelasko, Agnieszka Mikolajczyk, Piotr Pezik, Najim Dehak:
Joint Prediction of Truecasing and Punctuation for Conversational Speech in Low-Resource Scenarios. ASRU 2021: 1185-1191 - [c111]Laureano Moro-Velázquez, Jorge Gómez-García, Najim Dehak, Juan Ignacio Godino-Llorente:
New tools for the differential evaluation of Parkinson's disease using voice and speech processing. IberSPEECH 2021 - [c110]Nanxin Chen, Piotr Zelasko, Jesús Villalba, Najim Dehak:
Focus on the Present: A Regularization Method for the ASR Source-Target Attention Layer. ICASSP 2021: 5994-5998 - [c109]Raghavendra Pappagari, Jesús Villalba, Piotr Zelasko, Laureano Moro-Velázquez, Najim Dehak:
CopyPaste: An Augmentation Method for Speech Emotion Recognition. ICASSP 2021: 6324-6328 - [c108]Jaejin Cho, Piotr Zelasko, Jesús Villalba, Najim Dehak:
Improving Reconstruction Loss Based Speaker Embedding in Unsupervised and Semi-Supervised Scenarios. ICASSP 2021: 6733-6737 - [c107]Saurabh Kataria, Jesús Villalba, Najim Dehak:
Perceptual Loss Based Speech Denoising with an Ensemble of Audio Pattern Recognition and Self-Supervised Models. ICASSP 2021: 7118-7122 - [c106]Siyuan Feng, Piotr Zelasko, Laureano Moro-Velázquez, Ali Abavisani, Mark Hasegawa-Johnson, Odette Scharenborg, Najim Dehak:
How Phonotactics Affect Multilingual and Zero-Shot ASR Performance. ICASSP 2021: 7238-7242 - [c105]Liming Wang, Xinsheng Wang, Mark Hasegawa-Johnson, Odette Scharenborg, Najim Dehak:
Align or attend? Toward More Efficient and Accurate Spoken Word Discovery Using Speech-to-Image Retrieval. ICASSP 2021: 7603-7607 - [c104]Saurabhchand Bhati, Jesús Villalba, Piotr Zelasko, Laureano Moro-Velázquez, Najim Dehak:
Segmental Contrastive Predictive Coding for Unsupervised Word Segmentation. Interspeech 2021: 366-370 - [c103]Magdalena Rybicka, Jesús Villalba, Piotr Zelasko, Najim Dehak, Konrad Kowalczyk:
Spine2Net: SpineNet with Res2Net and Time-Squeeze-and-Excitation Blocks for Speaker Recognition. Interspeech 2021: 496-500 - [c102]Saurabh Kataria, Jesús Villalba, Piotr Zelasko, Laureano Moro-Velázquez, Najim Dehak:
Deep Feature CycleGANs: Speaker Identity Preserving Non-Parallel Microphone-Telephone Domain Adaptation for Speaker Verification. Interspeech 2021: 1079-1083 - [c101]Nanxin Chen, Yu Zhang, Heiga Zen, Ron J. Weiss, Mohammad Norouzi, Najim Dehak, William Chan:
WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis. Interspeech 2021: 3765-3769 - [c100]Nanxin Chen, Piotr Zelasko, Laureano Moro-Velázquez, Jesús Villalba, Najim Dehak:
Align-Denoise: Single-Pass Non-Autoregressive Speech Recognition. Interspeech 2021: 3770-3774 - [c99]Raghavendra Pappagari, Jaejin Cho, Sonal Joshi, Laureano Moro-Velázquez, Piotr Zelasko, Jesús Villalba, Najim Dehak:
Automatic Detection and Assessment of Alzheimer Disease Using Speech and Language Technologies in Low-Resource Scenarios. Interspeech 2021: 3825-3829 - [c98]Jesús Villalba, Sonal Joshi, Piotr Zelasko, Najim Dehak:
Representation Learning to Classify and Detect Adversarial Attacks Against Speaker and Speech Recognition Systems. Interspeech 2021: 4304-4308 - [c97]Aviad Shtrosberg, Jesús Villalba, Najim Dehak, Azaria Cohen, Bar Ben-Yair:
Invariant Representation Learning for Robust Far-Field Speaker Recognition. SLSP 2021: 97-110 - [i39]Sonal Joshi, Jesús Villalba, Piotr Zelasko, Laureano Moro-Velázquez, Najim Dehak:
Adversarial Attacks and Defenses for Speaker Identification Systems. CoRR abs/2101.08909 (2021) - [i38]Piotr Zelasko, Sonal Joshi, Yiwen Shao, Jesús Villalba, Jan Trmal, Najim Dehak, Sanjeev Khudanpur:
Adversarial Attacks and Defenses for Speech Recognition Systems. CoRR abs/2103.17122 (2021) - [i37]Saurabhchand Bhati, Jesús Villalba, Piotr Zelasko, Laureano Moro-Velázquez, Najim Dehak:
Segmental Contrastive Predictive Coding for Unsupervised Word Segmentation. CoRR abs/2106.02170 (2021) - [i36]Nanxin Chen, Yu Zhang, Heiga Zen, Ron J. Weiss, Mohammad Norouzi, Najim Dehak, William Chan:
WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis. CoRR abs/2106.09660 (2021) - [i35]Piotr Zelasko, Raghavendra Pappagari, Najim Dehak:
What Helps Transformers Recognize Conversational Structure? Importance of Context, Punctuation, and Labels in Dialog Act Recognition. CoRR abs/2107.02294 (2021) - [i34]Raghavendra Pappagari, Piotr Zelasko, Agnieszka Mikolajczyk, Piotr Pezik, Najim Dehak:
Joint prediction of truecasing and punctuation for conversational speech in low-resource scenarios. CoRR abs/2109.06103 (2021) - [i33]Raghavendra Pappagari, Piotr Zelasko, Jesús Villalba, Laureano Moro-Velázquez, Najim Dehak:
Beyond Isolated Utterances: Conversational Emotion Recognition. CoRR abs/2109.06112 (2021) - [i32]Jaejin Cho, Jesús Villalba, Najim Dehak:
The JHU submission to VoxSRC-21: Track 3. CoRR abs/2109.13425 (2021) - [i31]Saurabhchand Bhati, Jesús Villalba, Piotr Zelasko, Laureano Moro-Velázquez, Najim Dehak:
Unsupervised Speech Segmentation and Variable Rate Representation Learning using Segmental Contrastive Predictive Coding. CoRR abs/2110.02345 (2021)
- 2020
- [j16]Zheng-Hua Tan, Achintya Kumar Sarkar, Najim Dehak:
rVAD: An unsupervised segment-based robust voice activity detection method. Comput. Speech Lang. 59: 1-21 (2020) - [j15]Jesús Villalba, Nanxin Chen, David Snyder, Daniel Garcia-Romero, Alan McCree, Gregory Sell, Jonas Borgstrom, Leibny Paola García-Perera, Fred Richardson, Réda Dehak, Pedro A. Torres-Carrasquillo, Najim Dehak:
State-of-the-art speaker recognition with neural network embeddings in NIST SRE18 and Speakers in the Wild evaluations. Comput. Speech Lang. 60 (2020) - [j14]Juan Ignacio Godino-Llorente, Douglas D. O'Shaughnessy, Tan Lee, Najim Dehak, Claudia Manfredi:
Introduction to the Issue on Automatic Assessment of Health Disorders Based on Voice, Speech, and Language Processing. IEEE J. Sel. Top. Signal Process. 14(2): 234-239 (2020) - [j13]Laureano Moro-Velázquez, Estefanía Hernández-García, Jorge Andrés Gómez García, Juan Ignacio Godino-Llorente, Najim Dehak:
Analysis of the Effects of Supraglottal Tract Surgical Procedures in Automatic Speaker Recognition Performance. IEEE ACM Trans. Audio Speech Lang. Process. 28: 798-812 (2020) - [c96]Laureano Moro-Velázquez, Jesús Villalba, Najim Dehak:
Using X-Vectors to Automatically Detect Parkinson's Disease from Speech. ICASSP 2020: 1155-1159 - [c95]Raghavendra Pappagari, Tianzi Wang, Jesús Villalba, Nanxin Chen, Najim Dehak:
X-Vectors Meet Emotions: A Study On Dependencies Between Emotion and Speaker Recognition. ICASSP 2020: 7169-7173 - [c94]Saurabh Kataria, Phani Sankar Nidadavolu, Jesús Villalba, Nanxin Chen, L. Paola García-Perera, Najim Dehak:
Feature Enhancement with Deep Feature Losses for Speaker Verification. ICASSP 2020: 7584-7588 - [c93]Phani Sankar Nidadavolu, Saurabh Kataria, Jesús Villalba, L. Paola García-Perera, Najim Dehak:
Unsupervised Feature Enhancement for Speaker Verification. ICASSP 2020: 7599-7603 - [c92]Raghavendra Pappagari, Jaejin Cho, Laureano Moro-Velázquez, Najim Dehak:
Using State of the Art Speaker Recognition and Natural Language Processing Technologies to Detect Alzheimer's Disease and Assess its Severity. INTERSPEECH 2020: 2177-2181 - [c91]Jaejin Cho, Piotr Zelasko, Jesús Villalba, Shinji Watanabe, Najim Dehak:
Learning Speaker Embedding from Text-to-Speech. INTERSPEECH 2020: 3256-3260 - [c90]Piotr Zelasko, Laureano Moro-Velázquez, Mark Hasegawa-Johnson, Odette Scharenborg, Najim Dehak:
That Sounds Familiar: An Analysis of Phonetic Representations Transfer Across Languages. INTERSPEECH 2020: 3705-3709 - [c89]Jesús Villalba, Yuekai Zhang, Najim Dehak:
x-Vectors Meet Adversarial Attacks: Benchmarking Adversarial Robustness in Speaker Verification. INTERSPEECH 2020: 4233-4237 - [c88]Yuekai Zhang, Ziyan Jiang, Jesús Villalba, Najim Dehak:
Black-Box Attacks on Spoofing Countermeasures Using Transferability of Adversarial Examples. INTERSPEECH 2020: 4238-4242 - [c87]Saurabhchand Bhati, Jesús Villalba, Piotr Zelasko, Najim Dehak:
Self-Expressing Autoencoders for Unsupervised Spoken Term Discovery. INTERSPEECH 2020: 4876-4880 - [c86]Lukasz Augustyniak, Piotr Szymanski, Mikolaj Morzy, Piotr Zelasko, Adrian Szymczak, Jan Mizgajski, Yishay Carmiel, Najim Dehak:
Punctuation Prediction in Spontaneous Conversations: Can We Mitigate ASR Errors with Retrofitted Word Embeddings? INTERSPEECH 2020: 4906-4910 - [c85]Jesús Antonio Villalba López, Daniel Garcia-Romero, Nanxin Chen, Gregory Sell, Jonas Borgstrom, Alan McCree, Leibny Paola García-Perera, Saurabh Kataria, Phani Sankar Nidadavolu, Pedro Torres-Carrasquiilo, Najim Dehak:
Advances in Speaker Recognition for Telephone and Audio-Visual Data: the JHU-MIT Submission for NIST SRE19. Odyssey 2020: 273-280 - [c84]Leibny Paola García-Perera, Jesús Villalba, Hervé Bredin, Jun Du, Diego Castán, Alejandrina Cristià, Latané Bullock, Ling Guo, Koji Okabe, Phani Sankar Nidadavolu, Saurabh Kataria, Sizhu Chen, Léo Galmant, Marvin Lavechin, Lei Sun, Marie-Philippe Gill, Bar Ben-Yair, Sajjad Abdoli, Xin Wang, Wassim Bouaziz, Hadrien Titeux, Emmanuel Dupoux, Kong Aik Lee, Najim Dehak:
Speaker Detection in the Wild: Lessons Learned from JSALT 2019. Odyssey 2020: 415-422 - [c83]Saurabh Kataria, Phani Sankar Nidadavolu, Jesús Villalba, Najim Dehak:
Analysis of Deep Feature Loss Based Enhancement for Speaker Verification. Odyssey 2020: 459-466 - [i30]Saurabh Kataria, Phani Sankar Nidadavolu, Jesús Villalba, Najim Dehak:
Analysis of Deep Feature Loss based Enhancement for Speaker Verification. CoRR abs/2002.00139 (2020) - [i29]Raghavendra Pappagari, Tianzi Wang, Jesús Villalba, Nanxin Chen, Najim Dehak:
x-vectors meet emotions: A study on dependencies between emotion and speaker recognition. CoRR abs/2002.05039 (2020) - [i28]Lukasz Augustyniak, Piotr Szymanski, Mikolaj Morzy, Piotr Zelasko, Adrian Szymczak, Jan Mizgajski, Yishay Carmiel, Najim Dehak:
Punctuation Prediction in Spontaneous Conversations: Can We Mitigate ASR Errors with Retrofitted Word Embeddings? CoRR abs/2004.05985 (2020) - [i27]Piotr Zelasko, Laureano Moro-Velázquez, Mark Hasegawa-Johnson, Odette Scharenborg, Najim Dehak:
That Sounds Familiar: an Analysis of Phonetic Representations Transfer Across Languages. CoRR abs/2005.08118 (2020) - [i26]Phani Sankar Nidadavolu, Saurabh Kataria, L. Paola García-Perera, Jesús Villalba, Najim Dehak:
Single Channel Far Field Feature Enhancement For Speaker Verification In The Wild. CoRR abs/2005.08331 (2020) - [i25]Saurabhchand Bhati, Jesús Villalba, Piotr Zelasko, Najim Dehak:
Self-Expressing Autoencoders for Unsupervised Spoken Term Discovery. CoRR abs/2007.13033 (2020) - [i24]Jaejin Cho, Piotr Zelasko, Jesús Villalba, Shinji Watanabe, Najim Dehak:
Learning Speaker Embedding from Text-to-Speech. CoRR abs/2010.11221 (2020) - [i23]Saurabh Kataria, Jesús Villalba, Najim Dehak:
Perceptual Loss based Speech Denoising with an ensemble of Audio Pattern Recognition and Self-Supervised Models. CoRR abs/2010.11860 (2020) - [i22]Siyuan Feng, Piotr Zelasko, Laureano Moro-Velázquez, Ali Abavisani, Mark Hasegawa-Johnson, Odette Scharenborg, Najim Dehak:
How Phonotactics Affect Multilingual and Zero-shot ASR Performance. CoRR abs/2010.12104 (2020) - [i21]Raghavendra Pappagari, Jesús Villalba, Piotr Zelasko, Laureano Moro-Velázquez, Najim Dehak:
CopyPaste: An Augmentation Method for Speech Emotion Recognition. CoRR abs/2010.14602 (2020) - [i20]Nanxin Chen, Piotr Zelasko, Jesús Villalba, Najim Dehak:
Focus on the present: a regularization method for the ASR source-target attention layer. CoRR abs/2011.01210 (2020)
2010 – 2019
- 2019
- [j12]Laureano Moro-Velázquez, Jorge Andrés Gómez García, Juan Ignacio Godino-Llorente, Jesús Villalba, Jan Rusz, Stefanie Shattuck-Hufnagel, Najim Dehak:
A forced gaussians based methodology for the differential evaluation of Parkinson's Disease by means of speech processing. Biomed. Signal Process. Control. 48: 205-220 (2019) - [c82]Phani Sankar Nidadavolu, Saurabh Kataria, Jesús Villalba, Najim Dehak:
Low-Resource Domain Adaptation for Speaker Recognition Using Cycle-Gans. ASRU 2019: 710-717 - [c81]Raghavendra Pappagari, Piotr Zelasko, Jesús Villalba, Yishay Carmiel, Najim Dehak:
Hierarchical Transformers for Long Document Classification. ASRU 2019: 838-844 - [c80]Saurabhchand Bhati, Chunxi Liu, Jesús Villalba, Jan Trmal, Sanjeev Khudanpur, Najim Dehak:
Bottom-Up Unsupervised Word Discovery via Acoustic Units. GlobalSIP 2019: 1-5 - [c79]Saurabhchand Bhati, Laureano Moro-Velázquez, Jesús Villalba, Najim Dehak:
LSTM Siamese Network for Parkinson's Disease Detection from Speech. GlobalSIP 2019: 1-5 - [c78]Phani Sankar Nidadavolu, Vicente Iglesias, Jesús Villalba, Najim Dehak:
Investigation on Neural Bandwidth Extension of Telephone Speech for Improved Speaker Recognition. ICASSP 2019: 6111-6115 - [c77]Jaejin Cho, Shinji Watanabe, Takaaki Hori, Murali Karthick Baskar, Hirofumi Inaguma, Jesús Villalba, Najim Dehak:
Language Model Integration Based on Memory Control for Sequence to Sequence Speech Recognition. ICASSP 2019: 6191-6195 - [c76]Phani Sankar Nidadavolu, Jesús Villalba, Najim Dehak:
Cycle-GANs for Domain Adaptation of Acoustic Features for Speaker Recognition. ICASSP 2019: 6206-6210 - [c75]Cheng-I Lai, Alberto Abad, Korin Richmond, Junichi Yamagishi, Najim Dehak, Simon King:
Attentive Filtering Networks for Audio Replay Attack Detection. ICASSP 2019: 6316-6320 - [c74]Suwon Shon, Najim Dehak, Douglas A. Reynolds, James R. Glass:
MCE 2018: The 1st Multi-Target Speaker Detection and Identification Challenge Evaluation. INTERSPEECH 2019: 356-360 - [c73]Cheng-I Lai, Nanxin Chen, Jesús Villalba, Najim Dehak:
ASSERT: Anti-Spoofing with Squeeze-Excitation and Residual Networks. INTERSPEECH 2019: 1013-1017 - [c72]Jesús Villalba, Nanxin Chen, David Snyder, Daniel Garcia-Romero, Alan McCree, Gregory Sell, Jonas Borgstrom, Fred Richardson, Suwon Shon, François Grondin, Réda Dehak, Leibny Paola García-Perera, Daniel Povey, Pedro A. Torres-Carrasquillo, Sanjeev Khudanpur, Najim Dehak:
State-of-the-Art Speaker Recognition for Telephone and Video Speech: The JHU-MIT Submission for NIST SRE18. INTERSPEECH 2019: 1488-1492 - [c71]David Snyder, Jesús Villalba, Nanxin Chen, Daniel Povey, Gregory Sell, Najim Dehak, Sanjeev Khudanpur:
The JHU Speaker Recognition System for the VOiCES 2019 Challenge. INTERSPEECH 2019: 2468-2472 - [c70]Saurabhchand Bhati, Shekhar Nayak, K. Sri Rama Murty, Najim Dehak:
Unsupervised Acoustic Segmentation and Clustering Using Siamese Network Embeddings. INTERSPEECH 2019: 2668-2672 - [c69]Nanxin Chen, Jesús Villalba, Najim Dehak:
Tied Mixture of Factor Analyzers Layer to Combine Frame Level Representations in Neural Speaker Embeddings. INTERSPEECH 2019: 2948-2952 - [c68]Laureano Moro-Velázquez, Jaejin Cho, Shinji Watanabe, Mark A. Hasegawa-Johnson, Odette Scharenborg, Heejin Kim, Najim Dehak:
Study of the Performance of Automatic Speech Recognition Systems in Speakers with Parkinson's Disease. INTERSPEECH 2019: 3875-3879 - [c67]Mousmita Sarma, Pegah Ghahremani, Daniel Povey, Nagendra Kumar Goel, Kandarpa Kumar Sarma, Najim Dehak:
Improving Emotion Identification Using Phone Posteriors in Raw Speech Waveform Based DNN. INTERSPEECH 2019: 3925-3929 - [c66]Matthew Wiesner, Adithya Renduchintala, Shinji Watanabe, Chunxi Liu, Najim Dehak, Sanjeev Khudanpur:
Pretraining by Backtranslation for End-to-End ASR in Low-Resource Settings. INTERSPEECH 2019: 4375-4379 - [i19]Cheng-I Lai, Nanxin Chen, Jesús Villalba, Najim Dehak:
ASSERT: Anti-Spoofing with Squeeze-Excitation and Residual neTworks. CoRR abs/1904.01120 (2019) - [i18]Suwon Shon, Najim Dehak, Douglas A. Reynolds, James R. Glass:
MCE 2018: The 1st Multi-target Speaker Detection and Identification Challenge Evaluation. CoRR abs/1904.04240 (2019) - [i17]Mohammed Senoussaoui, Patrick Cardinal, Najim Dehak, Alessandro Lameiras Koerich:
Speaker Sincerity Detection based on Covariance Feature Vectors and Ensemble Methods. CoRR abs/1904.11641 (2019) - [i16]Zheng-Hua Tan, Achintya Kumar Sarkar, Najim Dehak:
rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method. CoRR abs/1906.03588 (2019) - [i15]Raghavendra Pappagari, Piotr Zelasko, Jesús Villalba, Yishay Carmiel, Najim Dehak:
Hierarchical Transformers for Long Document Classification. CoRR abs/1910.10781 (2019) - [i14]Saurabh Kataria, Phani Sankar Nidadavolu, Jesús Villalba, Nanxin Chen, Paola García, Najim Dehak:
Feature Enhancement with Deep Feature Losses for Speaker Verification. CoRR abs/1910.11905 (2019) - [i13]Phani Sankar Nidadavolu, Saurabh Kataria, Jesús Villalba, Najim Dehak:
Low-Resource Domain Adaptation for Speaker Recognition Using Cycle-GANs. CoRR abs/1910.11909 (2019) - [i12]Phani Sankar Nidadavolu, Saurabh Kataria, Jesús Villalba, L. Paola García-Perera, Najim Dehak:
Unsupervised Feature Enhancement for speaker verification. CoRR abs/1910.11915 (2019) - [i11]Nanxin Chen, Shinji Watanabe, Jesús Villalba, Najim Dehak:
Listen and Fill in the Missing Letters: Non-Autoregressive Transformer for Speech Recognition. CoRR abs/1911.04908 (2019) - [i10]Paola García, Jesús Villalba, Hervé Bredin, Jun Du, Diego Castán, Alejandrina Cristià, Latané Bullock, Ling Guo, Koji Okabe, Phani Sankar Nidadavolu, Saurabh Kataria, Sizhu Chen, Léo Galmant, Marvin Lavechin, Lei Sun, Marie-Philippe Gill, Bar Ben-Yair, Sajjad Abdoli, Xin Wang, Wassim Bouaziz, Hadrien Titeux, Emmanuel Dupoux, Kong Aik Lee, Najim Dehak:
Speaker detection in the wild: Lessons learned from JSALT 2019. CoRR abs/1912.00938 (2019)
- 2018
- [j11]Rubén Zazo-Candil, Phani Sankar Nidadavolu, Nanxin Chen, Joaquin Gonzalez-Rodriguez, Najim Dehak:
Age Estimation in Short Speech Utterances Based on LSTM Recurrent Neural Networks. IEEE Access 6: 22524-22530 (2018) - [j10]Laureano Moro-Velázquez, Jorge Andrés Gómez García, Juan Ignacio Godino-Llorente, Jesús Villalba, Juan Rafael Orozco-Arroyave, Najim Dehak:
Analysis of speaker recognition methodologies and the influence of kinetic changes to automatically detect Parkinson's Disease. Appl. Soft Comput. 62: 649-666 (2018) - [j9]Juan Rafael Orozco-Arroyave, Juan Camilo Vásquez-Correa, Jesús Francisco Vargas-Bonilla, Raman Arora, Najim Dehak, Phani S. Nidadavolu, Heidi Christensen, Frank Rudzicz, Maria Yancheva, Hamid R. Chinaei, Alyssa Vann, Nikolai Vogler, Tobias Bocklet, Milos Cernak, Julius Hannink, Elmar Nöth:
NeuroSpeech: An open-source software for Parkinson's speech analysis. Digit. Signal Process. 77: 207-221 (2018) - [j8]Juan Rafael Orozco-Arroyave, Juan Camilo Vásquez-Correa, Jesús Francisco Vargas-Bonilla, Raman Arora, Najim Dehak, Phani S. Nidadavolu, Heidi Christensen, Frank Rudzicz, Maria Yancheva, Hamid R. Chinaei, Alyssa Vann, Nikolai Vogler, Tobias Bocklet, Milos Cernak, Julius Hannink, Elmar Nöth:
NeuroSpeech. SoftwareX 8: 69-70 (2018) - [c65]Laureano Moro-Velázquez, Jorge Andrés Gómez García, Juan Ignacio Godino-Llorente, Jan Rusz, Sabine Skodda, Francisco Grandas, José-Miguel Velazquez, Juan Rafael Orozco-Arroyave, Elmar Nöth, Najim Dehak:
Study of the Automatic Detection of Parkison's Disease Based on Speaker Recognition Technologies and Allophonic Distillation. EMBC 2018: 1404-1407 - [c64]Zili Huang, L. Paola García-Perera, Jesús Villalba, Daniel Povey, Najim Dehak:
JHU Diarization System Description. IberSPEECH 2018: 236-239 - [c63]Nanxin Chen, Jesús Villalba, Yishay Carmiel, Najim Dehak:
Measuring Uncertainty in Deep Regression Models: The Case of Age Estimation from Speech. ICASSP 2018: 4939-4943 - [c62]Matthew Maciejewski, David Snyder, Vimal Manohar, Najim Dehak, Sanjeev Khudanpur:
Characterizing Performance of Speaker Diarization Systems on Far-Field Speech Using Standard Methods. ICASSP 2018: 5244-5248 - [c61]Raghavendra Pappagari, Jesús Villalba, Najim Dehak:
Joint Verification-Identification in end-to-end Multi-Scale CNN Framework for Topic Identification. ICASSP 2018: 6199-6203 - [c60]Nanxin Chen, Jesús Villalba, Najim Dehak:
An Investigation of Non-linear i-vectors for Speaker Verification. INTERSPEECH 2018: 87-91 - [c59]Jaejin Cho, Raghavendra Pappagari, Purva Kulkarni, Jesús Villalba, Yishay Carmiel, Najim Dehak:
Deep Neural Networks for Emotion Recognition Combining Audio and Transcripts. INTERSPEECH 2018: 247-251 - [c58]Pegah Ghahremani, Phani Sankar Nidadavolu, Nanxin Chen, Jesús Villalba, Daniel Povey, Sanjeev Khudanpur, Najim Dehak:
End-to-end Deep Neural Network Age Estimation. INTERSPEECH 2018: 277-281 - [c57]Phani Sankar Nidadavolu, Cheng-I Lai, Jesús Villalba, Najim Dehak:
Investigation on Bandwidth Extension for Speaker Recognition. INTERSPEECH 2018: 1111-1115 - [c56]Odette Scharenborg, Sebastian Tiesmeyer, Mark Hasegawa-Johnson, Najim Dehak:
Visualizing Phoneme Category Adaptation in Deep Neural Networks. INTERSPEECH 2018: 1482-1486 - [c55]Peter Sibbern Frederiksen, Jesús Villalba, Shinji Watanabe, Zheng-Hua Tan, Najim Dehak:
Effectiveness of Single-Channel BLSTM Enhancement for Language Identification. INTERSPEECH 2018: 1823-1827 - [c54]Matthew Wiesner, Chunxi Liu, Lucas Ondel, Craig Harman, Vimal Manohar, Jan Trmal, Zhongqiang Huang, Najim Dehak, Sanjeev Khudanpur:
Automatic Speech Recognition and Topic Identification from Speech for Almost-Zero-Resource Languages. INTERSPEECH 2018: 2052-2056 - [c53]Piotr Zelasko, Piotr Szymanski, Jan Mizgajski, Adrian Szymczak, Yishay Carmiel, Najim Dehak:
Punctuation Prediction Model for Conversational Speech. INTERSPEECH 2018: 2633-2637 - [c52]Gregory Sell, David Snyder, Alan McCree, Daniel Garcia-Romero, Jesús Villalba, Matthew Maciejewski, Vimal Manohar, Najim Dehak, Daniel Povey, Shinji Watanabe, Sanjeev Khudanpur:
Diarization is Hard: Some Experiences and Lessons Learned for the JHU Team in the Inaugural DIHARD Challenge. INTERSPEECH 2018: 2808-2812 - [c51]Mousmita Sarma, Pegah Ghahremani, Daniel Povey, Nagendra Kumar Goel, Kandarpa Kumar Sarma, Najim Dehak:
Emotion Identification from Raw Speech Signals Using DNNs. INTERSPEECH 2018: 3097-3101 - [c50]Fred Richardson, Pedro A. Torres-Carrasquillo, Jonas Borgstrom, Douglas E. Sturim, Youngjune Gwon, Jesús Villalba, Jan Trmal, Nanxin Chen, Réda Dehak, Najim Dehak:
The MIT Lincoln Laboratory / JHU / EPITA-LSE LRE17 System. Odyssey 2018: 54-59 - [c49]Jesús Antonio Villalba López, Niko Brummer, Najim Dehak:
End-to-End versus Embedding Neural Networks for Language Recognition in Mismatched Conditions. Odyssey 2018: 112-119 - [c48]Chunxi Liu, Matthew Wiesner, Shinji Watanabe, Craig Harman, Jan Trmal, Najim Dehak, Sanjeev Khudanpur:
Low-Resource Contextual Topic Identification on Speech. SLT 2018: 656-663 - [c47]Odette Scharenborg, Patrick Ebel, Mark Hasegawa-Johnson, Najim Dehak:
Building an ASR System for Mboshi Using A Cross-Language Definition of Acoustic Units Approach. SLTU 2018: 167-171 - [i9]Matthew Wiesner, Chunxi Liu, Lucas Ondel, Craig Harman, Vimal Manohar, Jan Trmal, Zhongqiang Huang, Sanjeev Khudanpur, Najim Dehak:
The JHU Speech LOREHLT 2017 System: Cross-Language Transfer for Situation-Frame Detection. CoRR abs/1802.08731 (2018) - [i8]Piotr Zelasko, Piotr Szymanski, Jan Mizgajski, Adrian Szymczak, Yishay Carmiel, Najim Dehak:
Punctuation Prediction Model for Conversational Speech. CoRR abs/1807.00543 (2018) - [i7]Chunxi Liu, Matthew Wiesner, Shinji Watanabe, Craig Harman, Jan Trmal, Najim Dehak, Sanjeev Khudanpur:
Low-Resource Contextual Topic Identification on Speech. CoRR abs/1807.06204 (2018) - [i6]Suwon Shon, Najim Dehak, Douglas A. Reynolds, James R. Glass:
MCE 2018: The 1st Multi-target Speaker Detection and Identification Challenge Evaluation (MCE) Plan, Dataset and Baseline System. CoRR abs/1807.06663 (2018) - [i5]Cheng-I Lai, Alberto Abad, Korin Richmond, Junichi Yamagishi, Najim Dehak, Simon King:
Attentive Filtering Networks for Audio Replay Attack Detection. CoRR abs/1810.13048 (2018) - [i4]Jaejin Cho, Shinji Watanabe, Takaaki Hori, Murali Karthick Baskar, Hirofumi Inaguma, Jesús Villalba, Najim Dehak:
Language model integration based on memory control for sequence to sequence speech recognition. CoRR abs/1811.02162 (2018) - [i3]Matthew Wiesner, Adithya Renduchintala, Shinji Watanabe, Chunxi Liu, Najim Dehak, Sanjeev Khudanpur:
Low Resource Multi-modal Data Augmentation for End-to-end ASR. CoRR abs/1812.03919 (2018) - 2017
- [c46]Juan Camilo Vásquez-Correa, Juan Rafael Orozco-Arroyave, Raman Arora, Elmar Nöth, Najim Dehak, Heidi Christensen, Frank Rudzicz, Tobias Bocklet, Milos Cernak, Hamid R. Chinaei, Julius Hannink, Phani Sankar Nidadavolu, Maria Yancheva, Alyssa Vann, Nikolai Vogler:
Multi-view representation learning via gcca for multimodal analysis of Parkinson's disease. ICASSP 2017: 2966-2970 - [c45]Chunxi Liu, Jinyi Yang, Ming Sun, Santosh Kesiraju, Alena Rott, Lucas Ondel, Pegah Ghahremani, Najim Dehak, Lukás Burget, Sanjeev Khudanpur:
An empirical evaluation of zero resource acoustic unit discovery. ICASSP 2017: 5305-5309 - [c44]Santosh Kesiraju, Raghavendra Pappagari, Lucas Ondel, Lukás Burget, Najim Dehak, Sanjeev Khudanpur, Jan Cernocký, Suryakanth V. Gangashetty:
Topic identification of spoken documents using unsupervised acoustic unit discovery. ICASSP 2017: 5745-5749 - [c43]Nicanor García, Juan Rafael Orozco-Arroyave, Luis Fernando D'Haro, Najim Dehak, Elmar Nöth:
Evaluation of the Neurological State of People with Parkinson's Disease Using i-Vectors. INTERSPEECH 2017: 299-303 - [c42]Jesús Villalba, Niko Brümmer, Najim Dehak:
Tied Variational Autoencoder Backends for i-Vector Speaker Recognition. INTERSPEECH 2017: 1004-1008 - [c41]Pedro A. Torres-Carrasquillo, Fred Richardson, Shahan C. Nercessian, Douglas E. Sturim, William M. Campbell, Youngjune Gwon, Swaroop Vattam, Najim Dehak, Sri Harish Reddy Mallidi, Phani Sankar Nidadavolu, Ruizhi Li, Réda Dehak:
The MIT-LL, JHU and LRDE NIST 2016 Speaker Recognition Evaluation System. INTERSPEECH 2017: 1333-1337 - [c40]Nicanor García, Juan Camilo Vásquez-Correa, Juan Rafael Orozco-Arroyave, Najim Dehak, Elmar Nöth:
Language Independent Assessment of Motor Impairments of Patients with Parkinson's Disease Using i-Vectors. TSD 2017: 147-155 - [i2]Chunxi Liu, Jinyi Yang, Ming Sun, Santosh Kesiraju, Alena Rott, Lucas Ondel, Pegah Ghahremani, Najim Dehak, Lukás Burget, Sanjeev Khudanpur:
An Empirical Evaluation of Zero Resource Acoustic Unit Discovery. CoRR abs/1702.01360 (2017) - 2016
- [j7]Stephen H. Shum, David F. Harwath, Najim Dehak, James R. Glass:
On the Use of Acoustic Unit Discovery for Language Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 24(9): 1665-1676 (2016) - [c39]Mohammed Senoussaoui, Patrick Cardinal, Najim Dehak, Alessandro L. Koerich:
Native Language Detection Using the I-Vector Framework. INTERSPEECH 2016: 2398-2402 - [c38]Ahmed Ali, Najim Dehak, Patrick Cardinal, Sameer Khurana, Sree Harsha Yella, James R. Glass, Peter Bell, Steve Renals:
Automatic Dialect Detection in Arabic Broadcast Speech. INTERSPEECH 2016: 2934-2938 - [c37]Ruizhi Li, Sri Harish Reddy Mallidi, Lukás Burget, Oldrich Plchot, Najim Dehak:
Exploiting Hidden-Layer Responses of Deep Neural Networks for Language Recognition. INTERSPEECH 2016: 3265-3269 - [c36]Najim Dehak:
I-Vector Representation Based on GMM and DNN for Audio Classification. Odyssey 2016 - [c35]Pedro A. Torres-Carrasquillo, Najim Dehak, Elizabeth Godoy, Douglas A. Reynolds, Fred Richardson, Stephen Shum, Elliot Singer, Douglas E. Sturim:
The MITLL NIST LRE 2015 Language Recognition System. Odyssey 2016: 196-203 - 2015
- [j6]Fred Richardson, Douglas A. Reynolds, Najim Dehak:
Deep Neural Network Approaches to Speaker and Language Recognition. IEEE Signal Process. Lett. 22(10): 1671-1675 (2015) - [c34]Fred Richardson, Douglas A. Reynolds, Najim Dehak:
A unified deep neural network for speaker and language recognition. INTERSPEECH 2015: 1146-1150 - [c33]Patrick Cardinal, Najim Dehak, Yu Zhang, James R. Glass:
Speaker adaptation using the i-vector technique for bottleneck features. INTERSPEECH 2015: 2867-2871 - [c32]Patrick Cardinal, Najim Dehak, Alessandro Lameiras Koerich, Jahangir Alam, Patrice Boucher:
ETS System for AV+EC 2015 Challenge. AVEC@ACM Multimedia 2015: 17-23 - [i1]Fred Richardson, Douglas A. Reynolds, Najim Dehak:
A Unified Deep Neural Network for Speaker and Language Recognition. CoRR abs/1504.00923 (2015) - 2014
- [j5]Mohamad Hasan Bahari, Najim Dehak, Hugo Van hamme, Lukás Burget, Ahmed Ali, Jim Glass:
Non-Negative Factor Analysis of Gaussian Mixture Model Weight Adaptation for Language and Dialect Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 22(7): 1117-1129 (2014) - [c31]Stephen H. Shum, Najim Dehak, James R. Glass:
Limited labels for unlimited data: active learning for speaker recognition. INTERSPEECH 2014: 383-387 - [c30]Patrick Cardinal, Ahmed Ali, Najim Dehak, Yu Zhang, Tuka Al Hanai, Yifan Zhang, James R. Glass, Stephan Vogel:
Recent advances in ASR applied to an Arabic transcription system for Al-Jazeera. INTERSPEECH 2014: 2088-2092 - [c29]Najim Dehak, Oldrich Plchot, Mohamad Hasan Bahari, Lukás Burget, Hugo Van hamme, Réda Dehak:
GMM Weights Adaptation Based on Subspace Approaches for Speaker Verification. Odyssey 2014: 48-53 - [c28]Ahmed Ali, Yifan Zhang, Patrick Cardinal, Najim Dehak, Stephan Vogel, James R. Glass:
A complete KALDI recipe for building Arabic speech recognition systems. SLT 2014: 525-529 - 2013
- [j4]Stephen Shum, Najim Dehak, Réda Dehak, James R. Glass:
Unsupervised Methods for Speaker Diarization: An Integrated and Iterative Approach. IEEE Trans. Speech Audio Process. 21(10): 2015-2028 (2013) - [c27]Oldrich Plchot, Spyros Matsoukas, Pavel Matejka, Najim Dehak, Jeff Z. Ma, Sandro Cumani, Ondrej Glembek, Hynek Hermansky, Sri Harish Reddy Mallidi, Nima Mesgarani, Richard M. Schwartz, Mehdi Soufifar, Zheng-Hua Tan, Samuel Thomas, Bing Zhang, Xinhui Zhou:
Developing a speaker identification system for the DARPA RATS project. ICASSP 2013: 6768-6772 - [c26]Xiao Fang, Najim Dehak, James R. Glass:
Bayesian distance metric learning on i-vector for speaker verification. INTERSPEECH 2013: 2514-2518 - [c25]Mohammed Senoussaoui, Patrick Kenny, Pierre Dumouchel, Najim Dehak:
New cosine similarity scorings to implement gender-independent speaker verification. INTERSPEECH 2013: 2773-2777 - 2012
- [c24]Pavel Matejka, Oldrich Plchot, Mehdi Soufifar, Ondrej Glembek, Luis Fernando D'Haro, Karel Veselý, Frantisek Grézl, Jeff Z. Ma, Spyros Matsoukas, Najim Dehak:
Patrol Team Language Identification System for DARPA RATS P1 Evaluation. INTERSPEECH 2012: 50-53 - [c23]Stephen Shum, Najim Dehak, Jim Glass:
On the Use of Spectral and Iterative Methods for Speaker Diarization. INTERSPEECH 2012: 482-485 - [c22]Mohammed Senoussaoui, Najim Dehak, Patrick Kenny, Réda Dehak, Pierre Dumouchel:
First attempt of boltzmann machines for speaker verification. Odyssey 2012: 117-121 - [c21]Elliot Singer, Pedro A. Torres-Carrasquillo, Douglas A. Reynolds, Alan McCree, Fred Richardson, Najim Dehak, Douglas E. Sturim:
The MITLL NIST LRE 2011 language recognition system. Odyssey 2012: 209-215 - 2011
- [j3]Najim Dehak, Patrick Kenny, Réda Dehak, Pierre Dumouchel, Pierre Ouellet:
Front-End Factor Analysis for Speaker Verification. IEEE Trans. Speech Audio Process. 19(4): 788-798 (2011) - [c20]Zahi N. Karam, William M. Campbell, Najim Dehak:
Towards reduced false-alarms using cohorts. ICASSP 2011: 4512-4515 - [c19]Najim Dehak, Zahi N. Karam, Douglas A. Reynolds, Réda Dehak, William M. Campbell, James R. Glass:
A channel-blind system for speaker verification. ICASSP 2011: 4536-4539 - [c18]Douglas E. Sturim, William M. Campbell, Najim Dehak, Zahi N. Karam, Alan McCree, Douglas A. Reynolds, Fred Richardson, Pedro A. Torres-Carrasquillo, Stephen Shum:
The MIT LL 2010 speaker recognition evaluation system: Scalable language-independent speaker recognition. ICASSP 2011: 5272-5275 - [c17]Najim Dehak, Pedro A. Torres-Carrasquillo, Douglas A. Reynolds, Réda Dehak:
Language Recognition via i-vectors and Dimensionality Reduction. INTERSPEECH 2011: 857-860 - [c16]Stephen Shum, Najim Dehak, Ekapol Chuangsuwanich, Douglas A. Reynolds, James R. Glass:
Exploiting Intra-Conversation Variability for Speaker Diarization. INTERSPEECH 2011: 945-948 - 2010
- [c15]Mohammed Senoussaoui, Patrick Kenny, Najim Dehak, Pierre Dumouchel:
An i-vector Extractor Suitable for Speaker Recognition with both Microphone and Telephone Speech. Odyssey 2010: 6 - [c14]Najim Dehak, Réda Dehak, James R. Glass, Douglas A. Reynolds, Patrick Kenny:
Cosine Similarity Scoring without Score Normalization Techniques. Odyssey 2010: 15 - [c13]Stephen Shum, Najim Dehak, Réda Dehak, James R. Glass:
Unsupervised Speaker Adaptation based on the Cosine Similarity for Text-Independent Speaker Verification. Odyssey 2010: 16
2000 – 2009
- 2009
- [c12]Ondrej Glembek, Lukás Burget, Najim Dehak, Niko Brümmer, Patrick Kenny:
Comparison of scoring methods used in speaker recognition with Joint Factor Analysis. ICASSP 2009: 4057-4060 - [c11]Najim Dehak, Patrick Kenny, Réda Dehak, Ondrej Glembek, Pierre Dumouchel, Lukás Burget, Valiantsina Hubeika, Fabio Castaldo:
Support vector machines and Joint Factor Analysis for speaker verification. ICASSP 2009: 4237-4240 - [c10]Pierre Dumouchel, Najim Dehak, Yazid Attabi, Réda Dehak, Narjès Boufaden:
Cepstral and long-term features for emotion recognition. INTERSPEECH 2009: 344-347 - [c9]Najim Dehak, Réda Dehak, Patrick Kenny, Niko Brümmer, Pierre Ouellet, Pierre Dumouchel:
Support vector machines versus fast scoring in the low-dimensional total variability space for speaker verification. INTERSPEECH 2009: 1559-1562 - 2008
- [j2]Patrick Kenny, Pierre Ouellet, Najim Dehak, Vishwa Gupta, Pierre Dumouchel:
A Study of Interspeaker Variability in Speaker Verification. IEEE Trans. Speech Audio Process. 16(5): 980-988 (2008) - [c8]Patrick Kenny, Najim Dehak, Pierre Ouellet, Vishwa Gupta, Pierre Dumouchel:
Development of the primary CRIM system for the NIST 2008 speaker recognition evaluation. INTERSPEECH 2008: 1401-1404 - [c7]Najim Dehak, Réda Dehak, Patrick Kenny, Pierre Dumouchel:
Comparison between factor analysis and GMM support vector machines for speaker verification. Odyssey 2008: 9 - [c6]Patrick Kenny, Najim Dehak, Réda Dehak, Vishwa Gupta, Pierre Dumouchel:
The role of speaker factors in the NIST extended data task. Odyssey 2008: 11 - [c5]Réda Dehak, Najim Dehak, Patrick Kenny, Pierre Dumouchel:
Kernel combination for SVM speaker verification. Odyssey 2008: 21 - 2007
- [j1]Najim Dehak, Pierre Dumouchel, Patrick Kenny:
Modeling Prosodic Features With Joint Factor Analysis for Speaker Verification. IEEE Trans. Speech Audio Process. 15(7): 2095-2103 (2007) - [c4]Réda Dehak, Najim Dehak, Patrick Kenny, Pierre Dumouchel:
Linear and non linear kernel GMM supervector machines for speaker verification. INTERSPEECH 2007: 302-305 - [c3]Najim Dehak, Patrick Kenny, Pierre Dumouchel:
Continuous prosodic features and formant modeling with joint factor analysis for speaker verification. INTERSPEECH 2007: 1234-1237 - 2006
- [c2]Hervé Bredin, Najim Dehak, Gérard Chollet:
GMM-based SVM for face recognition. ICPR (3) 2006: 1111-1114 - [c1]Najim Dehak, Gérard Chollet:
Support Vector Gmms for Speaker Verification. Odyssey 2006: 1-4
last updated on 2024-12-08 02:24 CET by the dblp team
all metadata released as open data under CC0 1.0 license