- 2024
- Jesús Villalba:
Towards Speech Processing Robust to Adversarial Deceptions. Odyssey 2024 - Kun Zhou, Berrak Sisman, Carlos Busso, Bin Ma, Haizhou Li:
Mixed-EVC: Mixed Emotion Synthesis and Control in Voice Conversion. Odyssey 2024: 180-186 - Can Cui, Imran A. Sheikh, Mostafa Sadeghi, Emmanuel Vincent:
Improving Speaker Assignment in Speaker-Attributed ASR for Real Meeting Applications. Odyssey 2024: 99-106 - Juan Ignacio Álvarez-Trejos, Beltrán Labrador, Alicia Lozano-Diez:
Leveraging Speaker Embeddings in End-to-End Neural Diarization for Two-Speaker Scenarios. Odyssey 2024: 107-114 - Imen Ben Amor, Jean-François Bonastre, David van der Vloed:
Forensic speaker recognition with BA-LR: calibration and evaluation on a forensically realistic database. Odyssey 2024: 9-16 - Jaime Bellver-Soler, Iván Martín-Fernández, Jose M. Bravo-Pacheco, Sergio Esteban Romero, Fernando Fernández Martínez, Luis Fernando D'Haro:
Multimodal Audio-Language Model for Speech Emotion Recognition. Odyssey 2024: 288-295 - Japan Bhatt, Harsh Patel, Hemant A. Patil:
Noise Robust Whisper Features for Dysarthric Automatic Speech Recognition. Odyssey 2024: 217-224 - Carlos Busso:
Toward Robust and Discriminative Emotional Speech Representations. Odyssey 2024 - Shreeram Suresh Chandra, Zongyang Du, Berrak Sisman:
Exploring speech style spaces with language models: Emotional TTS without emotion labels. Odyssey 2024: 194-200 - Mingjie Chen, Hezhao Zhang, Yuanchao Li, Jiachen Luo, Wen Wu, Ziyang Ma, Peter Bell, Catherine Lai, Joshua D. Reiss, Lin Wang, Philip C. Woodland, Xie Chen, Huy Phan, Thomas Hain:
1st Place Solution to Odyssey Emotion Recognition Challenge Task1: Tackling Class Imbalance Problem. Odyssey 2024: 260-265 - Oubaïda Chouchane, Christoph Busch, Chiara Galdi, Nicholas W. D. Evans, Massimiliano Todisco:
A Comparison of Differential Performance Metrics for the Evaluation of Automatic Speaker Verification Fairness. Odyssey 2024: 209-216 - Joon Son Chung:
Multimodal Learning of Speech and Speaker Representations. Odyssey 2024 - Federico Costa, Miquel India, Javier Hernando:
Double Multi-Head Attention Multimodal System for Odyssey 2024 Speech Emotion Recognition Challenge. Odyssey 2024: 266-273 - Anh-Tuan Dao, Nicholas W. D. Evans, Driss Matrouf:
Spoofing detection in the wild: an investigation of approaches to improve generalisation. Odyssey 2024: 145-150 - Daria Diatlova, Anton Udalov, Vitalii Shutov, Egor Spirin:
Adapting WavLM for Speech Emotion Recognition. Odyssey 2024: 303-308 - Zongyang Du, Junchen Lu, Kun Zhou, Lakshmish Kaushik, Berrak Sisman:
Converting Anyone's Voice: End-to-End Expressive Voice Conversion with A Conditional Diffusion Model. Odyssey 2024: 172-179 - Jarod Duret, Yannick Estève, Mickael Rouvier:
MSP-Podcast SER Challenge 2024: L'antenne du Ventoux Multimodal Self-Supervised Learning for Speech Emotion Recognition. Odyssey 2024: 309-314 - Satwik Dutta, Iván López-Espejo, Dwight Irvin, John H. L. Hansen:
Joint Language and Speaker Classification in Naturalistic Bilingual Adult-Toddler Interactions. Odyssey 2024: 81-85 - Aleix Espuña, Amrutha Prasad, Petr Motlícek, Srikanth R. Madikeri, Christof Schüpbach:
Normalizing Flows for Speaker and Language Recognition Backend. Odyssey 2024: 74-80 - Abderrahim Fathan, Xiaolin Zhu, Jahangir Alam:
An investigative study of the effect of several regularization techniques on label noise robustness of self-supervised speaker verification systems. Odyssey 2024: 43-50 - Anna Favaro, Najim Dehak, Thomas Thebaud, Jesús Villalba, Esther S. Oh, Laureano Moro-Velázquez:
Discovering Invariant Patterns of Cognitive Decline Via an Automated Analysis of the Cookie Thief Picture Description Task. Odyssey 2024: 201-208 - Thibault Gaudier, Marie Tahon, Anthony Larcher, Yannick Estève:
Automatic Voice Identification after Speech Resynthesis using PPG. Odyssey 2024: 187-193 - Linda Gerlach, Finnian Kelly, Kirsty McDougall, Anil Alexander:
Exploring speaker similarity based selection of relevant populations for forensic automatic speaker recognition. Odyssey 2024: 25-30 - Lucas Goncalves, Ali N. Salman, Abinay Reddy Naini, Laureano Moro-Velázquez, Thomas Thebaud, Paola García, Najim Dehak, Berrak Sisman, Carlos Busso:
Odyssey 2024 - Speech Emotion Recognition Challenge: Dataset, Baseline Framework, and Results. Odyssey 2024: 247-254 - Reza Amini Gougeh, Nu Zhang, Zeljko Zilic:
Optimizing Auditory Immersion Safety on Edge Devices: An On-Device Sound Event Detection System. Odyssey 2024: 225-231 - Craig S. Greenberg:
A Brief History of the NIST Speaker Recognition Evaluations. Odyssey 2024 - Nathan Griot, Mohammad MohammadAmini, Driss Matrouf, Raphaël Blouet, Jean-François Bonastre:
Attention-based Comparison on Aligned Utterances for Text-Dependent Speaker Verification. Odyssey 2024: 31-37 - Henry Härm, Tanel Alumäe:
TalTech Systems for the Odyssey 2024 Emotion Recognition Challenge. Odyssey 2024: 255-259 - Mingrui He, Longting Xu, Han Wang, Mingjun Zhang, Rohan Kumar Das:
Device Feature based on Graph Fourier Transformation with Logarithmic Processing For Detection of Replay Speech Attacks. Odyssey 2024: 137-144 - Vincent Hughes, Chenzi Xu, Paul Foulkes, Philip Harrison, Poppy Welch, Finnian Kelly, David van der Vloed:
Exploring individual speaker behaviour within a forensic automatic speaker recognition system. Odyssey 2024: 1-8