IberSPEECH 2021: Valladolid, Spain
- Valentín Cardeñoso-Payo, David Escudero Mancebo, César González Ferreras:
Fifth International Conference, IberSPEECH 2021, Valladolid, Spain, 24-25 March 2021, Proceedings. ISCA 2021
Applications of Speech Technologies for Learning and Education
- David Escudero, Valentín Cardeñoso-Payo, Mario Corrales-Astorgano, César González Ferreras:
Prosodic feature selection for automatic quality assessment of oral productions in people with Down syndrome.
- Cristian Tejedor García, Valentín Cardeñoso-Payo, David Escudero Mancebo:
Performance Comparison of Specific and General-Purpose ASR Systems for Pronunciation Assessment of Japanese Learners of Spanish.
- Yu Bai, Ferdy Hubers, Catia Cucchiarini, Helmer Strik:
An ASR-based Reading Tutor for Practicing Reading Skills in the First Grade: Improving Performance through Threshold Adjustment.
- Catarina Realinho, Rita Gonçalves, Helena Moniz, Isabel Trancoso:
Impact of vowel reduction in L2 Chinese learners of Portuguese within and across word boundaries.
- Diogo Botelheiro, Alberto Abad, João Freitas, Rui Correia:
Nativeness Assessment for Crowdsourced Speech Collections.
Keynote 1
- Gérard Bailly:
Characterizing and Assessing the Oral Reading Fluency of Young Readers. IberSPEECH 2021
Speech Processing and Acoustic Event Detection
- Pablo Gimeno, Dayana Ribas, Alfonso Ortega Giménez, Antonio Miguel, Eduardo Lleida:
Convolutional Recurrent Neural Networks for Speech Activity Detection in Naturalistic Audio from Apollo Missions.
- Juan Manuel Martín-Doñas, Antonio M. Peinado, Iván López-Espejo, Ángel M. Gómez:
Dual-channel eKF-RTF framework for speech enhancement with DNN-based speech presence estimation.
- Diego de Benito-Gorrón, Daniel Ramos-Castro, Doroteo T. Toledano:
An analysis of Sound Event Detection under acoustic degradation using multi-resolution systems.
- David Bonet, Guillermo Cámbara, Fernando López, Pablo Gómez, Carlos Segura, Jordi Luque, Mireia Farrús:
Speech Enhancement for Wake-Up-Word detection in Voice Assistants.
- Fernando Fernández Martínez, David Griol, Zoraida Callejas, Cristina Luna Jiménez:
An approach to intent detection and classification based on attentive recurrent neural networks.
- Mikel de Velasco, Raquel Justo, Leila Ben Letaifa, M. Inés Torres:
Contrasting the Emotions identified in Spanish TV debates and in Human-Machine Interactions.
- Roberto Móstoles, David Griol, Zoraida Callejas, Fernando Fernández Martínez:
A proposal for emotion recognition using speech features, transfer learning and convolutional neural networks.
- Esther Rituerto-González, Clara Luis-Mingueza, Carmen Peláez-Moreno:
Using Audio Events to Extend a Multi-modal Public Speaking Database with Reinterpreted Emotional Annotations.
Albayzín Evaluation Challenges
- Juan Ignacio Álvarez-Trejos, Doroteo T. Toledano:
Query-by-Example Spoken Term Detection using Attentive Pooling Networks at ALBAYZIN 2020 Evaluation: The AUDIAS-UAM System.
- Cristina Luna Jiménez, Ricardo Kleinlein, Fernando Fernández Martínez, José Manuel Pardo Muñoz, José Manuel Moya-Fernández:
GTH-UPM System for Albayzin Multimodal Diarization Challenge 2020.
- Victoria Mingote, Ignacio Viñals, Pablo Gimeno, Antonio Miguel, Alfonso Ortega Giménez, Eduardo Lleida:
ViVoLAB Multimodal Diarization System for RTVE 2020 Challenge.
- Manuel Porta-Lorenzo, José Luis Alba-Castro, Laura Docío Fernández:
The GTM-UVIGO System for Audiovisual Diarization 2020.
- Roberto Font, Teresa Grau:
The Biometric Vox System for the Albayzin-RTVE 2020 Speaker Diarization and Identity Assignment Challenge.
- Carlos Rodrigo Castillo-Sanchez, Leibny Paola García-Perera:
The CLIR-CLSP System for the IberSPEECH-RTVE 2020 Speaker Diarization and Identity Assignment Challenge.
- Ignacio Viñals, Pablo Gimeno, Alfonso Ortega Giménez, Antonio Miguel, Eduardo Lleida:
Diarization and Identity Attribution Compatibility in the Albayzin 2020 Challenge.
- Roberto Font, Teresa Grau:
The Biometric Vox System for the Albayzin-RTVE 2020 Speech-to-Text Challenge.
- Aitor Álvarez, Haritz Arzelus, Iván González Torre, Ander González-Docasal:
The Vicomtech Speech Transcription Systems for the Albayzín-RTVE 2020 Speech to Text Transcription Challenge.
- Juan M. Perero-Codosero, Fernando M. Espinoza-Cuadros, Luis Alfonso Hernández Gómez:
Sigma-UPM ASR Systems for the IberSpeech-RTVE 2020 Speech-to-Text Transcription Challenge.
- Martin Kocour, Guillermo Cámbara, Jordi Luque, David Bonet, Mireia Farrús, Martin Karafiát, Karel Veselý, Jan Cernocký:
BCN2BRNO: ASR System Fusion for Albayzin 2020 Speech to Text Challenge.
- Javier Jorge, Adrià Giménez, Pau Baquero-Arnal, Javier Iranzo-Sánchez, Alejandro Pérez, Gonçal V. Garcés Díaz-Munío, Joan Albert Silvestre-Cerdà, Jorge Civera, Albert Sanchís, Alfons Juan:
MLLP-VRAIN Spanish ASR Systems for the Albayzin-RTVE 2020 Speech-To-Text Challenge.
Research and Development Projects
- David Escudero Mancebo, Valentín Cardeñoso-Payo, Mario Corrales-Astorgano, César González Ferreras, Valle Flores-Lucas, Lourdes Aguilar, Yolanda Martín-de-San-Pablo, Alfonso Rodríguez-de-Rojas:
Incorporation of an automatic module for the prediction of the quality of oral communication of people with Down syndrome in an educational video game.
- Sergio Figueras, Alejandro García-Caballero, Carmen García-Mateo, Laura Docío Fernández, Edward L. Campbell, José Baltasar García Pérez-Schofield, Leandro Rodríguez Liñares, Arturo J. Méndez:
CIRUSS Platform: Surgery Patient Empowerment by Stress and Anxiety Monitoring.
- Inma Hernáez, José Andrés González López, Eva Navas, José Luis Pérez-Córdoba, Ibon Saratxaga, Gonzalo Olivares, Jon Sánchez de la Fuente, Alberto Galdón, Víctor García Romillo, Míriam González-Atienza, Tanja Schultz, Phil D. Green, Michael Wand, Ricard Marxer, Lorenz Diener:
Voice Restoration with Silent Speech Interfaces (ReSSInt).
- Catarina Oliveira, Ana Rita Valente, Luciana Albuquerque, Fábio Barros, Paula Martins, Samuel S. Silva, António J. S. Teixeira:
The Vox Senes project: a study of segmental changes and rhythm variations on European Portuguese aging voice.
- David Griol, David Pérez Fernández, Zoraida Callejas:
Hispabot-Covid19: the official Spanish conversational system about Covid-19.
- Samuel S. Silva, António J. S. Teixeira, Nuno Almeida, Diogo Cunha, David Ferreira, Conceição Cunha:
Project MEMNON: Extending Speech Production Studies to Silent Speech, Dynamic Sounds and Audiovisual Speech Synthesis.
- Zoraida Callejas, David Griol, Kawtar Benghazi Akhlaki, Manuel Noguera, María Inés Torres, Raquel Justo, Anna Esposito, Gennaro Cordasco, Raymond R. Bond, Maurice D. Mulvenna, Edel Ennis, Siobhan O'Neill, Huiru Zheng, Matthias Kraus, Nicolas Wagner, Wolfgang Minker, Gavin McConvey, Matthias L. Hemmje, Michael Fuchs, Neil Glackin, Gérard Chollet:
Towards conversational technology to promote, monitor and protect mental health.
- Oriol Guasch, Francesc Alías, Marc Arnela, Joan Claudi Socoró, Marc Freixes, Arnau Pont:
GENIOVOX Project: Computational generation of expressive voice.
Ph.D. Thesis
- Sara Santiso:
Adverse Drug Reaction extraction on Electronic Health Records written in Spanish: A PhD Thesis overview.
- Cristian Tejedor García, Valentín Cardeñoso-Payo, David Escudero Mancebo:
Design and Evaluation of Mobile Computer-Assisted Pronunciation Training Tools for Second Language Learning: a Ph.D. Thesis Overview.
- Laureano Moro-Velázquez, Jorge Gómez-García, Najim Dehak, Juan Ignacio Godino-Llorente:
New tools for the differential evaluation of Parkinson's disease using voice and speech processing.
- Mario Corrales-Astorgano:
Prosody training of people with Down syndrome using an educational video game.
- Umair Khan, Javier Hernando:
Self-supervised Deep Learning Approaches to Speaker Recognition: A Ph.D. Thesis Overview.
ASR and NLP Techniques
- María Pilar Fernández-Gallego, Doroteo T. Toledano:
A study of data augmentation for increased ASR robustness against packet losses.
- Carlos Carvalho, Alberto Abad:
TRIBUS: An end-to-end automatic speech recognition system for European Portuguese.
- Thierry Etchegoyhen, Haritz Arzelus, Harritxu Gete Ugarte, Aitor Álvarez, Ander González-Docasal, Edson Benites Fernandez:
mintzai-ST: Corpus and Baselines for Basque-Spanish Speech Translation.
- Nuno Carriço, Paulo Quaresma:
Sentence Embeddings and Sentence Similarity for Portuguese FAQs.
- Rui Ribeiro, Alberto Abad, José Lopes:
Domain Adaptation in Dialogue Systems using Transfer and Meta-Learning.
Keynote 2
- Antonio Bonafonte:
Diverse Conversational Spoken Language Generation. IberSPEECH 2021
Speech Synthesis and Multimodal Processing
- Agustín Alonso, Victor García, Inma Hernáez, Eva Navas, Jon Sánchez:
Automatic Speaker Adaptation Assessment Based on Objective Measures for Voice Banking Donors.
- Conceição Cunha, Nuno Almeida, Jens Frahm, Samuel S. Silva, António J. S. Teixeira:
Data-driven analysis of nasal vowels dynamics and coordination: Results for bilabial contexts.
- David Gimeno-Gómez, Carlos D. Martínez-Hinarejos:
Analysis of Visual Features for Continuous Lipreading in Spanish.
- Victor Garcia, Inma Hernáez, Eva Navas:
Implementation of neural network based synthesizers for Spanish and Basque.
- José Andrés González López, Miriam Gonzalez-Atienza, Alejandro Gómez Alanís, José Luis Pérez-Córdoba, Phil D. Green:
Multi-view Temporal Alignment for Non-parallel Articulatory-to-Acoustic Speech Synthesis.
- Aitana Villaplana, Carlos David Martínez-Hinarejos:
Generation of Synthetic Sign Language Sentences.
- Marc Freixes, Francesc Alías, Joan Claudi Socoró:
Contribution of vocal tract and glottal source spectral cues in the generation of happy and aggressive [a] vowels.
- Luciana Albuquerque, Ana Rita Valente, Fábio Barros, António J. S. Teixeira, Samuel S. Silva, Paula Martins, Catarina Oliveira:
The age effects on EP vowel production: an ultrasound pilot study.
Speaker Characterization and Diarization
- David Romero, Luis Fernando D'Haro, Christian Salamea:
Exploring Transformer-based Language Recognition using Phonotactic Information.
- Alejandro Gómez Alanís, José A. González, Antonio M. Peinado:
Adversarial Transformation of Spoofing Attacks for Voice Biometrics.
- Yevhenii Prokopalo, Meysam Shamsi, Loïc Barrault, Sylvain Meignier, Anthony Larcher:
Active correction for speaker diarization with human in the loop.
- Miriam Gonzalez-Atienza, Antonio M. Peinado, José Andrés González López:
An Automatic System for Dementia Detection using Acoustic and Linguistic Features.
- Edward L. Campbell, Laura Docío Fernández, Javier Jiménez Raboso, Carmen García-Mateo:
Alzheimer's Dementia Detection from Audio and Language Modalities in Spontaneous Speech.