26th SPECOM 2024: Belgrade, Serbia - Part II
- Alexey Karpov, Vlado Delic:
Speech and Computer - 26th International Conference, SPECOM 2024, Belgrade, Serbia, November 25-28, 2024, Proceedings, Part II. Lecture Notes in Computer Science 15300, Springer 2025, ISBN 978-3-031-78013-4
Computational Paralinguistics
- Denis Dresvyanskiy, Alexey Karpov, Wolfgang Minker:
A Cross-Multi-modal Fusion Approach for Enhanced Engagement Recognition. 3-17
- Gábor Gosztolya, András Bence Lázár, Ildikó Hoffmann, Otília Bagi, Fruzsina Fanni Farkas, Janka Gajdics, László Tóth, János Kálmán:
Automatic Assessment of Signs of Alcohol Dependency Syndrome from Spontaneous Speech. 18-29
- Aya Abdalla, Nada Sharaf, Caroline Sabty:
An Enhanced Compact Convolution Transformer for Age, Gender and Emotion Detection in Egyptian Arabic Speech. 30-42
- Elizaveta Vologina, Anastasiia Matveeva, Olesia Makhnytkina, Yuri Matveev, Nursaule Burambayeva:
RAG and Few-Shot Prompting in Emotional Text Generation. 43-53
- Ahmed Sherif, Caroline Sabty:
Sentiment Analysis for Egyptian Arabic-English Code-Switched Data Using Traditional Neural Models and Advanced Language Models. 54-69
- Uliana E. Kochetkova, Pavel A. Skrelin, Vera Evdokimova, Nikolai Borisov, Pavel Scherbakov, Petr Fedkin, Rada German:
Automatic Detection of Irony Based on Acoustic Features and Facial Expressions. 70-82
Affective Computing
- Olga V. Frolova, Anton Matveev, Elena E. Lyakso, Tamara Kuznetsova, Inna Golubeva:
Emotion Recognition by Vocalizations of Nonhuman Primates: Human and Automatic Classification. 85-94
- Aman Goel, Abhishek Poswal:
MMHS: Multimodal Model for Hate Speech Intensity Prediction. 95-108
- Tijana Durkic, Nikola Simic, Sinisa Suzic, Dragana Bajovic, Zoran Peric, Vlado Delic:
Multimodal Emotion Recognition Using Compressed Graph Neural Networks. 109-121
- Olesia Makhnytkina, Yuri Matveev, Alexander Zubakov, Anton Matveev:
Utilizing Speaker Models and Topic Markers for Emotion Recognition in Dialogues. 122-137
- Elena E. Lyakso, Olga V. Frolova, Aleksandr Nikolaev, Severin Grechanyi, Yulia Filatova, Ruban Nersisson:
How Children Recognize Emotions from Video and Audio. 138-153
Speaker Recognition
- Jahangir Alam, Md Shahidul Alam:
On the Influence of CNN-Based Feature Learning Modules in Neural Speaker Verification Framework. 157-170
- Jacek Kudera, Miriam Coccia, Sharifeh Fadaeijouybari, Till Preidt, Akshay Ranjan, Angelika Braun:
Voice Cloning and Mismatch Conditions in Forensic Automatic Speaker Recognition. 171-184
- Shalini Tomar, Shashidhar G. Koolagudi:
Transformation of Emotional Speech to Anger Speech to Reduce Mismatches in Testing and Enrollment Speech for Speaker Recognition System. 185-200
- Parth Sanjay Khadse, Sabyasachi Chandra, Puja Bharati, Debolina Pramanik, G. Satya Prasad, Aniket Aitawade, Shyamal Kumar Das Mandal:
Investigating Data Requirements for Hindi Speaker Recognition: A Comparative Study with English. 201-209
- Rodmonga Potapova, Vsevolod Potapov, Irina Kuryanova:
Practical Evaluation and Validation of Methods for Automatic Speaker Identification (as Applied to Various Languages). 210-223
Digital Speech Processing
- Branislav Gerazov, Paul Konstantin Krug, Daniel R. van Niekerk, Anqi Xu, Peter Birkholz, Yi Xu:
In Pursuit for the Best Error Metric for Optimisation of Articulatory Vowel Synthesis. 227-237
- Lukas Förner, Maximilian Dauner:
Exploring MetaConformer for Speech Enhancement. 238-249
- YingWei Tan:
Integration of Short-Term and Long-Term Harmonic Peaks in a Two-Level Discriminative Weight Training Framework for Voice Activity Detection. 250-263
- Anandakumar Singaravelan, Jia-Lien Hsu:
Separating Party Conversation by Applying Contrastive Learning Methodology. 264-276
- Himadri Mukherjee, Matteo Marciano, Ankita Dhar, Kaushik Roy:
DuFCALF: Instilling Sentience in Computerized Song Analysis. 277-292
Natural Language Processing
- Manar Ouled Ahmed, Zuheng Ming, Alice Othmani:
Harnessing Knowledge Distillation for Enhanced Text-to-Text Translation in Low-Resource Languages. 295-307
- Yasser Saeid, Thomas Kopinski:
Bias Unveiled: Enhancing Fairness in German Word Embeddings with Large Language Models. 308-325
- Prateek Verma:
Conformer LLM - Convolution Augmented Large Language Models. 326-333
- Valery Solovyev, Anna Ivleva:
How to Detect Imbalances in the Google Books Ngram Corpus? 334-348
- Vladimir V. Bochkarev, Andrey V. Savinkov, Anna V. Shevlyakova:
Predicting the Valence Rating of Russian Words Using Various Pre-trained Word Embeddings. 349-361
- Radek Marík, Renata Landgráfová, Jirí Liska:
Ancient Egyptian Hieroglyphic Texts Structure Identification. 362-377