default search action

combined dblp search
author search
venue search
publication search

ask others

26th SPECOM 2024: Belgrade, Serbia - Part I

> Home > Conferences and Workshops > SPECOM

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

- view
  authority control:
- export record
  dblp key:
  - conf/specom/2024-1
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/specom/2024-1
Alexey Karpov, Vlado Delic:
Speech and Computer - 26th International Conference, SPECOM 2024, Belgrade, Serbia, November 25-28, 2024, Proceedings, Part I. Lecture Notes in Computer Science 15299, Springer 2025, ISBN 978-3-031-77960-2

Invited Papers

- view
  authority control:
- export record
  dblp key:
  - conf/specom/KraljevskiDSTW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/specom/KraljevskiDSTW24
Ivan Kraljevski, Frank Duckhorn, Daniel Sobe, Constanze Tschöpe, Matthias Wolff:
Preserving Language Heritage Through Speech Technology: The Case of Upper Sorbian. 3-22
- view
  authority control:
- export record
  dblp key:
  - conf/specom/SecujskiPPJPSNSSD24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/specom/SecujskiPPJPSNSSD24
Milan Secujski, Branislav M. Popovic, Darko Pekar, Niksa Jakovljevic, Edvin Pakoci, Sinisa Suzic, Tijana V. Nosek, Nikola Simic, Vuk Stanojev, Vlado Delic:
Retrospective and Perspectives of TTS & STT Technology Development and Implementation for South Slavic Under-Resourced Languages. 23-42

Automatic Speech Recognition

- view
  authority control:
- export record
  dblp key:
  - conf/specom/LuoM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/specom/LuoM24
Yue Luo, Péter Mihajlik:
Comparison of Well and Lower-Resourced Self-training in ASR. 45-56
- view
  authority control:
- export record
  dblp key:
  - conf/specom/KipyatkovaKDR24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/specom/KipyatkovaKDR24
Irina S. Kipyatkova, Ildar Kagirov, Mikhail Dolgushin, Alexandra Rodionova:
Towards a Livvi-Karelian End-to-End ASR System. 57-68
- view
  authority control:
- export record
  dblp key:
  - conf/specom/Gupta24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/specom/Gupta24
Vishwa Gupta:
Advances in OpenASR21 Evaluation with Increased Temporal Resolution for Speech Self-supervised Learning Models. 69-81
- view
  authority control:
- export record
  dblp key:
  - conf/specom/KatkovLV24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/specom/KatkovLV24
Sergei Katkov, Antonio Liotta, Alessandro Vietti:
Benchmarking Whisper Under Diverse Audio Transformations and Real-Time Constraints. 82-91
- view
  authority control:
- export record
  dblp key:
  - conf/specom/GunduzKYAFS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/specom/GunduzKYAFS24
Ahmet Gunduz, Yunsu Kim, Kamer Ali Yuksel, Mohamed Al-Badrashiny, Thiago Castro Ferreira, Hassan Sawaf:
AutoMode-ASR: Learning to Select ASR Systems for Better Quality and Cost. 92-103
- view
  authority control:
- export record
  dblp key:
  - conf/specom/TorralboMAP24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/specom/TorralboMAP24
Manuel Torralbo, Ariane Méndez, Maia Agirre, Arantza del Pozo:
Pre-training and Adverse Audio Samples for Data-Efficient Wake Word Detection. 104-118
- view
  authority control:
- export record
  dblp key:
  - conf/specom/KarandeSM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/specom/KarandeSM24
Pranav Karande, Balaram Sarkar, Chandresh Kumar Maurya:
Cross-Lingual Summarization of Speech-to-Speech Translation: A Baseline. 119-133

Speech and Language Resources

- view
  authority control:
- export record
  dblp key:
  - conf/specom/LjubesicRK24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/specom/LjubesicRK24
Nikola Ljubesic, Peter Rupnik, Danijel Korzinek:
The ParlaSpeech Collection of Automatically Generated Speech and Text Datasets from Parliamentary Proceedings. 137-150
- view
  authority control:
- export record
  dblp key:
  - conf/specom/SherstinovaP24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/specom/SherstinovaP24
Tatiana Y. Sherstinova, Irina Petrova:
ESC Corpus of Spoken Russian: Everyday Student Conversations Captured Through Continuous Speech Recording in Natural Communicative Environments. 151-162
- view
  authority control:
- export record
  dblp key:
  - conf/specom/IvankoRAKK24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/specom/IvankoRAKK24
Denis Ivanko, Dmitry Ryumin, Alexandr Axyonov, Alexey M. Kashevnik, Alexey Karpov:
OpenAV: Bilingual Dataset for Audio-Visual Voice Control of a Computer for Hand Disabled People. 163-173
- view
  authority control:
- export record
  dblp key:
  - conf/specom/PopovaP24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/specom/PopovaP24
Velka Popova, Dimitar Popov:
Bulgarian Speech Resources in the CHILDES System. 174-186
- view
  authority control:
- export record
  dblp key:
  - conf/specom/BogdanovaBeglarianBKSP24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/specom/BogdanovaBeglarianBKSP24
Natalia Bogdanova-Beglarian, Olga Blinova, Maria Khokhlova, Tatiana Y. Sherstinova, Tatiana I. Popova:
Multiword Units in Russian Everyday Speech: Empirical Classification and Corpus-Based Studies. 187-200
- view
  authority control:
- export record
  dblp key:
  - conf/specom/PotapovaPKMB24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/specom/PotapovaPKMB24
Rodmonga Potapova, Vsevolod Potapov, Ekaterina Karimova, Leonid Motovskikh, Nikolay Bobrov:
Neurophysiological Correlates of Textual Modulation in Visual Stimuli: An Experimental Study of Russian and English Memes. 201-215

Speech Synthesis and Perception

- view
  authority control:
- export record
  dblp key:
  - conf/specom/NosekSSSPD24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/specom/NosekSSSPD24
Tijana V. Nosek, Sinisa Suzic, Milan Secujski, Vuk Stanojev, Darko Pekar, Vlado Delic:
End-to-End Speech Synthesis for the Serbian Language Based on Tacotron. 219-229
- view
  authority control:
- export record
  dblp key:
  - conf/specom/AlwaisiAN24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/specom/AlwaisiAN24
Shaimaa Alwaisi, Mohammed Salah Al-Radhi, Géza Németh:
ChildTinyTalks (CTT): A Benchmark Dataset and Baseline for Expressive Child Speech Synthesis. 230-240
- view
  authority control:
- export record
  dblp key:
  - conf/specom/BorzykhS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/specom/BorzykhS24
Anna Borzykh, Tatiana Shevchenko:
Multidimensional Rhythm: Comparing Rhythmic Properties of Australian and New Zealand Monologues. 241-250
- view
  authority control:
- export record
  dblp key:
  - conf/specom/AnanevaK24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/specom/AnanevaK24
Anastasia Ananeva, Uliana E. Kochetkova:
Influence of Linguistic and Sociolinguistic Factors on Speech Rate Perception. 251-264
- view
  authority control:
- export record
  dblp key:
  - conf/specom/GusevaMD24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/specom/GusevaMD24
Daria Guseva, Olga Mitrofanova, Mikhail Dolgushin:
Human and Machine Keyphrase Perception in Russian Text and Speech. 265-280
- view
  authority control:
- export record
  dblp key:
  - conf/specom/LyaksoFMNN24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/specom/LyaksoFMNN24
Elena E. Lyakso, Olga V. Frolova, Anton Matveev, Aleksandr Nikolaev, Ruban Nersisson:
Assessment of Children's Ability to Manifest Emotions in Facial Expressions, Voice and Speech by Humans, Automatic, and on a Likert Scale. 281-294

Speech Processing for Medicine

- view
  authority control:
- export record
  dblp key:
  - conf/specom/GosztolyaTSBH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/specom/GosztolyaTSBH24
Gábor Gosztolya, László Tóth, Veronika Svindt, Judit Bóna, Ildikó Hoffmann:
Investigating the Utility of wav2vec 2.0 Hidden Layers for Detecting Multiple Sclerosis. 297-308
- view
  authority control:
- export record
  dblp key:
  - conf/specom/MamontovZKM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/specom/MamontovZKM24
Danila Mamontov, Sebastian Zepf, Alexey Karpov, Wolfgang Minker:
Cross-Cultural Automatic Depression Detection Based on Audio Signals. 309-323
- view
  authority control:
- export record
  dblp key:
  - conf/specom/KumarKP24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/specom/KumarKP24
Lokesh Kumar, Kumar Kaustubh, S. R. Mahadeva Prasanna:
Depression Classification Using Token Merging-Based Speech Spectrotemporal Transformer. 324-335
- view
  authority control:
- export record
  dblp key:
  - conf/specom/IdamkinaC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/specom/IdamkinaC24
Mary Idamkina, Andrea Corradini:
Detecting Depression from Audio Data. 336-351
- view
  authority control:
- export record
  dblp key:
  - conf/specom/AzizS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/specom/AzizS24
Dosti Aziz, Dávid Sztahó:
Binary and Multiclass Classification of Dysphonia Using Whisper Encoder and One-Dimensional Convolutional Neural Network. 352-366
- view
  authority control:
- export record
  dblp key:
  - conf/specom/EgleNTK24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/specom/EgleNTK24
German Egle, Dariya Novokhrestova, Svetlana Tomilina, Evgeny Kostyuchenko:
Approach to Assessing the Quality of Syllable Pronunciation by Patients in the Process of Speech Rehabilitation Based on Comparison with Healthy Speakers. 367-376
- view
  authority control:
- export record
  dblp key:
  - conf/specom/HarnischSH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/specom/HarnischSH24
Philipp L. Harnisch, Daniel Schuhmann, Stefan Hillmann:
A Comparative Study for Contextualized Spoken Answer Classification in German Medical Questionnaires. 377-391

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.