default search action

combined dblp search
author search
venue search
publication search

ask others

27th TSD 2024: Brno, Czech Republic - Part II

> Home > Conferences and Workshops > TSD

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

- view
  authority control:
- export record
  dblp key:
  - conf/tsd/2024-2
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/tsd/2024-2
Elmar Nöth, Ales Horák, Petr Sojka:
Text, Speech, and Dialogue - 27th International Conference, TSD 2024, Brno, Czech Republic, September 9-13, 2024, Proceedings, Part II. Lecture Notes in Computer Science 15049, Springer 2024, ISBN 978-3-031-70565-6

Speech

- view
  authority control:
- export record
  dblp key:
  - conf/tsd/SrinivasaganG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/tsd/SrinivasaganG24
Gokul Srinivasagan, Munir Georges:
Retrieval Augmented Spoken Language Generation for Transport Domain. 3-12
- view
  authority control:
- export record
  dblp key:
  - conf/tsd/AllerF24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/tsd/AllerF24
Sven Aller, Mark Fishel:
Adapting Audiovisual Speech Synthesis to Estonian. 13-23
- view
  authority control:
- export record
  dblp key:
  - conf/tsd/AzizS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/tsd/AzizS24
Dosti Aziz, Dávid Sztahó:
Dysphonia Diagnosis Using Self-supervised Speech Models in Mono and Cross-Lingual Settings. 24-35
- view
  authority control:
- export record
  dblp key:
  - conf/tsd/TihelkaMHV24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/tsd/TihelkaMHV24
Daniel Tihelka, Jindrich Matousek, Zdenek Hanzlícek, Lukás Vladar:
Sentences vs Phrases in Neural Speech Synthesis. 36-45
- view
  authority control:
- export record
  dblp key:
  - conf/tsd/LeheckaHMT24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/tsd/LeheckaHMT24
Jan Lehecka, Zdenek Hanzlícek, Jindrich Matousek, Daniel Tihelka:
Zero-Shot vs. Few-Shot Multi-speaker TTS Using Pre-trained Czech SpeechT5 Model. 46-57
- view
  authority control:
- export record
  dblp key:
  - conf/tsd/AbedS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/tsd/AbedS24
Mohammed Hamzah Abed, Dávid Sztahó:
Deep Speaker Embeddings for Speaker Verification of Children. 58-69
- view
  authority control:
- export record
  dblp key:
  - conf/tsd/YuenYC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/tsd/YuenYC24
Kwok Chin Yuen, Jia Qi Yip, Eng Siong Chng:
Improved Alignment for Score Combination of RNN-T and CTC Decoder for Online Decoding. 70-80
- view
  authority control:
- export record
  dblp key:
  - conf/tsd/ShamsC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/tsd/ShamsC24
Erfan A. Shams, Julie Carson-Berndsen:
Attention to Phonetics: A Visually Informed Explanation of Speech Transformers. 81-93
- view
  authority control:
- export record
  dblp key:
  - conf/tsd/VladarM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/tsd/VladarM24
Lukás Vladar, Jindrich Matousek:
Effects of Training Strategies and the Amount of Speech Data on the Quality of Speech Synthesis. 94-104
- view
  authority control:
- export record
  dblp key:
  - conf/tsd/MorenoAcevedoVMA24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/tsd/MorenoAcevedoVMA24
Santiago Andres Moreno-Acevedo, Juan Camilo Vásquez-Correa, Juan M. Martín-Doñas, Aitor Álvarez:
Stream-based Active Learning for Speech Emotion Recognition via Hybrid Data Selection and Continuous Learning. 105-117
- view
  authority control:
- export record
  dblp key:
  - conf/tsd/Hanzlicek24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/tsd/Hanzlicek24
Zdenek Hanzlícek:
Data Alignment and Duration Modelling in VITS. 118-129
- view
  authority control:
- export record
  dblp key:
  - conf/tsd/Manfredi24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/tsd/Manfredi24
Ilaria Manfredi:
Multiword Expressions Resources for Italian: Presenting a Manually Annotated Spoken Corpus. 130-138
- view
  authority control:
- export record
  dblp key:
  - conf/tsd/PortesH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/tsd/PortesH24
David Portes, Ales Horák:
Generating High-Quality F0 Embeddings Using the Vector-Quantized Variational Autoencoder. 139-148
- view
  authority control:
- export record
  dblp key:
  - conf/tsd/HernandezPAVYOM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/tsd/HernandezPAVYOM24
Abner Hernandez, Paula Andrea Pérez-Toro, Tomás Arias-Vergara, Juan Camilo Vásquez-Correa, Seung Hee Yang, Juan Rafael Orozco-Arroyave, Andreas K. Maier:
Anonymizing Dysarthric Speech: Investigating the Effects of Voice Conversion on Pathological Information Preservation. 149-160
- view
  authority control:
- export record
  dblp key:
  - conf/tsd/BRR24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/tsd/BRR24
Mala J. B., S. M. Alex Raj, Rajeev Rajan:
X-Vector-Based Speaker Diarization Using Bi-LSTM and Interim Voting-Driven Post-processing. 161-173
- view
  authority control:
- export record
  dblp key:
  - conf/tsd/RouxRWD24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/tsd/RouxRWD24
Thibault Bañeras Roux, Mickael Rouvier, Jane Wottawa, Richard Dufour:
A Paradigm for Interpreting Metrics and Measuring Error Severity in Automatic Speech Recognition. 174-183
- view
  authority control:
- export record
  dblp key:
  - conf/tsd/JakubecJLKS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/tsd/JakubecJLKS24
Maros Jakubec, Roman Jarina, Eva Lieskovska, Peter Kasak, Michal Spisiak:
Enhancing Speech Emotion Recognition Using Transfer Learning from Speaker Embeddings. 184-195

Dialogue

- view
  authority control:
- export record
  dblp key:
  - conf/tsd/DruartVE24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/tsd/DruartVE24
Lucas Druart, Valentin Vielzeuf, Yannick Estève:
Investigating Low-Cost LLM Annotation for Spoken Dialogue Understanding Datasets. 199-209
- view
  authority control:
- export record
  dblp key:
  - conf/tsd/WongC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/tsd/WongC24
Kwan Yeung Wong, Korris Fu-Lai Chung:
PiCo-VITS: Leveraging Pitch Contours for Fine-Grained Emotional Speech Synthesis. 210-221
- view
  authority control:
- export record
  dblp key:
  - conf/tsd/OrtegaSV24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/tsd/OrtegaSV24
Daniel Ortega, Steven Söhnel, Ngoc Thang Vu:
Improving and Understanding Clarifying Question Generation in Conversational Search. 222-235
- view
  authority control:
- export record
  dblp key:
  - conf/tsd/Altinok24a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/tsd/Altinok24a
Duygu Altinok:
Explainable Multimodal Fusion for Dementia Detection From Text and Speech. 236-251
- view
  authority control:
- export record
  dblp key:
  - conf/tsd/LopezSantanderRBNO24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/tsd/LopezSantanderRBNO24
Diego Alexander Lopez-Santander, Cristian David Ríos-Urrego, Christian Bergler, Elmar Nöth, Juan Rafael Orozco-Arroyave:
Robust Classification of Parkinson's Speech: an Approximation to a Scenario With Non-controlled Acoustic Conditions. 252-262
- view
  authority control:
- export record
  dblp key:
  - conf/tsd/SotolarPS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/tsd/SotolarPS24
Ondrej Sotolár, Jaromír Plhák, David Smahel:
Leveraging Conceptual Similarities to Enhance Modeling of Factors Affecting Adolescents' Well-Being. 263-274
- view
  authority control:
- export record
  dblp key:
  - conf/tsd/KumarG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/tsd/KumarG24
Ankit Kumar, Munir Georges:
Joint-Average Mean and Variance Feature Matching (JAMVFM) Semi-supervised GAN with Additional-Objective Training Function for Intent Detection. 275-287
- view
  authority control:
- export record
  dblp key:
  - conf/tsd/KleerWFB24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/tsd/KleerWFB24
Niko Kleer, Leon Weyand, Michael Feld, Klaus Berberich:
Capturing Task-Related Information for Text-Based Grasp Classification Using Fine-Tuned Embeddings. 288-299
- view
  authority control:
- export record
  dblp key:
  - conf/tsd/WolterKF24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/tsd/WolterKF24
Julian Wolter, Niko Kleer, Michael Feld:
StepDP: A Step Towards Expressive and Pervasive Dialogue Platforms. 300-312
- view
  authority control:
- export record
  dblp key:
  - conf/tsd/GalloAristizabalERNO24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/tsd/GalloAristizabalERNO24
Jeferson David Gallo-Aristizábal, Daniel Escobar-Grisales, Cristian David Ríos-Urrego, Elmar Nöth, Juan Rafael Orozco-Arroyave:
Automatic Classification of Parkinson's Disease Using Wav2vec Embeddings at Phoneme, Syllable, and Word Levels. 313-323

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.