default search action
Search dblp
Full-text search
- > Home
Please enter a search query
- case-insensitive prefix search: default
e.g., sig matches "SIGIR" as well as "signal" - exact word search: append dollar sign ($) to word
e.g., graph$ matches "graph", but not "graphics" - boolean and: separate words by space
e.g., codd model - boolean or: connect words by pipe symbol (|)
e.g., graph|network
Update May 7, 2017: Please note that we had to disable the phrase search operator (.) and the boolean not operator (-) due to technical problems. For the time being, phrase search queries will yield regular prefix search result, and search terms preceded by a minus will be interpreted as regular (positive) search terms.
Author search results
no matches
Venue search results
no matches
Refine list
refine by author
- no options
- temporarily not available
refine by venue
- no options
- temporarily not available
refine by type
- no options
- temporarily not available
refine by access
- no options
- temporarily not available
refine by year
- no options
- temporarily not available
Publication search results
found 666 matches
- 2023
- Shaimaa Alwaisi, Mohammed Salah Al-Radhi, Géza Németh:
Universal Approach to Multilingual Multispeaker Child Speech SynthesisUniversal Approach to Multilingual Multispeaker Child Speech Synthesis. SSW 2023: 236-237 - Gérard Bailly, Martin Lenglet, Olivier Perrotin, Esther Klabbers:
Advocating for text input in multi-speaker text-to-speech systems. SSW 2023: 1-7 - Haolin Chen, Philip N. Garner:
Diffusion Transformer for Adaptive Text-to-Speech. SSW 2023: 157-162 - Arnab Das, Suhita Ghosh, Tim Polzehl, Ingo Siegert, Sebastian Stober:
StarGAN-VC++: Towards Emotion Preserving Voice Conversion Using Deep Embeddings. SSW 2023: 81-87 - Daria Diatlova, Vitalii Shutov:
EmoSpeech: guiding FastSpeech2 towards Emotional Text to Speech. SSW 2023: 106-112 - Phat Do, Matt Coler, Jelske Dijkstra, Esther Klabbers:
Strategies in Transfer Learning for Low-Resource Speech Synthesis: Phone Mapping, Features Input, and Source Language Selection. SSW 2023: 21-26 - Jarod Duret, Yannick Estève, Titouan Parcollet:
Learning Multilingual Expressive Speech Representation for Prosody Prediction without Parallel Data. SSW 2023: 184-190 - Mikey Elmers, Éva Székely:
The Impact of Pause-Internal Phonetic Particles on Recall in Synthesized Lectures. SSW 2023: 204-210 - Lev Finkelstein, Joshua Camp, Rob Clark:
Importance of Human Factors in Text-To-Speech Evaluations. SSW 2023: 27-33 - Lev Finkelstein, Chun-an Chan, Vincent Wan, Heiga Zen, Rob Clark:
FiPPiE: A Computationally Efficient Differentiable method for Estimating Fundamental Frequency From Spectrograms. SSW 2023: 218-224 - Seraphina Fong, Marco Matassoni, Gianluca Esposito, Alessio Brutti:
Towards Speaker-Independent Voice Conversion for Improving Dysarthric Speech Intelligibility. SSW 2023: 238-239 - Jason Fong, Hao Tang, Simon King:
Spell4TTS: Acoustically-informed spellings for improving text-to-speech pronunciations. SSW 2023: 8-13 - David Guennec, Lily Wadoux, Aghilas Sini, Nelly Barbot, Damien Lolive:
Voice Cloning: Training Speaker Selection with Limited Multi-Speaker Corpus. SSW 2023: 170-176 - Ryunosuke Hirai, Yuki Saito, Hiroshi Saruwatari:
Federated Learning for Human-in-the-Loop Many-to-Many Voice Conversion. SSW 2023: 94-99 - Ibrahim Ibrahimov, Gábor Gosztolya, Tamás Gábor Csapó:
Data Augmentation Methods on Ultrasound Tongue Images for Articulation-to-Speech Synthesis. SSW 2023: 230-235 - Maxime Jacquelin, Maeva Garnier, Laurent Girin, Rémy Vincent, Olivier Perrotin:
Exploring the multidimensional representation of individual speech acoustic parameters extracted by deep unsupervised models. SSW 2023: 240-241 - Arnaud Joly, Marco Nicolis, Ekaterina Peterova, Alessandro Lombardi, Ammar Abbas, Arent van Korlaar, Aman Hussain, Parul Sharma, Alexis Moinet, Mateusz Lajszczak, Penny Karanasou, Antonio Bonafonte, Thomas Drugman, Elena Sokolova:
Controllable Emphasis with zero data for text-to-speech. SSW 2023: 113-119 - Sofoklis Kakouros, Juraj Simko, Martti Vainio, Antti Suni:
Investigating the Utility of Surprisal from Large Language Models for Speech Synthesis Prosody. SSW 2023: 127-133 - Anton Kashkin, Ivan Karpukhin, Svyatoslav Shishkin:
HiFi-VC: High Quality ASR-based Voice Conversion. SSW 2023: 100-105 - Ambika Kirkland, Shivam Mehta, Harm Lameris, Gustav Eje Henter, Éva Székely, Joakim Gustafson:
Stuck in the MOS pit: A critical analysis of MOS test methodology in TTS evaluation. SSW 2023: 41-47 - Kishor Kayyar Lakshminarayana, Christian Dittmar, Nicola Pia, Emanuël A. P. Habets:
Subjective Evaluation of Text-to-Speech Models: Comparing Absolute Category Rating and Ranking by Elimination Tests. SSW 2023: 191-196 - Harm Lameris, Ambika Kirkland, Joakim Gustafson, Éva Székely:
Situating Speech Synthesis: Investigating Contextual Factors in the Evaluation of Conversational TTS. SSW 2023: 69-74 - Martin Lenglet, Olivier Perrotin, Gérard Bailly:
Local Style Tokens: Fine-Grained Prosodic Representations For TTS Expressive Control. SSW 2023: 120-126 - Zhu Li, Xiyuan Gao, Shekhar Nayak, Matt Coler:
SarcasticSpeech: Speech Synthesis for Sarcasm in Low-Resource Scenarios. SSW 2023: 242-243 - Johannes A. Louw:
Cross-lingual transfer using phonological features for resource-scarce text-to-speech. SSW 2023: 55-61 - Yuta Matsunaga, Takaaki Saeki, Shinnosuke Takamichi, Hiroshi Saruwatari:
Improving robustness of spontaneous speech synthesis with linguistic speech regularization and pseudo-filled-pause insertion. SSW 2023: 62-68 - Shivam Mehta, Siyang Wang, Simon Alexanderson, Jonas Beskow, Éva Székely, Gustav Eje Henter:
Diff-TTSG: Denoising probabilistic integrated speech and gesture synthesis. SSW 2023: 150-156 - Marcel Granero Moya, Penny Karanasou, Sri Karlapati, Bastian Schnell, Nicole Peinelt, Alexis Moinet, Thomas Drugman:
A Comparative Analysis of Pretrained Language Models for Text-to-Speech. SSW 2023: 14-20 - Johannah O'Mahony, Catherine Lai, Simon King:
Synthesising turn-taking cues using natural conversational data. SSW 2023: 75-80 - Ondrej Plátek, Ondrej Dusek:
MooseNet: A Trainable Metric for Synthesized Speech with a PLDA Module. SSW 2023: 48-54
skipping 636 more matches
loading more results
failed to load more results, please try again later
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
retrieved on 2024-12-29 15:47 CET from data curated by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint