- Tuomo Raitio, Javier Latorre, Andrea Davis, Tuuli Morrill, Ladan Golipour:
Improving the quality of neural TTS using long-form content and multi-speaker multi-style modeling. SSW 2023: 144-149 - Nicholas Sanders, Korin Richmond:
Recovering Discrete Prosody Inputs via Invert-Classify. SSW 2023: 244-245 - Fritz Seebauer, Michael Kuhlmann, Reinhold Haeb-Umbach, Petra Wagner:
Re-examining the quality dimensions of synthetic speech. SSW 2023: 34-40 - Ravi Shankar, Archana Venkataraman:
Adaptive Duration Modification of Speech using Masked Convolutional Networks and Open-Loop Time Warping. SSW 2023: 177-183 - Sajad Shirali-Shahreza, Gerald Penn:
Better Replacement for TTS Naturalness Evaluation. SSW 2023: 197-203 - Atli Thor Sigurgeirsson, Simon King:
Using a Large Language Model to Control Speaking Style for Expressive TTS. SSW 2023: 246-247 - Adriana Stan, Johannah O'Mahony:
An analysis on the effects of speaker embedding choice in non auto-regressive TTS. SSW 2023: 134-138 - Emmett Strickland, Dana Aubakirova, Dorin Doncenco, Diego Torres, Marc Evrard:
NaijaTTS: A pitch-controllable TTS model for Nigerian Pidgin. SSW 2023: 248-249 - Kou Tanaka, Hirokazu Kameoka, Takuhiro Kaneko:
PRVAE-VC: Non-Parallel Many-to-Many Voice Conversion with Perturbation-Resistant Variational Autoencoder. SSW 2023: 88-93 - Biel Tura Vecino, Adam Gabrys, Daniel Matwicki, Andrzej Pomirski, Tom Iddon, Marius Cotescu, Jaime Lorenzo-Trueba:
Lightweight End-to-end Text-to-speech Synthesis for low resource on-device applications. SSW 2023: 225-229 - Siyang Wang, Gustav Eje Henter, Joakim Gustafson, Éva Székely:
On the Use of Self-Supervised Speech Representations in Spontaneous Speech Synthesis. SSW 2023: 163-169 - Takenori Yoshimura, Takato Fujimoto, Keiichiro Oura, Keiichi Tokuda:
SPTK4: An Open-Source Software Toolkit for Speech Signal Processing. SSW 2023: 211-217 - Weicheng Zhang, Cheng-chieh Yeh, Will Beckman, Tuomo Raitio, Ramya Rasipuram, Ladan Golipour, David Winarsky:
Audiobook synthesis with long-form neural text-to-speech. SSW 2023: 139-143 - Gérard Bailly, Thomas Hueber, Damien Lolive, Nicolas Obin, Olivier Perrotin:
12th ISCA Speech Synthesis Workshop, SSW 2023, Grenoble, France, August 26-28, 2023. ISCA 2023 [contents] - 2021
- Jennifer Williams, Jason Fong, Erica Cooper, Junichi Yamagishi:
Exploring Disentanglement with Multilingual and Monolingual VQ-VAE. SSW 2021: 124-129 - Ammar Abbas, Bajibabu Bollepalli, Alexis Moinet, Arnaud Joly, Penny Karanasou, Peter Makarov, Simon Slangen, Sri Karlapati, Thomas Drugman:
Multi-Scale Spectrogram Modelling for Neural Text-to-Speech. SSW 2021: 177-182 - Arun Baby, Pranav Jawale, Saranya Vinnaitherthan, Sumukh Badam, Nagaraj Adiga, Sharath Adavanne:
Non-native English lexicon creation for bilingual speech synthesis. SSW 2021: 154-159 - Erica Cooper, Xin Wang, Junichi Yamagishi:
Text-to-Speech Synthesis Techniques for MIDI-to-Audio Synthesis. SSW 2021: 130-135 - Erica Cooper, Junichi Yamagishi:
How do Voices from Past Speech Synthesis Challenges Compare Today? SSW 2021: 183-188 - Tamás Gábor Csapó, László Tóth, Gábor Gosztolya, Alexandra Markó:
Speech Synthesis from Text and Ultrasound Tongue Image-based Articulatory Input. SSW 2021: 31-36 - Tamás Gábor Csapó:
Extending Text-to-Speech Synthesis with Articulatory Movement Prediction using Ultrasound Tongue Imaging. SSW 2021: 7-12 - Abdelhamid Ezzerg, Adam Gabrys, Bartosz Putrycz, Daniel Korzekwa, Daniel Saez-Trigueros, David McHardy, Kamil Pokora, Jakub Lachowicz, Jaime Lorenzo-Trueba, Viacheslav Klimkov:
Enhancing audio quality for expressive Neural Text-to-Speech. SSW 2021: 78-83 - Jason Fong, Jennifer Williams, Simon King:
Analysing Temporal Sensitivity of VQ-VAE Sub-Phone Codebooks. SSW 2021: 227-231 - Jason Fong, Jilong Wu, Prabhav Agrawal, Andrew Gibiansky, Thilo Köhler, Qing He:
Improving Polyglot Speech Synthesis through Multi-task and Adversarial Learning. SSW 2021: 172-176 - Pilar Oplustil Gallegos, Johannah O'Mahony, Simon King:
Comparing acoustic and textual representations of previous linguistic context for improving Text-to-Speech. SSW 2021: 205-210 - Joakim Gustafson, Jonas Beskow, Éva Székely:
Personality in the mix - investigating the contribution of fillers and speaking style to the perception of spontaneous speech synthesis. SSW 2021: 48-53 - Elijah Gutierrez, Pilar Oplustil Gallegos, Catherine Lai:
Location, Location: Enhancing the Evaluation of Text-to-Speech synthesis using the Rapid Prosody Transcription Paradigm. SSW 2021: 25-30 - Marc Illa, Bence Mark Halpern, Rob van Son, Laureano Moro-Velázquez, Odette Scharenborg:
Pathological voice adaptation with autoencoder-based voice conversion. SSW 2021: 19-24 - Ambika Kirkland, Marcin Wlodarczak, Joakim Gustafson, Éva Székely:
Perception of smiling voice in spontaneous speech synthesis. SSW 2021: 108-112 - Paul Konstantin Krug, Simon Stone, Peter Birkholz:
Intelligibility and naturalness of articulatory synthesis with VocalTractLab compared to established speech synthesis technologies. SSW 2021: 102-107