default search action
Manuel Sam Ribeiro
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2023
- [c22]Guangyan Zhang, Thomas Merritt, Manuel Sam Ribeiro, Biel Tura Vecino, Kayoko Yanagisawa, Kamil Pokora, Abdelhamid Ezzerg, Sebastian Cygert, Ammar Abbas, Piotr Bilinski, Roberto Barra-Chicote, Daniel Korzekwa, Jaime Lorenzo-Trueba:
Comparing normalizing flows and diffusion models for prosody and acoustic modelling in text-to-speech. INTERSPEECH 2023: 27-31 - [c21]Giulia Comini, Manuel Sam Ribeiro, Fan Yang, Heereen Shim, Jaime Lorenzo-Trueba:
Multilingual context-based pronunciation learning for Text-to-Speech. INTERSPEECH 2023: 631-635 - [c20]Manuel Sam Ribeiro, Giulia Comini, Jaime Lorenzo-Trueba:
Improving grapheme-to-phoneme conversion by learning pronunciations from speech recordings. INTERSPEECH 2023: 999-1003 - [i15]Manuel Sam Ribeiro, Giulia Comini, Jaime Lorenzo-Trueba:
Improving grapheme-to-phoneme conversion by learning pronunciations from speech recordings. CoRR abs/2307.16643 (2023) - [i14]Guangyan Zhang, Thomas Merritt, Manuel Sam Ribeiro, Biel Tura Vecino, Kayoko Yanagisawa, Kamil Pokora, Abdelhamid Ezzerg, Sebastian Cygert, Ammar Abbas, Piotr Bilinski, Roberto Barra-Chicote, Daniel Korzekwa, Jaime Lorenzo-Trueba:
Comparing normalizing flows and diffusion models for prosody and acoustic modelling in text-to-speech. CoRR abs/2307.16679 (2023) - [i13]Giulia Comini, Manuel Sam Ribeiro, Fan Yang, Heereen Shim, Jaime Lorenzo-Trueba:
Multilingual context-based pronunciation learning for Text-to-Speech. CoRR abs/2307.16709 (2023) - 2022
- [c19]Manuel Sam Ribeiro, Julian Roth, Giulia Comini, Goeric Huybrechts, Adam Gabrys, Jaime Lorenzo-Trueba:
Cross-Speaker Style Transfer for Text-to-Speech Using Data Augmentation. ICASSP 2022: 6797-6801 - [c18]Adam Gabrys, Goeric Huybrechts, Manuel Sam Ribeiro, Chung-Ming Chien, Julian Roth, Giulia Comini, Roberto Barra-Chicote, Bartek Perz, Jaime Lorenzo-Trueba:
Voice Filter: Few-Shot Text-to-Speech Speaker Adaptation Using Voice Conversion as a Post-Processing Module. ICASSP 2022: 7902-7906 - [c17]Cassia Valentini-Botinhao, Manuel Sam Ribeiro, Oliver Watts, Korin Richmond, Gustav Eje Henter:
Predicting pairwise preferences between TTS audio stimuli using parallel ratings data and anti-symmetric twin neural networks. INTERSPEECH 2022: 471-475 - [c16]Giulia Comini, Goeric Huybrechts, Manuel Sam Ribeiro, Adam Gabrys, Jaime Lorenzo-Trueba:
Low-data? No problem: low-resource, language-agnostic conversational text-to-speech via F0-conditioned data augmentation. INTERSPEECH 2022: 1946-1950 - [i12]Manuel Sam Ribeiro, Julian Roth, Giulia Comini, Goeric Huybrechts, Adam Gabrys, Jaime Lorenzo-Trueba:
Cross-speaker style transfer for text-to-speech using data augmentation. CoRR abs/2202.05083 (2022) - [i11]Adam Gabrys, Goeric Huybrechts, Manuel Sam Ribeiro, Chung-Ming Chien, Julian Roth, Giulia Comini, Roberto Barra-Chicote, Bartek Perz, Jaime Lorenzo-Trueba:
Voice Filter: Few-shot text-to-speech speaker adaptation using voice conversion as a post-processing module. CoRR abs/2202.08164 (2022) - [i10]Giulia Comini, Goeric Huybrechts, Manuel Sam Ribeiro, Adam Gabrys, Jaime Lorenzo-Trueba:
Low-data? No problem: low-resource, language-agnostic conversational text-to-speech via F0-conditioned data augmentation. CoRR abs/2207.14607 (2022) - [i9]Cassia Valentini-Botinhao, Manuel Sam Ribeiro, Oliver Watts, Korin Richmond, Gustav Eje Henter:
Predicting pairwise preferences between TTS audio stimuli using parallel ratings data and anti-symmetric twin neural networks. CoRR abs/2209.11003 (2022) - 2021
- [j2]Manuel Sam Ribeiro, Joanne Cleland, Aciel Eshky, Korin Richmond, Steve Renals:
Exploiting ultrasound tongue imaging for the automatic detection of speech articulation errors. Speech Commun. 128: 24-34 (2021) - [j1]Aciel Eshky, Joanne Cleland, Manuel Sam Ribeiro, Eleanor Sugden, Korin Richmond, Steve Renals:
Automatic audiovisual synchronisation for ultrasound tongue imaging. Speech Commun. 132: 83-95 (2021) - [c15]Manuel Sam Ribeiro, Aciel Eshky, Korin Richmond, Steve Renals:
Silent versus Modal Multi-Speaker Speech Recognition from Ultrasound and Video. Interspeech 2021: 641-645 - [c14]Manuel Sam Ribeiro, Jennifer Sanger, Jing-Xuan Zhang, Aciel Eshky, Alan Wrench, Korin Richmond, Steve Renals:
Tal: A Synchronised Multi-Speaker Corpus of Ultrasound Tongue Imaging, Audio, and Lip Videos. SLT 2021: 1109-1116 - [i8]Manuel Sam Ribeiro, Joanne Cleland, Aciel Eshky, Korin Richmond, Steve Renals:
Exploiting ultrasound tongue imaging for the automatic detection of speech articulation errors. CoRR abs/2103.00324 (2021) - [i7]Manuel Sam Ribeiro, Aciel Eshky, Korin Richmond, Steve Renals:
Silent versus modal multi-speaker speech recognition from ultrasound and video. CoRR abs/2103.00333 (2021) - [i6]Aciel Eshky, Joanne Cleland, Manuel Sam Ribeiro, Eleanor Sugden, Korin Richmond, Steve Renals:
Automatic audiovisual synchronisation for ultrasound tongue imaging. CoRR abs/2105.15162 (2021) - 2020
- [i5]Manuel Sam Ribeiro, Jennifer Sanger, Jing-Xuan Zhang, Aciel Eshky, Alan Wrench, Korin Richmond, Steve Renals:
TaL: a synchronised multi-speaker corpus of ultrasound tongue imaging, audio, and lip videos. CoRR abs/2011.09804 (2020)
2010 – 2019
- 2019
- [c13]Manuel Sam Ribeiro, Aciel Eshky, Korin Richmond, Steve Renals:
Speaker-independent Classification of Phonetic Segments from Raw Ultrasound in Child Speech. ICASSP 2019: 1328-1332 - [c12]Manuel Sam Ribeiro, Aciel Eshky, Korin Richmond, Steve Renals:
Ultrasound Tongue Imaging for Diarization and Alignment of Child Speech Therapy Sessions. INTERSPEECH 2019: 16-20 - [c11]Aciel Eshky, Manuel Sam Ribeiro, Korin Richmond, Steve Renals:
Synchronising Audio and Ultrasound by Learning Cross-Modal Embeddings. INTERSPEECH 2019: 4100-4104 - [i4]Aciel Eshky, Manuel Sam Ribeiro, Korin Richmond, Steve Renals:
Synchronising audio and ultrasound by learning cross-modal embeddings. CoRR abs/1907.00758 (2019) - [i3]Manuel Sam Ribeiro, Aciel Eshky, Korin Richmond, Steve Renals:
Ultrasound tongue imaging for diarization and alignment of child speech therapy sessions. CoRR abs/1907.00818 (2019) - [i2]Aciel Eshky, Manuel Sam Ribeiro, Joanne Cleland, Korin Richmond, Zoe Roxburgh, James M. Scobbie, Alan Wrench:
UltraSuite: A Repository of Ultrasound and Acoustic Data from Child Speech Therapy Sessions. CoRR abs/1907.00835 (2019) - [i1]Manuel Sam Ribeiro, Aciel Eshky, Korin Richmond, Steve Renals:
Speaker-independent classification of phonetic segments from raw ultrasound in child speech. CoRR abs/1907.01413 (2019) - 2018
- [c10]Felipe Espic, Avashna Govender, Manuel Sam Ribeiro, Cassia Valentini-Botinhao, Oliver Watts:
The CSTR entry to the 2018 Blizzard Challenge. Blizzard Challenge 2018 - [c9]Aciel Eshky, Manuel Sam Ribeiro, Joanne Cleland, Korin Richmond, Zoe Roxburgh, James M. Scobbie, Alan Wrench:
UltraSuite: A Repository of Ultrasound and Acoustic Data from Child Speech Therapy Sessions. INTERSPEECH 2018: 1888-1892 - 2017
- [c8]Srikanth Ronanki, Manuel Sam Ribeiro, Felipe Espic, Oliver Watts:
The CSTR entry to the Blizzard Challenge 2017. Blizzard Challenge 2017 - [c7]Manuel Sam Ribeiro, Oliver Watts, Junichi Yamagishi:
Learning Word Vector Representations Based on Acoustic Counts. INTERSPEECH 2017: 799-803 - 2016
- [c6]Manuel Sam Ribeiro, Oliver Watts, Junichi Yamagishi, Robert A. J. Clark:
Wavelet-based decomposition of F0 as a secondary task for DNN-based speech synthesis with multi-task learning. ICASSP 2016: 5525-5529 - [c5]Jean-Philippe Goldman, Pierre-Edouard Honnet, Robert A. J. Clark, Philip N. Garner, Maria Ivanova, Alexandros Lazaridis, Hui Liang, Tiago Macedo, Beat Pfister, Manuel Sam Ribeiro, Eric Wehrli, Junichi Yamagishi:
The SIWIS Database: A Multilingual Speech Database with Acted Emphasis. INTERSPEECH 2016: 1532-1535 - [c4]Manuel Sam Ribeiro, Oliver Watts, Junichi Yamagishi:
Syllable-Level Representations of Suprasegmental Features for DNN-Based Text-to-Speech Synthesis. INTERSPEECH 2016: 3186-3190 - [c3]Manuel Sam Ribeiro, Oliver Watts, Junichi Yamagishi:
Parallel and cascaded deep neural networks for text-to-speech synthesis. SSW 2016: 100-105 - 2015
- [c2]Manuel Sam Ribeiro, Robert A. J. Clark:
A multi-level representation of f0 using the continuous wavelet transform and the Discrete Cosine Transform. ICASSP 2015: 4909-4913 - [c1]Manuel Sam Ribeiro, Junichi Yamagishi, Robert A. J. Clark:
A perceptual investigation of wavelet-based decomposition of f0 for text-to-speech synthesis. INTERSPEECH 2015: 1586-1590
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-09-26 01:57 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint