default search action
Thomas Hueber
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c41]Yanis Ouakrim, Hannah Bull, Michèle Gouiffès, Denis Beautemps, Thomas Hueber, Annelies Braffort:
Mediapi-RGB: Enabling Technological Breakthroughs in French Sign Language (LSF) Research Through an Extensive Video-Text Corpus. VISIGRAPP (2): VISAPP 2024: 139-148 - [i13]Ihab Asaad, Maxime Jacquelin, Olivier Perrotin, Laurent Girin, Thomas Hueber:
Fill in the Gap! Combining Self-supervised Representation Learning with Neural Audio Synthesis for Speech Inpainting. CoRR abs/2405.20101 (2024) - [i12]Angelo Ortiz Tandazo, Thomas Schatz, Thomas Hueber, Emmanuel Dupoux:
Simulating Articulatory Trajectories with Phonological Feature Interpolation. CoRR abs/2408.04363 (2024) - 2023
- [c40]Sanjana Sankar, Denis Beautemps, Frédéric Elisei, Olivier Perrotin, Thomas Hueber:
Investigating the dynamics of hand and lips in French Cued Speech using attention mechanisms and CTC-based decoding. INTERSPEECH 2023: 4978-4982 - [e2]Gérard Bailly, Thomas Hueber, Damien Lolive, Nicolas Obin, Olivier Perrotin:
12th ISCA Speech Synthesis Workshop, SSW 2023, Grenoble, France, August 26-28, 2023. ISCA 2023 [contents] - [i11]Sanjana Sankar, Denis Beautemps, Frédéric Elisei, Olivier Perrotin, Thomas Hueber:
Investigating the dynamics of hand and lips in French Cued Speech using attention mechanisms and CTC-based decoding. CoRR abs/2306.08290 (2023) - 2022
- [c39]Marc-Antoine Georges, Julien Diard, Laurent Girin, Jean-Luc Schwartz, Thomas Hueber:
Repeat after Me: Self-Supervised Learning of Acoustic-to-Articulatory Mapping by Vocal Imitation. ICASSP 2022: 8252-8256 - [c38]Sanjana Sankar, Denis Beautemps, Thomas Hueber:
Multistream Neural Architectures for Cued Speech Recognition Using a Pre-Trained Visual Feature Extractor and Constrained CTC Decoding. ICASSP 2022: 8477-8481 - [c37]Marc-Antoine Georges, Jean-Luc Schwartz, Thomas Hueber:
Self-supervised speech unit discovery from articulatory and acoustic features using VQ-VAE. INTERSPEECH 2022: 774-778 - [c36]Brooke Stephenson, Laurent Besacier, Laurent Girin, Thomas Hueber:
BERT, can HE predict contrastive focus? Predicting and controlling prominence in neural TTS using a language model. INTERSPEECH 2022: 3383-3387 - [i10]Marc-Antoine Georges, Julien Diard, Laurent Girin, Jean-Luc Schwartz, Thomas Hueber:
Repeat after me: Self-supervised learning of acoustic-to-articulatory mapping by vocal imitation. CoRR abs/2204.02269 (2022) - [i9]Sanjana Sankar, Denis Beautemps, Thomas Hueber:
Multistream neural architectures for cued-speech recognition using a pre-trained visual feature extractor and constrained CTC decoding. CoRR abs/2204.04965 (2022) - [i8]Marc-Antoine Georges, Jean-Luc Schwartz, Thomas Hueber:
Self-supervised speech unit discovery from articulatory and acoustic features using VQ-VAE. CoRR abs/2206.08790 (2022) - [i7]Brooke Stephenson, Laurent Besacier, Laurent Girin, Thomas Hueber:
BERT, can HE predict contrastive focus? Predicting and controlling prominence in neural TTS using a language model. CoRR abs/2207.01718 (2022) - 2021
- [j13]Laurent Girin, Simon Leglaive, Xiaoyu Bie, Julien Diard, Thomas Hueber, Xavier Alameda-Pineda:
Dynamical Variational Autoencoders: A Comprehensive Review. Found. Trends Mach. Learn. 15(1-2): 1-175 (2021) - [j12]Fanny Roche, Thomas Hueber, Maëva Garnier, Samuel Limier, Laurent Girin:
Make That Sound More Metallic: Towards a Perceptually Relevant Control of the Timbre of Synthesizer Sounds Using a Variational Autoencoder. Trans. Int. Soc. Music. Inf. Retr. 4(1): 52-66 (2021) - [c35]Olivier Perrotin, Hussein El Amouri, Gérard Bailly, Thomas Hueber:
Evaluating the Extrapolation Capabilities of Neural Vocoders to Extreme Pitch Values. Interspeech 2021: 11-15 - [c34]Xiaoyu Bie, Laurent Girin, Simon Leglaive, Thomas Hueber, Xavier Alameda-Pineda:
A Benchmark of Dynamical Variational Autoencoders Applied to Speech Spectrogram Modeling. Interspeech 2021: 46-50 - [c33]Marc-Antoine Georges, Laurent Girin, Jean-Luc Schwartz, Thomas Hueber:
Learning Robust Speech Representation with an Articulatory-Regularized Variational Autoencoder. Interspeech 2021: 3345-3349 - [c32]Brooke Stephenson, Thomas Hueber, Laurent Girin, Laurent Besacier:
Alternate Endings: Improving Prosody for Incremental Neural TTS with Predicted Future Text Input. Interspeech 2021: 3865-3869 - [i6]Brooke Stephenson, Thomas Hueber, Laurent Girin, Laurent Besacier:
Alternate Endings: Improving Prosody for Incremental Neural TTS with Predicted Future Text Input. CoRR abs/2102.09914 (2021) - [i5]Marc-Antoine Georges, Laurent Girin, Jean-Luc Schwartz, Thomas Hueber:
Learning robust speech representation with an articulatory-regularized variational autoencoder. CoRR abs/2104.03204 (2021) - [i4]Xiaoyu Bie, Laurent Girin, Simon Leglaive, Thomas Hueber, Xavier Alameda-Pineda:
A Benchmark of Dynamical Variational Autoencoders applied to Speech Spectrogram Modeling. CoRR abs/2106.06500 (2021) - 2020
- [j11]Thomas Hueber, Eric Tatulli, Laurent Girin, Jean-Luc Schwartz:
Evaluating the Potential Gain of Auditory and Audiovisual Speech-Predictive Coding Using Deep Learning. Neural Comput. 32(3): 596-625 (2020) - [c31]Brooke Stephenson, Laurent Besacier, Laurent Girin, Thomas Hueber:
What the Future Brings: Investigating the Impact of Lookahead for Incremental Neural TTS. INTERSPEECH 2020: 215-219 - [i3]Laurent Girin, Simon Leglaive, Xiaoyu Bie, Julien Diard, Thomas Hueber, Xavier Alameda-Pineda:
Dynamical Variational Autoencoders: A Comprehensive Review. CoRR abs/2008.12595 (2020) - [i2]Brooke Stephenson, Laurent Besacier, Laurent Girin, Thomas Hueber:
What the Future Brings: Investigating the Impact of Lookahead for Incremental Neural TTS. CoRR abs/2009.02035 (2020)
2010 – 2019
- 2018
- [c30]Li Liu, Thomas Hueber, Gang Feng, Denis Beautemps:
Visual Recognition of Continuous Cued Speech Using a Tandem CNN-HMM Approach. INTERSPEECH 2018: 2643-2647 - [i1]Fanny Roche, Thomas Hueber, Samuel Limier, Laurent Girin:
Autoencoders for music sound synthesis: a comparison of linear, shallow, deep and variational models. CoRR abs/1806.04096 (2018) - 2017
- [j10]Avril Treille, Coriandre Vilain, Thomas Hueber, Laurent Lamalle, Marc Sato:
Inside Speech: Multisensory and Modality-specific Processing of Tongue and Lip Speech Actions. J. Cogn. Neurosci. 29(3): 448-466 (2017) - [j9]Diandra Fabre, Thomas Hueber, Laurent Girin, Xavier Alameda-Pineda, Pierre Badin:
Automatic animation of an articulatory tongue model from ultrasound images of the vocal tract. Speech Commun. 93: 63-75 (2017) - [j8]Laurent Girin, Thomas Hueber, Xavier Alameda-Pineda:
Extending the Cascaded Gaussian Mixture Regression Framework for Cross-Speaker Acoustic-Articulatory Mapping. IEEE ACM Trans. Audio Speech Lang. Process. 25(3): 662-673 (2017) - [j7]Tanja Schultz, Thomas Hueber, Dean J. Krusienski, Jonathan S. Brumberg:
Introduction to the Special Issue on Biosignal-Based Spoken Communication. IEEE ACM Trans. Audio Speech Lang. Process. 25(12): 2254-2256 (2017) - [j6]Tanja Schultz, Michael Wand, Thomas Hueber, Dean J. Krusienski, Christian Herff, Jonathan S. Brumberg:
Biosignal-Based Spoken Communication: A Survey. IEEE ACM Trans. Audio Speech Lang. Process. 25(12): 2257-2271 (2017) - [c29]Laurent Girin, Thomas Hueber, Xavier Alameda-Pineda:
Adaptation of a Gaussian Mixture Regressor to a New Input Distribution: Extending the C-GMR Framework. LVA/ICA 2017: 459-468 - [c28]Eric Tatulli, Thomas Hueber:
Feature extraction using multimodal convolutional neural networks for visual speech recognition. ICASSP 2017: 2971-2975 - 2016
- [j5]Thomas Hueber, Gérard Bailly:
Statistical conversion of silent articulation into audible speech using full-covariance HMM. Comput. Speech Lang. 36: 274-293 (2016) - [j4]Florent Bocquelet, Thomas Hueber, Laurent Girin, Christophe Savariaux, Blaise Yvert:
Real-Time Control of an Articulatory-Based Speech Synthesizer for Brain Computer Interfaces. PLoS Comput. Biol. 12(11) (2016) - [c27]Maël Pouget, Olha Nahorna, Thomas Hueber, Gérard Bailly:
Adaptive Latency for Part-of-Speech Tagging in Incremental Text-to-Speech Synthesis. INTERSPEECH 2016: 2846-2850 - 2015
- [j3]Thomas Hueber, Laurent Girin, Xavier Alameda-Pineda, Gérard Bailly:
Speaker-Adaptive Acoustic-Articulatory Inversion Using Cascaded Gaussian Mixture Regression. IEEE ACM Trans. Audio Speech Lang. Process. 23(12): 2246-2259 (2015) - [c26]Maël Pouget, Thomas Hueber, Gérard Bailly, Timo Baumann:
HMM training strategy for incremental speech synthesis. INTERSPEECH 2015: 1201-1205 - [c25]Florent Bocquelet, Thomas Hueber, Laurent Girin, Christophe Savariaux, Blaise Yvert:
Real-time control of a DNN-based articulatory synthesizer for silent speech conversion: a pilot study. INTERSPEECH 2015: 2405-2409 - [c24]Diandra Fabre, Thomas Hueber, Florent Bocquelet, Pierre Badin:
Tongue tracking in ultrasound images using eigentongue decomposition and artificial neural networks. INTERSPEECH 2015: 2410-2414 - 2014
- [c23]Florent Bocquelet, Thomas Hueber, Laurent Girin, Pierre Badin, Blaise Yvert:
Robust articulatory speech synthesis using deep neural networks for BCI applications. INTERSPEECH 2014: 2288-2292 - [c22]Diandra Fabre, Thomas Hueber, Pierre Badin:
Automatic animation of an articulatory tongue model from ultrasound images using Gaussian mixture regression. INTERSPEECH 2014: 2293-2297 - 2013
- [c21]Adela Barbulescu, Thomas Hueber, Gérard Bailly, Rémi Ronfard:
Audio-visual speaker conversion using prosody features. AVSP 2013: 11-16 - [c20]Avril Treille, Coriandre Vilain, Thomas Hueber, Jean-Luc Schwartz, Laurent Lamalle, Marc Sato:
The sight of your tongue: neural correlates of audio-lingual speech perception. AVSP 2013: 157-162 - [c19]Jun Cai, Thomas Hueber, Sotiris Manitsaris, Pierre Roussel, Lise Crevier-Buchman, Maureen Stone, Claire Pillot-Loiseau, Gérard Chollet, Gérard Dreyfus, Bruce Denby:
Vocal tract imaging system for post-laryngectomy voice replacement. I2MTC 2013: 676-680 - [c18]Nicolas D'Alessandro, Joëlle Tilmanne, Maria Astrinaki, Thomas Hueber, Rasmus Dall, Thierry Ravet, Alexis Moinet, Hüseyin Çakmak, Onur Babacan, Adela Barbulescu, Valentin Parfait, Victor Huguenin, Emine Sümeyye Kalayci, Qiong Hu:
Reactive Statistical Mapping: Towards the Sketching of Performative Control with Data. eNTERFACE 2013: 20-49 - [c17]Thomas Hueber:
Ultraspeech-player: intuitive visualization of ultrasound articulatory data for speech therapy and pronunciation training. INTERSPEECH 2013: 752-753 - [c16]Thomas Hueber, Gérard Bailly, Pierre Badin, Frédéric Elisei:
Speaker adaptation of an acoustic-articulatory inversion model using cascaded Gaussian mixture regressions. INTERSPEECH 2013: 2753-2757 - [c15]Thomas Hueber, Gérard Bailly, Pierre Badin, Frédéric Elisei:
Vizart3d - real-time system of visual articulatory feedback. SLaTE 2013 - [e1]Pierre Badin, Thomas Hueber, Gérard Bailly, Didier Demolin, Françoise Raby:
ISCA International Workshop on Speech and Language Technology in Education, SLaTE 2013, Grenoble, France, August 30 - September 1, 2013. ISCA 2013 [contents] - 2012
- [c14]Thomas Hueber, Gérard Bailly, Bruce Denby:
Continuous Articulatory-to-Acoustic Mapping using Phone-based Trajectory HMM for a Silent Speech Interface. INTERSPEECH 2012: 723-726 - [c13]Thomas Hueber, Atef Ben Youssef, Gérard Bailly, Pierre Badin, Frédéric Elisei:
Cross-speaker Acoustic-to-Articulatory Inversion using Phone-based Trajectory HMM for Pronunciation Training. INTERSPEECH 2012: 783-786 - [c12]Thomas Hueber, Atef Ben Youssef, Pierre Badin, Gérard Bailly, Frédéric Elisei:
Vizart3D : Retour Articulatoire Visuel pour l'Aide à la Prononciation (Vizart3D: Visual Articulatory Feedack for Computer-Assisted Pronunciation Training) [in French]. JEP-TALN-RECITAL 2012: 17-18 - 2011
- [c11]Jun Cai, Thomas Hueber, Bruce Denby, Elie-Laurent Benaroya, Gérard Chollet, Pierre Roussel, Gérard Dreyfus, Lise Crevier-Buchman:
A Visual Speech Recognition System for an Ultrasound-based Silent Speech Interface. ICPhS 2011: 384-387 - [c10]Bruce Denby, Jun Cai, Pierre Roussel, Gérard Dreyfus, Lise Crevier-Buchman, Claire Pillot-Loiseau, Thomas Hueber, Gérard Chollet:
Tests of an Interactive, Phrasebook-style Post-laryngectomy Voice-replacement System. ICPhS 2011: 572-575 - [c9]Atef Ben Youssef, Thomas Hueber, Pierre Badin, Gérard Bailly:
Toward a Multi-Speaker Visual Articulatory Feedback System. INTERSPEECH 2011: 589-592 - [c8]Thomas Hueber, Elie-Laurent Benaroya, Bruce Denby, Gérard Chollet:
Statistical Mapping Between Articulatory and Acoustic Data for an Ultrasound-Based Silent Speech Interface. INTERSPEECH 2011: 593-596 - 2010
- [j2]Bruce Denby, Tanja Schultz, Kiyoshi Honda, Thomas Hueber, J. M. Gilbert, Jonathan S. Brumberg:
Silent speech interfaces. Speech Commun. 52(4): 270-287 (2010) - [j1]Thomas Hueber, Elie-Laurent Benaroya, Gérard Chollet, Bruce Denby, Gérard Dreyfus, Maureen Stone:
Development of a silent speech interface driven by ultrasound and optical images of the tongue and lips. Speech Commun. 52(4): 288-300 (2010) - [c7]Victoria M. Florescu, Lise Crevier-Buchman, Bruce Denby, Thomas Hueber, Antonia Colazo-Simon, Claire Pillot-Loiseau, Pierre Roussel-Ragot, Cédric Gendrot, Sophie Quattrocchi:
Silent vs vocalized articulation for a portable ultrasound-based silent speech interface. INTERSPEECH 2010: 450-453
2000 – 2009
- 2009
- [c6]Thomas Hueber, Elie-Laurent Benaroya, Gérard Chollet, Bruce Denby, Gérard Dreyfus, Maureen Stone:
Visuo-phonetic decoding using multi-stream and context-dependent models for an ultrasound-based silent speech interface. INTERSPEECH 2009: 640-643 - 2008
- [c5]Thomas Hueber, Gérard Chollet, Bruce Denby, Gérard Dreyfus, Maureen Stone:
Towards a segmental vocoder driven by ultrasound and optical images of the tongue and lips. INTERSPEECH 2008: 2028-2031 - [c4]Thomas Hueber, Gérard Chollet, Bruce Denby, Gérard Dreyfus, Maureen Stone:
Phone recognition from ultrasound and optical video sequences for a silent speech interface. INTERSPEECH 2008: 2032-2035 - 2007
- [c3]Thomas Hueber, Guido Aversano, Gérard Chollet, Bruce Denby, Gérard Dreyfus, Yacine Oussar, Pierre Roussel-Ragot, Maureen Stone:
Eigentongue Feature Extraction for an Ultrasound-Based Silent Speech Interface. ICASSP (1) 2007: 1245-1248 - [c2]Thomas Hueber, Gérard Chollet, Bruce Denby, Gérard Dreyfus, Maureen Stone:
Continuous-speech phone recognition from ultrasound and optical images of the tongue and lips. INTERSPEECH 2007: 658-661 - [c1]Gérard Chollet, R. Landais, Thomas Hueber, Hervé Bredin, Chafic Mokbel, Patrick Perrot, Leila Zouari:
Some Experiments in Audio-Visual Speech Processing. NOLISP 2007: 28-56
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-09-30 01:01 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint