default search action
François G. Germain
Person information
- affiliation: Stanford University, Department of Music
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c22]Zexu Pan, Gordon Wichern, François G. Germain, Aswin Shanmugam Subramanian, Jonathan Le Roux:
Late Audio-Visual Fusion for in-the-Wild Speaker Diarization. ICASSP Workshops 2024: 174-178 - [c21]Shih-Lun Wu, Xuankai Chang, Gordon Wichern, Jee-Weon Jung, François G. Germain, Jonathan Le Roux, Shinji Watanabe:
Improving Audio Captioning Models with Fine-Grained Audio Features, Text Embedding Supervision, and LLM Mix-Up Augmentation. ICASSP 2024: 316-320 - [c20]Chang-Bin Jeon, Gordon Wichern, François G. Germain, Jonathan Le Roux:
Why Does Music Source Separation Benefit from Cacophony? ICASSP Workshops 2024: 873-877 - [c19]Yoshiki Masuyama, Gordon Wichern, François G. Germain, Zexu Pan, Sameer Khurana, Chiori Hori, Jonathan Le Roux:
NIIRF: Neural IIR Filter Field for HRTF Upsampling and Personalization. ICASSP 2024: 1016-1020 - [c18]Dimitrios Bralios, Gordon Wichern, François G. Germain, Zexu Pan, Sameer Khurana, Chiori Hori, Jonathan Le Roux:
Generation or Replication: Auscultating Audio Latent Diffusion Models. ICASSP 2024: 1156-1160 - [c17]Zexu Pan, Gordon Wichern, François G. Germain, Sameer Khurana, Jonathan Le Roux:
NeuroHeed+: Improving Neuro-Steered Speaker Extraction with Joint Auditory Attention Detection. ICASSP 2024: 11456-11460 - [c16]Kohei Saijo, Gordon Wichern, François G. Germain, Zexu Pan, Jonathan Le Roux:
TF-Locoformer: Transformer with Local Modeling by Convolution for Speech Separation and Enhancement. IWAENC 2024: 205-209 - [i15]Yoshiki Masuyama, Gordon Wichern, François G. Germain, Zexu Pan, Sameer Khurana, Chiori Hori, Jonathan Le Roux:
NIIRF: Neural IIR Filter Field for HRTF Upsampling and Personalization. CoRR abs/2402.17907 (2024) - [i14]Junghyun Koo, Gordon Wichern, François G. Germain, Sameer Khurana, Jonathan Le Roux:
SMITIN: Self-Monitored Inference-Time INtervention for Generative Music Transformers. CoRR abs/2404.02252 (2024) - [i13]Janek Ebbers, François G. Germain, Gordon Wichern, Jonathan Le Roux:
Sound Event Bounding Boxes. CoRR abs/2406.04212 (2024) - [i12]Kohei Saijo, Gordon Wichern, François G. Germain, Zexu Pan, Jonathan Le Roux:
Enhanced Reverberation as Supervision for Unsupervised Speech Separation. CoRR abs/2408.03438 (2024) - [i11]Kohei Saijo, Gordon Wichern, François G. Germain, Zexu Pan, Jonathan Le Roux:
TF-Locoformer: Transformer with Local Modeling by Convolution for Speech Separation and Enhancement. CoRR abs/2408.03440 (2024) - [i10]Kohei Saijo, Janek Ebbers, François G. Germain, Sameer Khurana, Gordon Wichern, Jonathan Le Roux:
Leveraging Audio-Only Data for Text-Queried Target Sound Extraction. CoRR abs/2409.13152 (2024) - [i9]Kohei Saijo, Janek Ebbers, François G. Germain, Gordon Wichern, Jonathan Le Roux:
Task-Aware Unified Source Separation. CoRR abs/2410.23987 (2024) - 2023
- [c15]Zexu Pan, Gordon Wichern, Yoshiki Masuyama, François G. Germain, Sameer Khurana, Chiori Hori, Jonathan Le Roux:
Scenario-Aware Audio-Visual TF-Gridnet for Target Speech Extraction. ASRU 2023: 1-8 - [c14]Ke Chen, Gordon Wichern, François G. Germain, Jonathan Le Roux:
Paᗧ-HuBERT: Self-Supervised Music Source Separation Via Primitive Auditory Clustering And Hidden-Unit Bert. ICASSP Workshops 2023: 1-5 - [c13]Hao Yen, François G. Germain, Gordon Wichern, Jonathan Le Roux:
Cold Diffusion for Speech Enhancement. ICASSP 2023: 1-5 - [c12]François G. Germain, Gordon Wichern, Jonathan Le Roux:
Hyperbolic Unsupervised Anomalous Sound Detection. WASPAA 2023: 1-5 - [c11]Ricardo Falcón Pérez, Gordon Wichern, François G. Germain, Jonathan Le Roux:
Location as Supervision for Weakly Supervised Multi-Channel Source Separation of Machine Sounds. WASPAA 2023: 1-5 - [i8]Ke Chen, Gordon Wichern, François G. Germain, Jonathan Le Roux:
Pac-HuBERT: Self-Supervised Music Source Separation via Primitive Auditory Clustering and Hidden-Unit BERT. CoRR abs/2304.02160 (2023) - [i7]Shih-Lun Wu, Xuankai Chang, Gordon Wichern, Jee-weon Jung, François G. Germain, Jonathan Le Roux, Shinji Watanabe:
Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Supervision, and LLM Mix-up Augmentation. CoRR abs/2309.17352 (2023) - [i6]Dimitrios Bralios, Gordon Wichern, François G. Germain, Zexu Pan, Sameer Khurana, Chiori Hori, Jonathan Le Roux:
Generation or Replication: Auscultating Audio Latent Diffusion Models. CoRR abs/2310.10604 (2023) - [i5]Zexu Pan, Gordon Wichern, Yoshiki Masuyama, François G. Germain, Sameer Khurana, Chiori Hori, Jonathan Le Roux:
Scenario-Aware Audio-Visual TF-GridNet for Target Speech Extraction. CoRR abs/2310.19644 (2023) - [i4]Zexu Pan, Gordon Wichern, François G. Germain, Sameer Khurana, Jonathan Le Roux:
NeuroHeed+: Improving Neuro-steered Speaker Extraction with Joint Auditory Attention Detection. CoRR abs/2312.07513 (2023) - 2022
- [i3]Zexu Pan, Gordon Wichern, François G. Germain, Aswin Shanmugam Subramanian, Jonathan Le Roux:
Towards End-to-end Speaker Diarization in the Wild. CoRR abs/2211.01299 (2022) - [i2]Hao Yen, François G. Germain, Gordon Wichern, Jonathan Le Roux:
Cold Diffusion for Speech Enhancement. CoRR abs/2211.02527 (2022) - 2021
- [c10]François G. Germain:
Practical Virtual Analog Modeling Using MÖbius Transforms. DAFx 2021: 49-56 - [c9]François G. Germain:
Periodic Analysis of Nonlinear Virtual Analog Models. WASPAA 2021: 321-325
2010 – 2019
- 2019
- [c8]François G. Germain, Qifeng Chen, Vladlen Koltun:
Speech Denoising with Deep Feature Losses. INTERSPEECH 2019: 2723-2727 - 2018
- [i1]François G. Germain, Qifeng Chen, Vladlen Koltun:
Speech Denoising with Deep Feature Losses. CoRR abs/1806.10522 (2018) - 2017
- [c7]François G. Germain, Kurt James Werner:
Optimizing differentiated discretization for audio circuits beyond driving point transfer functions. WASPAA 2017: 384-388 - 2016
- [c6]François G. Germain, Gautham J. Mysore, Takako Fujioka:
Equalization matching of speech recordings in real-world environments. ICASSP 2016: 609-613 - 2015
- [c5]François G. Germain, Iretiayo A. Akinola, Qiyuan Tian, Steven Lansel, Brian A. Wandell:
Efficient illuminant correction in the local, linear, learned (L3) method. Digital Photography 2015: 940404 - [c4]François G. Germain, Gautham J. Mysore:
Speaker and noise independent online single-channel speech enhancement. ICASSP 2015: 71-75 - 2014
- [j1]François G. Germain, Gautham J. Mysore:
Stopping Criteria for Non-Negative Matrix Factorization Based Supervised and Semi-Supervised Source Separation. IEEE Signal Process. Lett. 21(10): 1284-1288 (2014) - 2013
- [c3]François G. Germain, Dennis L. Sun, Gautham J. Mysore:
Speaker and noise independent voice activity detection. INTERSPEECH 2013: 732-736 - [c2]Zafar Rafii, François G. Germain, Dennis L. Sun, Gautham J. Mysore:
Combining Modeling Of Singing Voice And Background Music For Automatic Separation Of Musical Mixtures. ISMIR 2013: 41-46
2000 – 2009
- 2009
- [c1]François G. Germain, Gianpaolo Evangelista:
Synthesis of guitar by digital waveguides: Modeling the plectrum in the physical interaction of the player with the instrument. WASPAA 2009: 25-28
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2025-01-09 13:16 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint