default search action

combined dblp search
author search
venue search
publication search

ask others

François G. Germain

> Home > Persons

Person information

affiliation: Stanford University, Department of Music

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[c22]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/PanWGSR24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/PanWGSR24
Zexu Pan, Gordon Wichern, François G. Germain, Aswin Shanmugam Subramanian, Jonathan Le Roux:
Late Audio-Visual Fusion for in-the-Wild Speaker Diarization. ICASSP Workshops 2024: 174-178
[c21]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WuCWJGR024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WuCWJGR024
Shih-Lun Wu, Xuankai Chang, Gordon Wichern, Jee-Weon Jung, François G. Germain, Jonathan Le Roux, Shinji Watanabe:
Improving Audio Captioning Models with Fine-Grained Audio Features, Text Embedding Supervision, and LLM Mix-Up Augmentation. ICASSP 2024: 316-320
[c20]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/JeonWGR24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/JeonWGR24
Chang-Bin Jeon, Gordon Wichern, François G. Germain, Jonathan Le Roux:
Why Does Music Source Separation Benefit from Cacophony? ICASSP Workshops 2024: 873-877
[c19]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MasuyamaWGPKHR24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MasuyamaWGPKHR24
Yoshiki Masuyama, Gordon Wichern, François G. Germain, Zexu Pan, Sameer Khurana, Chiori Hori, Jonathan Le Roux:
NIIRF: Neural IIR Filter Field for HRTF Upsampling and Personalization. ICASSP 2024: 1016-1020
[c18]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/BraliosWGPKHR24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/BraliosWGPKHR24
Dimitrios Bralios, Gordon Wichern, François G. Germain, Zexu Pan, Sameer Khurana, Chiori Hori, Jonathan Le Roux:
Generation or Replication: Auscultating Audio Latent Diffusion Models. ICASSP 2024: 1156-1160
[c17]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/PanWGKR24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/PanWGKR24
Zexu Pan, Gordon Wichern, François G. Germain, Sameer Khurana, Jonathan Le Roux:
NeuroHeed+: Improving Neuro-Steered Speaker Extraction with Joint Auditory Attention Detection. ICASSP 2024: 11456-11460
[c16]
- view
  authority control:
- export record
  dblp key:
  - conf/iwaenc/SaijoWGPR24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iwaenc/SaijoWGPR24
Kohei Saijo, Gordon Wichern, François G. Germain, Zexu Pan, Jonathan Le Roux:
TF-Locoformer: Transformer with Local Modeling by Convolution for Speech Separation and Enhancement. IWAENC 2024: 205-209
[i15]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-17907
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-17907
Yoshiki Masuyama, Gordon Wichern, François G. Germain, Zexu Pan, Sameer Khurana, Chiori Hori, Jonathan Le Roux:
NIIRF: Neural IIR Filter Field for HRTF Upsampling and Personalization. CoRR abs/2402.17907 (2024)
[i14]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-02252
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-02252
Junghyun Koo, Gordon Wichern, François G. Germain, Sameer Khurana, Jonathan Le Roux:
SMITIN: Self-Monitored Inference-Time INtervention for Generative Music Transformers. CoRR abs/2404.02252 (2024)
[i13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-04212
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-04212
Janek Ebbers, François G. Germain, Gordon Wichern, Jonathan Le Roux:
Sound Event Bounding Boxes. CoRR abs/2406.04212 (2024)
[i12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2408-03438
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2408-03438
Kohei Saijo, Gordon Wichern, François G. Germain, Zexu Pan, Jonathan Le Roux:
Enhanced Reverberation as Supervision for Unsupervised Speech Separation. CoRR abs/2408.03438 (2024)
[i11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2408-03440
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2408-03440
Kohei Saijo, Gordon Wichern, François G. Germain, Zexu Pan, Jonathan Le Roux:
TF-Locoformer: Transformer with Local Modeling by Convolution for Speech Separation and Enhancement. CoRR abs/2408.03440 (2024)
[i10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-13152
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-13152
Kohei Saijo, Janek Ebbers, François G. Germain, Sameer Khurana, Gordon Wichern, Jonathan Le Roux:
Leveraging Audio-Only Data for Text-Queried Target Sound Extraction. CoRR abs/2409.13152 (2024)
[i9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-23987
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-23987
Kohei Saijo, Janek Ebbers, François G. Germain, Gordon Wichern, Jonathan Le Roux:
Task-Aware Unified Source Separation. CoRR abs/2410.23987 (2024)
2023
[c15]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/PanWMGKHR23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/PanWMGKHR23
Zexu Pan, Gordon Wichern, Yoshiki Masuyama, François G. Germain, Sameer Khurana, Chiori Hori, Jonathan Le Roux:
Scenario-Aware Audio-Visual TF-Gridnet for Target Speech Extraction. ASRU 2023: 1-8
[c14]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChenWGR23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenWGR23
Ke Chen, Gordon Wichern, François G. Germain, Jonathan Le Roux:
Paᗧ-HuBERT: Self-Supervised Music Source Separation Via Primitive Auditory Clustering And Hidden-Unit Bert. ICASSP Workshops 2023: 1-5
[c13]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/YenGWR23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YenGWR23
Hao Yen, François G. Germain, Gordon Wichern, Jonathan Le Roux:
Cold Diffusion for Speech Enhancement. ICASSP 2023: 1-5
[c12]
- view
  authority control:
- export record
  dblp key:
  - conf/waspaa/GermainWR23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/waspaa/GermainWR23
François G. Germain, Gordon Wichern, Jonathan Le Roux:
Hyperbolic Unsupervised Anomalous Sound Detection. WASPAA 2023: 1-5
[c11]
- view
  authority control:
- export record
  dblp key:
  - conf/waspaa/PerezWGR23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/waspaa/PerezWGR23
Ricardo Falcón Pérez, Gordon Wichern, François G. Germain, Jonathan Le Roux:
Location as Supervision for Weakly Supervised Multi-Channel Source Separation of Machine Sounds. WASPAA 2023: 1-5
[i8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2304-02160
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2304-02160
Ke Chen, Gordon Wichern, François G. Germain, Jonathan Le Roux:
Pac-HuBERT: Self-Supervised Music Source Separation via Primitive Auditory Clustering and Hidden-Unit BERT. CoRR abs/2304.02160 (2023)
[i7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-17352
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-17352
Shih-Lun Wu, Xuankai Chang, Gordon Wichern, Jee-weon Jung, François G. Germain, Jonathan Le Roux, Shinji Watanabe:
Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Supervision, and LLM Mix-up Augmentation. CoRR abs/2309.17352 (2023)
[i6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-10604
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-10604
Dimitrios Bralios, Gordon Wichern, François G. Germain, Zexu Pan, Sameer Khurana, Chiori Hori, Jonathan Le Roux:
Generation or Replication: Auscultating Audio Latent Diffusion Models. CoRR abs/2310.10604 (2023)
[i5]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-19644
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-19644
Zexu Pan, Gordon Wichern, Yoshiki Masuyama, François G. Germain, Sameer Khurana, Chiori Hori, Jonathan Le Roux:
Scenario-Aware Audio-Visual TF-GridNet for Target Speech Extraction. CoRR abs/2310.19644 (2023)
[i4]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-07513
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-07513
Zexu Pan, Gordon Wichern, François G. Germain, Sameer Khurana, Jonathan Le Roux:
NeuroHeed+: Improving Neuro-steered Speaker Extraction with Joint Auditory Attention Detection. CoRR abs/2312.07513 (2023)
2022
[i3]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-01299
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-01299
Zexu Pan, Gordon Wichern, François G. Germain, Aswin Shanmugam Subramanian, Jonathan Le Roux:
Towards End-to-end Speaker Diarization in the Wild. CoRR abs/2211.01299 (2022)
[i2]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-02527
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-02527
Hao Yen, François G. Germain, Gordon Wichern, Jonathan Le Roux:
Cold Diffusion for Speech Enhancement. CoRR abs/2211.02527 (2022)
2021
[c10]
- view
  authority control:
- export record
  dblp key:
  - conf/dafx/Germain21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/dafx/Germain21
François G. Germain:
Practical Virtual Analog Modeling Using MÖbius Transforms. DAFx 2021: 49-56
[c9]
- view
  authority control:
- export record
  dblp key:
  - conf/waspaa/Germain21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/waspaa/Germain21
François G. Germain:
Periodic Analysis of Nonlinear Virtual Analog Models. WASPAA 2021: 321-325

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[c8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GermainCK19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GermainCK19
François G. Germain, Qifeng Chen, Vladlen Koltun:
Speech Denoising with Deep Feature Losses. INTERSPEECH 2019: 2723-2727
2018
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1806-10522
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1806-10522
François G. Germain, Qifeng Chen, Vladlen Koltun:
Speech Denoising with Deep Feature Losses. CoRR abs/1806.10522 (2018)
2017
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/waspaa/GermainW17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/waspaa/GermainW17
François G. Germain, Kurt James Werner:
Optimizing differentiated discretization for audio circuits beyond driving point transfer functions. WASPAA 2017: 384-388
2016
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/GermainMF16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/GermainMF16
François G. Germain, Gautham J. Mysore, Takako Fujioka:
Equalization matching of speech recordings in real-world environments. ICASSP 2016: 609-613
2015
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/dphoto/GermainATLW15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/dphoto/GermainATLW15
François G. Germain, Iretiayo A. Akinola, Qiyuan Tian, Steven Lansel, Brian A. Wandell:
Efficient illuminant correction in the local, linear, learned (L3) method. Digital Photography 2015: 940404
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/GermainM15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/GermainM15
François G. Germain, Gautham J. Mysore:
Speaker and noise independent online single-channel speech enhancement. ICASSP 2015: 71-75
2014
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/spl/GermainM14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spl/GermainM14
François G. Germain, Gautham J. Mysore:
Stopping Criteria for Non-Negative Matrix Factorization Based Supervised and Semi-Supervised Source Separation. IEEE Signal Process. Lett. 21(10): 1284-1288 (2014)
2013
[c3]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GermainSM13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GermainSM13
François G. Germain, Dennis L. Sun, Gautham J. Mysore:
Speaker and noise independent voice activity detection. INTERSPEECH 2013: 732-736
[c2]
- view
  - electronic edition @ pucpr.br
  - details & citations
- export record
  dblp key:
  - conf/ismir/RafiiGSM13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ismir/RafiiGSM13
Zafar Rafii, François G. Germain, Dennis L. Sun, Gautham J. Mysore:
Combining Modeling Of Singing Voice And Background Music For Automatic Separation Of Musical Mixtures. ISMIR 2013: 41-46

2000 – 2009

see FAQ

What is the meaning of the colors in the publication lists?

2009
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/waspaa/GermainE09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/waspaa/GermainE09
François G. Germain, Gianpaolo Evangelista:
Synthesis of guitar by digital waveguides: Modeling the plectrum in the physical interaction of the player with the instrument. WASPAA 2009: 25-28

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.