Umut Isik

2020 – today

2024
[c17]
persistent URL: https://dblp.org/rec/conf/icassp/TogamiVHGIG24
Masahito Togami, Jean-Marc Valin, Karim Helwani, Ritwik Giri, Umut Isik, Michael M. Goodwin:
Real-Time Stereo Speech Enhancement with Spatial-Cue Preservation Based on Dual-Path Structure. ICASSP 2024: 71-75
2022
[c16]
persistent URL: https://dblp.org/rec/conf/icassp/YuanWIGVGK22
Siyuan Yuan, Zhepei Wang, Umut Isik, Ritwik Giri, Jean-Marc Valin, Michael M. Goodwin, Arvindh Krishnaswamy:
Improved Singing Voice Separation with Chromagram-Based Pitch-Aware Remixing. ICASSP 2022: 111-115
[c15]
persistent URL: https://dblp.org/rec/conf/icassp/ValinISK22
Jean-Marc Valin, Umut Isik, Paris Smaragdis, Arvindh Krishnaswamy:
Neural Speech Synthesis on a Shoestring: Improving the Efficiency of LPCNet. ICASSP 2022: 8437-8441
[c14]
persistent URL: https://dblp.org/rec/conf/interspeech/SubramaniVISK22
Krishna Subramani, Jean-Marc Valin, Umut Isik, Paris Smaragdis, Arvindh Krishnaswamy:
End-to-end LPCNet: A Neural Vocoder With Fully-Differentiable LPC Estimation. INTERSPEECH 2022: 818-822
[i11]
persistent URL: https://dblp.org/rec/journals/corr/abs-2202-11169
Jean-Marc Valin, Umut Isik, Paris Smaragdis, Arvindh Krishnaswamy:
Neural Speech Synthesis on a Shoestring: Improving the Efficiency of LPCNet. CoRR abs/2202.11169 (2022)
[i10]
persistent URL: https://dblp.org/rec/journals/corr/abs-2202-11301
Krishna Subramani, Jean-Marc Valin, Umut Isik, Paris Smaragdis, Arvindh Krishnaswamy:
End-to-end LPCNet: A Neural Vocoder With Fully-Differentiable LPC Estimation. CoRR abs/2202.11301 (2022)
[i9]
persistent URL: https://dblp.org/rec/journals/corr/abs-2203-15092
Siyuan Yuan, Zhepei Wang, Umut Isik, Ritwik Giri, Jean-Marc Valin, Michael M. Goodwin, Arvindh Krishnaswamy:
Improved singing voice separation with chromagram-based pitch-aware remixing. CoRR abs/2203.15092 (2022)
[i8]
persistent URL: https://dblp.org/rec/journals/corr/abs-2206-07917
Jean-Marc Valin, Ritwik Giri, Shrikant Venkataramani, Umut Isik, Arvindh Krishnaswamy:
To Dereverb Or Not to Dereverb? Perceptual Studies On Real-Time Dereverberation Targets. CoRR abs/2206.07917 (2022)
[i7]
persistent URL: https://dblp.org/rec/journals/corr/abs-2206-09072
Zhepei Wang, Ritwik Giri, Shrikant Venkataramani, Umut Isik, Jean-Marc Valin, Paris Smaragdis, Michael M. Goodwin, Arvindh Krishnaswamy:
Semi-supervised Time Domain Target Speaker Extraction with Attention. CoRR abs/2206.09072 (2022)
2021
[c13]
persistent URL: https://dblp.org/rec/conf/icassp/WangGIVK21
Zhepei Wang, Ritwik Giri, Umut Isik, Jean-Marc Valin, Arvindh Krishnaswamy:
Semi-Supervised Singing Voice Separation With Noisy Self-Training. ICASSP 2021: 31-35
[c12]
persistent URL: https://dblp.org/rec/conf/icassp/CasebeerVIVGK21
Jonah Casebeer, Vinjai Vale, Umut Isik, Jean-Marc Valin, Ritwik Giri, Arvindh Krishnaswamy:
Enhancing into the Codec: Noise Robust Speech Coding with Vector-Quantized Autoencoders. ICASSP 2021: 711-715
[c11]
persistent URL: https://dblp.org/rec/conf/icassp/ValinTHIK21
Jean-Marc Valin, Srikanth V. Tenneti, Karim Helwani, Umut Isik, Arvindh Krishnaswamy:
Low-Complexity, Real-Time Joint Neural Echo Control and Speech Enhancement Based On PercepNet. ICASSP 2021: 7133-7137
[c10]
persistent URL: https://dblp.org/rec/conf/interspeech/GiriVVIK21
Ritwik Giri, Shrikant Venkataramani, Jean-Marc Valin, Umut Isik, Arvindh Krishnaswamy:
Personalized PercepNet: Real-Time, Low-Complexity Target Voice Separation and Enhancement. Interspeech 2021: 1124-1128
[i6]
persistent URL: https://dblp.org/rec/journals/corr/abs-2102-06610
Jonah Casebeer, Vinjai Vale, Umut Isik, Jean-Marc Valin, Ritwik Giri, Arvindh Krishnaswamy:
Enhancing into the codec: Noise Robust Speech Coding with Vector-Quantized Autoencoders. CoRR abs/2102.06610 (2021)
2020
[c9]
persistent URL: https://dblp.org/rec/conf/dcase/GiriTCHIK20
Ritwik Giri, Srikanth V. Tenneti, Fangzhou Cheng, Karim Helwani, Umut Isik, Arvindh Krishnaswamy:
Self-Supervised Classification for Detecting Anomalous Sounds. DCASE 2020: 46-50
[c8]
persistent URL: https://dblp.org/rec/conf/dcase/GiriCHTIK20
Ritwik Giri, Fangzhou Cheng, Karim Helwani, Srikanth V. Tenneti, Umut Isik, Arvindh Krishnaswamy:
Group Masked Autoencoder Based Density Estimator for Audio Anomaly Detection. DCASE 2020: 51-55
[c7]
persistent URL: https://dblp.org/rec/conf/icassp/TolooshamsGSIK20
Bahareh Tolooshams, Ritwik Giri, Andrew H. Song, Umut Isik, Arvindh Krishnaswamy:
Channel-Attention Dense U-Net for Multichannel Speech Enhancement. ICASSP 2020: 836-840
[c6]
persistent URL: https://dblp.org/rec/conf/icassp/CasebeerIVK20
Jonah Casebeer, Umut Isik, Shrikant Venkataramani, Arvindh Krishnaswamy:
Efficient Trainable Front-Ends for Neural Speech Enhancement. ICASSP 2020: 6639-6643
[c5]
persistent URL: https://dblp.org/rec/conf/interspeech/ValinIPGHK20
Jean-Marc Valin, Umut Isik, Neerad Phansalkar, Ritwik Giri, Karim Helwani, Arvindh Krishnaswamy:
A Perceptually-Motivated Approach for Low-Complexity, Real-Time Enhancement of Fullband Speech. INTERSPEECH 2020: 2482-2486
[c4]
persistent URL: https://dblp.org/rec/conf/interspeech/IsikGPVHK20
Umut Isik, Ritwik Giri, Neerad Phansalkar, Jean-Marc Valin, Karim Helwani, Arvindh Krishnaswamy:
PoCoNet: Better Speech Enhancement with Frequency-Positional Embeddings, Semi-Supervised Conversational Data, and Biased Loss. INTERSPEECH 2020: 2487-2491
[c3]
persistent URL: https://dblp.org/rec/conf/ismir/ChiKYSI20
Wayne Chi, Prachi Kumar, Suri Yaddanapudi, Rahul Suresh, Umut Isik:
Generating Music with a Self-Correcting Non-Chronological Autoregressive Model. ISMIR 2020: 893-900
[c2]
persistent URL: https://dblp.org/rec/conf/iwslt/FedericoEBGIKS20
Marcello Federico, Robert Enyedi, Roberto Barra-Chicote, Ritwik Giri, Umut Isik, Arvindh Krishnaswamy, Hassan Sawaf:
From Speech-to-Speech Translation to Automatic Dubbing. IWSLT 2020: 257-264
[i5]
persistent URL: https://dblp.org/rec/journals/corr/abs-2001-06785
Marcello Federico, Robert Enyedi, Roberto Barra-Chicote, Ritwik Giri, Umut Isik, Arvindh Krishnaswamy:
From Speech-to-Speech Translation to Automatic Dubbing. CoRR abs/2001.06785 (2020)
[i4]
persistent URL: https://dblp.org/rec/journals/corr/abs-2001-11542
Bahareh Tolooshams, Ritwik Giri, Andrew H. Song, Umut Isik, Arvindh Krishnaswamy:
Channel-Attention Dense U-Net for Multichannel Speech Enhancement. CoRR abs/2001.11542 (2020)
[i3]
persistent URL: https://dblp.org/rec/journals/corr/abs-2002-09286
Jonah Casebeer, Umut Isik, Shrikant Venkataramani, Arvindh Krishnaswamy:
Efficient Trainable Front-Ends for Neural Speech Enhancement. CoRR abs/2002.09286 (2020)
[i2]
persistent URL: https://dblp.org/rec/journals/corr/abs-2008-04470
Umut Isik, Ritwik Giri, Neerad Phansalkar, Jean-Marc Valin, Karim Helwani, Arvindh Krishnaswamy:
PoCoNet: Better Speech Enhancement with Frequency-Positional Embeddings, Semi-Supervised Conversational Data, and Biased Loss. CoRR abs/2008.04470 (2020)
[i1]
persistent URL: https://dblp.org/rec/journals/corr/abs-2008-08927
Wayne Chi, Prachi Kumar, Suri Yaddanapudi, Rahul Suresh, Umut Isik:
Generating Music with a Self-Correcting Non-Chronological Autoregressive Model. CoRR abs/2008.08927 (2020)

2010 – 2019

2019
[c1]
persistent URL: https://dblp.org/rec/conf/waspaa/GiriIK19
Ritwik Giri, Umut Isik, Arvindh Krishnaswamy:
Attention Wave-U-Net for Speech Enhancement. WASPAA 2019: 249-253
