default search action
Buye Xu
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
Journal Articles
- 2022
- [j3]Efthymios Tzinis, Yossi Adi, Vamsi K. Ithapu, Buye Xu, Paris Smaragdis, Anurag Kumar:
RemixIT: Continual Self-Training of Speech Enhancement Models via Bootstrapped Remixing. IEEE J. Sel. Top. Signal Process. 16(6): 1329-1341 (2022) - 2021
- [j2]Ke Tan, Buye Xu, Anurag Kumar, Eliya Nachmani, Yossi Adi:
SAGRNN: Self-Attentive Gated RNN For Binaural Speaker Separation With Interaural Cue Preservation. IEEE Signal Process. Lett. 28: 26-30 (2021) - 2020
- [j1]Yan Zhao, DeLiang Wang, Buye Xu, Tao Zhang:
Monaural Speech Dereverberation Using Temporal Convolutional Networks With Self Attention. IEEE ACM Trans. Audio Speech Lang. Process. 28: 1598-1607 (2020)
Conference and Workshop Papers
- 2024
- [c26]Hassan Taherian, Ashutosh Pandey, Daniel Wong, Buye Xu, DeLiang Wang:
Leveraging Sound Localization to Improve Continuous Speaker Separation. ICASSP 2024: 621-625 - [c25]Ravi Shankar, Ke Tan, Buye Xu, Anurag Kumar:
A Closer Look at Wav2vec2 Embeddings for On-Device Single-Channel Speech Enhancement. ICASSP 2024: 751-755 - [c24]Vahid Ahmadi Kalkhorani, Anurag Kumar, Ke Tan, Buye Xu, DeLiang Wang:
Audiovisual Speaker Separation with Full- and Sub-Band Modeling in the Time-Frequency Domain. ICASSP 2024: 12001-12005 - [c23]Tsun-An Hsieh, Jacob Donley, Daniel Wong, Buye Xu, Ashutosh Pandey:
On the Importance of Neural Wiener Filter for Resource Efficient Multichannel Speech Enhancement. ICASSP 2024: 12181-12185 - [c22]Ashutosh Pandey, Buye Xu:
Decoupled Spatial and Temporal Processing for Resource Efficient Multichannel Speech Enhancement. ICASSP 2024: 12206-12210 - 2023
- [c21]Kuan-Lin Chen, Daniel D. E. Wong, Ke Tan, Buye Xu, Anurag Kumar, Vamsi Krishna Ithapu:
Leveraging Heteroscedastic Uncertainty in Learning Complex Spectral Mapping for Single-Channel Speech Enhancement. ICASSP 2023: 1-5 - [c20]Anurag Kumar, Ke Tan, Zhaoheng Ni, Pranay Manocha, Xiaohui Zhang, Ethan Henderson, Buye Xu:
Torchaudio-Squim: Reference-Less Speech Quality and Intelligibility Measures in Torchaudio. ICASSP 2023: 1-5 - [c19]Rodrigo Mira, Buye Xu, Jacob Donley, Anurag Kumar, Stavros Petridis, Vamsi Krishna Ithapu, Maja Pantic:
LA-VOCE: LOW-SNR Audio-Visual Speech Enhancement Using Neural Vocoders. ICASSP 2023: 1-5 - [c18]Hassan Taherian, Ashutosh Pandey, Daniel Wong, Buye Xu, DeLiang Wang:
Multi-input Multi-output Complex Spectral Mapping for Speaker Separation. INTERSPEECH 2023: 1070-1074 - [c17]Ashutosh Pandey, Ke Tan, Buye Xu:
A Simple RNN Model for Lightweight, Low-compute and Low-latency Multichannel Speech Enhancement in the Time Domain. INTERSPEECH 2023: 2478-2482 - [c16]Vahid Ahmadi Kalkhorani, Anurag Kumar, Ke Tan, Buye Xu, DeLiang Wang:
Time-domain Transformer-based Audiovisual Speaker Separation. INTERSPEECH 2023: 3472-3476 - [c15]Haibin Wu, Ke Tan, Buye Xu, Anurag Kumar, Daniel Wong:
Rethinking Complex-Valued Deep Neural Networks for Monaural Speech Enhancement. INTERSPEECH 2023: 3889-3893 - 2022
- [c14]Ashutosh Pandey, Buye Xu, Anurag Kumar, Jacob Donley, Paul Calamia, DeLiang Wang:
TPARN: Triple-Path Attentive Recurrent Network for Time-Domain Multichannel Speech Enhancement. ICASSP 2022: 6497-6501 - [c13]Ashutosh Pandey, Buye Xu, Anurag Kumar, Jacob Donley, Paul Calamia, DeLiang Wang:
Multichannel Speech Enhancement Without Beamforming. ICASSP 2022: 6502-6506 - [c12]Efthymios Tzinis, Yossi Adi, Vamsi K. Ithapu, Buye Xu, Anurag Kumar:
Continual Self-Training With Bootstrapped Remixing For Speech Enhancement. ICASSP 2022: 6947-6951 - [c11]Pranay Manocha, Anurag Kumar, Buye Xu, Anjali Menon, Israel Dejene Gebru, Vamsi Krishna Ithapu, Paul Calamia:
SAQAM: Spatial Audio Quality Assessment Metric. INTERSPEECH 2022: 649-653 - [c10]Ashutosh Pandey, Buye Xu, Anurag Kumar, Jacob Donley, Paul Calamia, DeLiang Wang:
Time-domain Ad-hoc Array Speech Enhancement Using a Triple-path Network. INTERSPEECH 2022: 729-733 - 2021
- [c9]Yangyang Xia, Buye Xu, Anurag Kumar:
Incorporating Real-World Noisy Speech in Neural-Network-Based Speech Enhancement Systems. ASRU 2021: 564-570 - [c8]Pranay Manocha, Buye Xu, Anurag Kumar:
NORESQA: A Framework for Speech Quality Assessment using Non-Matching References. NeurIPS 2021: 22363-22378 - [c7]Pranay Manocha, Anurag Kumar, Buye Xu, Anjali Menon, Israel D. Gebru, Vamsi K. Ithapu, Paul Calamia:
DPLM: A Deep Perceptual Spatial-Audio Localization Metric. WASPAA 2021: 6-10 - 2018
- [c6]Yan Zhao, Buye Xu, Ritwik Giri, Tao Zhang:
Perceptually Guided Speech Enhancement Using Deep Neural Networks. ICASSP 2018: 5074-5078 - [c5]Yan Zhao, DeLiang Wang, Buye Xu, Tao Zhang:
Late Reverberation Suppression Using Recurrent Neural Networks with Long Short-Term Memory. ICASSP 2018: 5434-5438 - 2015
- [c4]William S. Woods, Elior Hadad, Ivo Merks, Buye Xu, Sharon Gannot, Tao Zhang:
A real-world recording database for ad hoc microphone arrays. WASPAA 2015: 1-5 - 2014
- [c3]Ivo Merks, Buye Xu, Tao Zhang:
Design of a high order binaural microphone array for hearing aids using a rigid spherical model. ICASSP 2014: 3650-3654 - 2013
- [c2]Eric A. Durant, Jinjun Xiao, Buye Xu, Martin F. McKinney, Tao Zhang:
Perceptually motivated ANC for hearing-impaired listeners. WASPAA 2013: 1-4 - 2012
- [c1]Srikanth Vishnubhotla, Jinjun Xiao, Buye Xu, Martin F. McKinney, Tao Zhang:
Annoyance perception and modeling for hearing-impaired listeners. ICASSP 2012: 161-164
Informal and Other Publications
- 2024
- [i23]Ashutosh Pandey, Buye Xu:
Decoupled Spatial and Temporal Processing for Resource Efficient Multichannel Speech Enhancement. CoRR abs/2401.07879 (2024) - [i22]Tsun-An Hsieh, Jacob Donley, Daniel Wong, Buye Xu, Ashutosh Pandey:
On the Importance of Neural Wiener Filter for Resource Efficient Multichannel Speech Enhancement. CoRR abs/2401.07882 (2024) - [i21]Ravi Shankar, Ke Tan, Buye Xu, Anurag Kumar:
A Closer Look at Wav2Vec2 Embeddings for On-Device Single-Channel Speech Enhancement. CoRR abs/2403.01369 (2024) - [i20]Vahid Ahmadi Kalkhorani, Cheng Yu, Anurag Kumar, Ke Tan, Buye Xu, DeLiang Wang:
AV-CrossNet: an Audiovisual Complex Spectral Mapping Network for Speech Separation By Leveraging Narrow- and Cross-Band Modeling. CoRR abs/2406.11619 (2024) - [i19]Ashutosh Pandey, Sanha Lee, Juan Azcarreta, Daniel Wong, Buye Xu:
All Neural Low-latency Directional Speech Extraction. CoRR abs/2407.04879 (2024) - [i18]Zhongweiyang Xu, Ali Aroudi, Ke Tan, Ashutosh Pandey, Jung-Suk Lee, Buye Xu, Francesco Nesta:
FoVNet: Configurable Field-of-View Speech Enhancement with Low Computation and Distortion for Smart Glasses. CoRR abs/2408.06468 (2024) - [i17]Longbiao Cheng, Ashutosh Pandey, Buye Xu, Tobi Delbruck, Shih-Chii Liu:
Dynamic Gated Recurrent Neural Network for Compute-efficient Speech Enhancement. CoRR abs/2408.12425 (2024) - 2023
- [i16]Haibin Wu, Ke Tan, Buye Xu, Anurag Kumar, Daniel Wong:
Rethinking complex-valued deep neural networks for monaural speech enhancement. CoRR abs/2301.04320 (2023) - 2022
- [i15]Efthymios Tzinis, Yossi Adi, Vamsi Krishna Ithapu, Buye Xu, Paris Smaragdis, Anurag Kumar:
RemixIT: Continual self-training of speech enhancement models via bootstrapped remixing. CoRR abs/2202.08862 (2022) - [i14]Pranay Manocha, Anurag Kumar, Buye Xu, Anjali Menon, Israel D. Gebru, Vamsi K. Ithapu, Paul Calamia:
SAQAM: Spatial Audio Quality Assessment Metric. CoRR abs/2206.12297 (2022) - [i13]Tong Xiao, Buye Xu, Chuming Zhao:
Spatially Selective Active Noise Control Systems. CoRR abs/2208.09997 (2022) - [i12]Kuan-Lin Chen, Daniel D. E. Wong, Ke Tan, Buye Xu, Anurag Kumar, Vamsi Krishna Ithapu:
Leveraging Heteroscedastic Uncertainty in Learning Complex Spectral Mapping for Single-channel Speech Enhancement. CoRR abs/2211.08624 (2022) - [i11]Rodrigo Mira, Buye Xu, Jacob Donley, Anurag Kumar, Stavros Petridis, Vamsi Krishna Ithapu, Maja Pantic:
LA-VocE: Low-SNR Audio-visual Speech Enhancement using Neural Vocoders. CoRR abs/2211.10999 (2022) - 2021
- [i10]Pranay Manocha, Anurag Kumar, Buye Xu, Anjali Menon, Israel D. Gebru, Vamsi K. Ithapu, Paul Calamia:
DPLM: A Deep Perceptual Spatial-Audio Localization Metric. CoRR abs/2105.14180 (2021) - [i9]Ori Kabeli, Yossi Adi, Zhenyu Tang, Buye Xu, Anurag Kumar:
Online Self-Attentive Gated RNNs for Real-Time Speaker Separation. CoRR abs/2106.13493 (2021) - [i8]Yangyang Xia, Buye Xu, Anurag Kumar:
Incorporating Real-world Noisy Speech in Neural-network-based Speech Enhancement Systems. CoRR abs/2109.05172 (2021) - [i7]Pranay Manocha, Buye Xu, Anurag Kumar:
NORESQA - A Framework for Speech Quality Assessment using Non-Matching References. CoRR abs/2109.08125 (2021) - [i6]Efthymios Tzinis, Yossi Adi, Vamsi K. Ithapu, Buye Xu, Anurag Kumar:
Continual self-training with bootstrapped remixing for speech enhancement. CoRR abs/2110.10103 (2021) - [i5]Ashutosh Pandey, Buye Xu, Anurag Kumar, Jacob Donley, Paul Calamia, DeLiang Wang:
TPARN: Triple-path Attentive Recurrent Network for Time-domain Multichannel Speech Enhancement. CoRR abs/2110.10757 (2021) - [i4]Ashutosh Pandey, Buye Xu, Anurag Kumar, Jacob Donley, Paul Calamia, DeLiang Wang:
TADRN: Triple-Attentive Dual-Recurrent Network for Ad-hoc Array Multichannel Speech Enhancement. CoRR abs/2110.11844 (2021) - [i3]Ashutosh Pandey, Buye Xu, Anurag Kumar, Jacob Donley, Paul Calamia, DeLiang Wang:
Multichannel Speech Enhancement without Beamforming. CoRR abs/2110.13130 (2021) - [i2]Jonah Casebeer, Jacob Donley, Daniel Wong, Buye Xu, Anurag Kumar:
NICE-Beam: Neural Integrated Covariance Estimators for Time-Varying Beamformers. CoRR abs/2112.04613 (2021) - 2020
- [i1]Ke Tan, Buye Xu, Anurag Kumar, Eliya Nachmani, Yossi Adi:
SAGRNN: Self-Attentive Gated RNN for Binaural Speaker Separation with Interaural Cue Preservation. CoRR abs/2009.01381 (2020)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-07 22:19 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint