default search action
Samuele Cornell
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j7]Carlo Aironi, Samuele Cornell, Stefano Squartini:
A Graph-Based Neural Approach to Linear Sum Assignment Problems. Int. J. Neural Syst. 34(3): 2450011:1-2450011:18 (2024) - [j6]Giovanni Morrone, Samuele Cornell, Luca Serafini, Enrico Zovato, Alessio Brutti, Stefano Squartini:
End-to-end integration of speech separation and voice activity detection for low-latency diarization of telephone conversations. Speech Commun. 161: 103081 (2024) - [c31]Luca Della Libera, Cem Subakan, Mirco Ravanelli, Samuele Cornell, Frédéric Lepoutre, François Grondin:
Resource-Efficient Separation Transformer. ICASSP 2024: 761-765 - [c30]Samuele Cornell, Jee-Weon Jung, Shinji Watanabe, Stefano Squartini:
One Model to Rule Them All ? Towards End-to-End Joint Speaker Diarization and Speech Recognition. ICASSP 2024: 11856-11860 - [i32]Wangyou Zhang, Robin Scheibler, Kohei Saijo, Samuele Cornell, Chenda Li, Zhaoheng Ni, Anurag Kumar, Jan Pirklbauer, Marvin Sach, Shinji Watanabe, Tim Fingscheidt, Yanmin Qian:
URGENT Challenge: Universality, Robustness, and Generalizability For Speech Enhancement. CoRR abs/2406.04660 (2024) - [i31]Samuele Cornell, Janek Ebbers, Constance Douwes, Irene Martín-Morató, Manu Harju, Annamaria Mesaros, Romain Serizel:
DCASE 2024 Task 4: Sound Event Detection with Heterogeneous Data and Missing Labels. CoRR abs/2406.08056 (2024) - [i30]Chenda Li, Samuele Cornell, Shinji Watanabe, Yanmin Qian:
Diffusion-based Generative Modeling with Discriminative Guidance for Streamable Speech Enhancement. CoRR abs/2406.13471 (2024) - [i29]Samuele Cornell, Taejin Park, Steve Huang, Christoph Böddeker, Xuankai Chang, Matthew Maciejewski, Matthew Wiesner, Paola García, Shinji Watanabe:
The CHiME-8 DASR Challenge for Generalizable and Array Agnostic Distant Automatic Speech Recognition and Diarization. CoRR abs/2407.16447 (2024) - [i28]Samuele Cornell, Jordan Darefsky, Zhiyao Duan, Shinji Watanabe:
Generating Data with Text-to-Speech and Large-Language Models for Conversational Speech Recognition. CoRR abs/2408.09215 (2024) - [i27]Masao Someki, Kwanghee Choi, Siddhant Arora, William Chen, Samuele Cornell, Jionghao Han, Yifan Peng, Jiatong Shi, Vaibhav Srivastav, Shinji Watanabe:
ESPnet-EZ: Python-only ESPnet for Easy Fine-tuning and Integration. CoRR abs/2409.09506 (2024) - 2023
- [j5]Luca Serafini, Samuele Cornell, Giovanni Morrone, Enrico Zovato, Alessio Brutti, Stefano Squartini:
An experimental review of speaker diarization methods with application to two-speaker conversational telephone speech recordings. Comput. Speech Lang. 82: 101534 (2023) - [j4]Yen-Ju Lu, Xuankai Chang, Chenda Li, Wangyou Zhang, Samuele Cornell, Zhaoheng Ni, Yoshiki Masuyama, Brian Yan, Robin Scheibler, Zhong-Qiu Wang, Yu Tsao, Yanmin Qian, Shinji Watanabe:
Software Design and User Interface of ESPnet-SE++: Speech Enhancement for Robust Speech Processing. J. Open Source Softw. 8(91): 5403 (2023) - [j3]Cem Subakan, Mirco Ravanelli, Samuele Cornell, François Grondin, Mirko Bronzi:
Exploring Self-Attention Mechanisms for Speech Separation. IEEE ACM Trans. Audio Speech Lang. Process. 31: 2169-2180 (2023) - [j2]Zhong-Qiu Wang, Samuele Cornell, Shukjae Choi, Younglo Lee, Byeong-Yeol Kim, Shinji Watanabe:
TF-GridNet: Integrating Full- and Sub-Band Modeling for Speech Separation. IEEE ACM Trans. Audio Speech Lang. Process. 31: 3221-3236 (2023) - [c29]Carlo Aironi, Samuele Cornell, Luca Serafini, Stefano Squartini:
A Time-Frequency Generative Adversarial Based Method for Audio Packet Loss Concealment. EUSIPCO 2023: 121-125 - [c28]Samuele Cornell, Zhong-Qiu Wang, Yoshiki Masuyama, Shinji Watanabe, Manuel Pariente, Nobutaka Ono, Stefano Squartini:
Multi-Channel Speaker Extraction with Adversarial Training: The Wavlab Submission to The Clarity ICASSP 2023 Grand Challenge. ICASSP 2023: 1-2 - [c27]Romain Serizel, Samuele Cornell, Nicolas Turpault:
Performance Above All? Energy Consumption vs. Performance, a Study on Sound Event Detection with Heterogeneous Data. ICASSP 2023: 1-5 - [c26]Zhong-Qiu Wang, Samuele Cornell, Shukjae Choi, Younglo Lee, Byeong-Yeol Kim, Shinji Watanabe:
TF-GRIDNET: Making Time-Frequency Domain Models Great Again for Monaural Speaker Separation. ICASSP 2023: 1-5 - [c25]Zhong-Qiu Wang, Samuele Cornell, Shukjae Choi, Younglo Lee, Byeong-Yeol Kim, Shinji Watanabe:
FNeural Speech Enhancement with Very Low Algorithmic Latency and Complexity via Integrated full- and sub-band Modeling. ICASSP 2023: 1-5 - [c24]Carlo Aironi, Samuele Cornell, Leonardo Gabrielli, Stefano Squartini:
A Score-aware Generative Approach for Music Signals Inpainting. IS2 2023: 1-7 - [c23]Yoshiki Masuyama, Xuankai Chang, Wangyou Zhang, Samuele Cornell, Zhong-Qiu Wang, Nobutaka Ono, Yanmin Qian, Shinji Watanabe:
Exploring the Integration of Speech Separation and Recognition with Self-Supervised Learning Representation. WASPAA 2023: 1-5 - [d1]Yen-Ju Lu, Xuankai Chang, Chenda Li, Wangyou Zhang, Samuele Cornell, Zhaoheng Ni, Yoshiki Masuyama, Brian Yan, Robin Scheibler, Zhong-Qiu Wang, Yu Tsao, Yanmin Qian, Shinji Watanabe:
Software Design and User Interface of ESPnet-SE++: Speech Enhancement for Robust Speech Processing (espnet-v.202310). Zenodo, 2023 - [i26]Samuele Cornell, Zhong-Qiu Wang, Yoshiki Masuyama, Shinji Watanabe, Manuel Pariente, Nobutaka Ono:
Multi-Channel Target Speaker Extraction with Refinement: The WavLab Submission to the Second Clarity Enhancement Challenge. CoRR abs/2302.07928 (2023) - [i25]Giovanni Morrone, Samuele Cornell, Luca Serafini, Enrico Zovato, Alessio Brutti, Stefano Squartini:
End-to-End Integration of Speech Separation and Voice Activity Detection for Low-Latency Diarization of Telephone Conversations. CoRR abs/2303.12002 (2023) - [i24]Zhong-Qiu Wang, Samuele Cornell, Shukjae Choi, Younglo Lee, Byeong-Yeol Kim, Shinji Watanabe:
Neural Speech Enhancement with Very Low Algorithmic Latency and Complexity via Integrated Full- and Sub-Band Modeling. CoRR abs/2304.08707 (2023) - [i23]Luca Serafini, Samuele Cornell, Giovanni Morrone, Enrico Zovato, Alessio Brutti, Stefano Squartini:
An Experimental Review of Speaker Diarization methods with application to Two-Speaker Conversational Telephone Speech recordings. CoRR abs/2305.18074 (2023) - [i22]Samuele Cornell, Matthew Wiesner, Shinji Watanabe, Desh Raj, Xuankai Chang, Paola García, Yoshiki Masuyama, Zhong-Qiu Wang, Stefano Squartini, Sanjeev Khudanpur:
The CHiME-7 DASR Challenge: Distant Meeting Transcription with Multiple Devices in Diverse Scenarios. CoRR abs/2306.13734 (2023) - [i21]Yoshiki Masuyama, Xuankai Chang, Wangyou Zhang, Samuele Cornell, Zhong-Qiu Wang, Nobutaka Ono, Yanmin Qian, Shinji Watanabe:
Exploring the Integration of Speech Separation and Recognition with Self-Supervised Learning Representation. CoRR abs/2307.12231 (2023) - [i20]Samuele Cornell, Jee-weon Jung, Shinji Watanabe, Stefano Squartini:
One model to rule them all ? Towards End-to-End Joint Speaker Diarization and Speech Recognition. CoRR abs/2310.01688 (2023) - [i19]Jeff Hwang, Moto Hira, Caroline Chen, Xiaohui Zhang, Zhaoheng Ni, Guangzhi Sun, Pingchuan Ma, Ruizhe Huang, Vineel Pratap, Yuekai Zhang, Anurag Kumar, Chin-Yun Yu, Chuang Zhu, Chunxi Liu, Jacob Kahn, Mirco Ravanelli, Peng Sun, Shinji Watanabe, Yangyang Shi, Yumeng Tao, Robin Scheibler, Samuele Cornell, Sean Kim, Stavros Petridis:
TorchAudio 2.1: Advancing speech recognition, self-supervised learning, and audio processing components for PyTorch. CoRR abs/2310.17864 (2023) - 2022
- [j1]Samuele Cornell, Maurizio Omologo, Stefano Squartini, Emmanuel Vincent:
Overlapped Speech Detection and speaker counting using distant microphone arrays. Comput. Speech Lang. 72: 101306 (2022) - [c22]Carlo Aironi, Samuele Cornell, Stefano Squartini:
Tackling the Linear Sum Assignment Problem with Graph Neural Networks. AII 2022: 90-101 - [c21]Francesca Ronchini, Samuele Cornell, Romain Serizel, Nicolas Turpault, Eduardo Fonseca, Daniel P. W. Ellis:
Description and Analysis of Novelties Introduced in DCASE Task 4 2022 on the Baseline System. DCASE 2022 - [c20]Carlo Aironi, Samuele Cornell, Emanuele Principi, Stefano Squartini:
Graph Node Embeddings for ontology-aware Sound Event Classification: an evaluation study. EUSIPCO 2022: 414-418 - [c19]Samuele Cornell, Manuel Pariente, François Grondin, Stefano Squartini:
Learning Filterbanks for End-to-End Acoustic Beamforming. ICASSP 2022: 6507-6511 - [c18]Cem Subakan, Mirco Ravanelli, Samuele Cornell, François Grondin:
Real-M: Towards Speech Separation on Real Mixtures. ICASSP 2022: 6862-6866 - [c17]Yen-Ju Lu, Samuele Cornell, Xuankai Chang, Wangyou Zhang, Chenda Li, Zhaoheng Ni, Zhong-Qiu Wang, Shinji Watanabe:
Towards Low-Distortion Multi-Channel Speech Enhancement: The ESPNET-Se Submission to the L3DAS22 Challenge. ICASSP 2022: 9201-9205 - [c16]Yen-Ju Lu, Xuankai Chang, Chenda Li, Wangyou Zhang, Samuele Cornell, Zhaoheng Ni, Yoshiki Masuyama, Brian Yan, Robin Scheibler, Zhong-Qiu Wang, Yu Tsao, Yanmin Qian, Shinji Watanabe:
ESPnet-SE++: Speech Enhancement for Robust Speech Recognition, Translation, and Understanding. INTERSPEECH 2022: 5458-5462 - [c15]Yoshiki Masuyama, Xuankai Chang, Samuele Cornell, Shinji Watanabe, Nobutaka Ono:
End-to-End Integration of Speech Recognition, Dereverberation, Beamforming, and Self-Supervised Learning Representation. SLT 2022: 260-265 - [c14]Giovanni Morrone, Samuele Cornell, Desh Raj, Luca Serafini, Enrico Zovato, Alessio Brutti, Stefano Squartini:
Low-Latency Speech Separation Guided Diarization for Telephone Conversations. SLT 2022: 641-646 - [c13]Samuele Cornell, Thomas Balestri, Thibaud Sénéchal:
Implicit Acoustic Echo Cancellation for Keyword Spotting and Device-Directed Speech Detection. SLT 2022: 1052-1058 - [i18]Cem Subakan, Mirco Ravanelli, Samuele Cornell, François Grondin, Mirko Bronzi:
On Using Transformers for Speech-Separation. CoRR abs/2202.02884 (2022) - [i17]Yen-Ju Lu, Samuele Cornell, Xuankai Chang, Wangyou Zhang, Chenda Li, Zhaoheng Ni, Zhong-Qiu Wang, Shinji Watanabe:
Towards Low-distortion Multi-channel Speech Enhancement: The ESPNet-SE Submission to The L3DAS22 Challenge. CoRR abs/2202.12298 (2022) - [i16]Cem Subakan, Mirco Ravanelli, Samuele Cornell, Frédéric Lepoutre, François Grondin:
Resource-Efficient Separation Transformer. CoRR abs/2206.09507 (2022) - [i15]Yen-Ju Lu, Xuankai Chang, Chenda Li, Wangyou Zhang, Samuele Cornell, Zhaoheng Ni, Yoshiki Masuyama, Brian Yan, Robin Scheibler, Zhong-Qiu Wang, Yu Tsao, Yanmin Qian, Shinji Watanabe:
ESPnet-SE++: Speech Enhancement for Robust Speech Recognition, Translation, and Understanding. CoRR abs/2207.09514 (2022) - [i14]Zhong-Qiu Wang, Samuele Cornell, Shukjae Choi, Younglo Lee, Byeong-Yeol Kim, Shinji Watanabe:
TF-GridNet: Making Time-Frequency Domain Models Great Again for Monaural Speaker Separation. CoRR abs/2209.03952 (2022) - [i13]Francesca Ronchini, Samuele Cornell, Romain Serizel, Nicolas Turpault, Eduardo Fonseca, Daniel P. W. Ellis:
Description and analysis of novelties introduced in DCASE Task 4 2022 on the baseline system. CoRR abs/2210.07856 (2022) - [i12]Yoshiki Masuyama, Xuankai Chang, Samuele Cornell, Shinji Watanabe, Nobutaka Ono:
End-to-End Integration of Speech Recognition, Dereverberation, Beamforming, and Self-Supervised Learning Representation. CoRR abs/2210.10742 (2022) - [i11]Zhong-Qiu Wang, Samuele Cornell, Shukjae Choi, Younglo Lee, Byeong-Yeol Kim, Shinji Watanabe:
TF-GridNet: Integrating Full- and Sub-Band Modeling for Speech Separation. CoRR abs/2211.12433 (2022) - 2021
- [c12]Francesca Ronchini, Romain Serizel, Nicolas Turpault, Samuele Cornell:
The Impact of Non-Target Events in Synthetic Soundscapes for Sound Event Detection. DCASE 2021: 115-119 - [c11]Carlo Aironi, Samuele Cornell, Emanuele Principi, Stefano Squartini:
Graph-based Representation of Audio signals for Sound Event Classification. EUSIPCO 2021: 566-570 - [c10]Cem Subakan, Mirco Ravanelli, Samuele Cornell, Mirko Bronzi, Jianyuan Zhong:
Attention Is All You Need In Speech Separation. ICASSP 2021: 21-25 - [c9]Samuele Cornell, Alessio Brutti, Marco Matassoni, Stefano Squartini:
Learning to Rank Microphones for Distant Speech Recognition. Interspeech 2021: 3855-3859 - [i10]Samuele Cornell, Alessio Brutti, Marco Matassoni, Stefano Squartini:
Learning to Rank Microphones for Distant Speech Recognition. CoRR abs/2104.02819 (2021) - [i9]Mirco Ravanelli, Titouan Parcollet, Peter Plantinga, Aku Rouhe, Samuele Cornell, Loren Lugosch, Cem Subakan, Nauman Dawalatabad, Abdelwahab Heba, Jianyuan Zhong, Ju-Chieh Chou, Sung-Lin Yeh, Szu-Wei Fu, Chien-Feng Liao, Elena Rastorgueva, François Grondin, William Aris, Hwidong Na, Yan Gao, Renato De Mori, Yoshua Bengio:
SpeechBrain: A General-Purpose Speech Toolkit. CoRR abs/2106.04624 (2021) - [i8]Francesca Ronchini, Romain Serizel, Nicolas Turpault, Samuele Cornell:
The impact of non-target events in synthetic soundscapes for sound event detection. CoRR abs/2109.14061 (2021) - [i7]Cem Subakan, Mirco Ravanelli, Samuele Cornell, François Grondin:
REAL-M: Towards Speech Separation on Real Mixtures. CoRR abs/2110.10812 (2021) - [i6]Samuele Cornell, Manuel Pariente, François Grondin, Stefano Squartini:
Learning Filterbanks for End-to-End Acoustic Beamforming. CoRR abs/2111.04614 (2021) - [i5]Samuele Cornell, Thomas Balestri, Thibaud Sénéchal:
Implicit Acoustic Echo Cancellation for Keyword Spotting and Device-Directed Speech Detection. CoRR abs/2111.10639 (2021) - 2020
- [c8]Samuele Cornell, Michel Olvera, Manuel Pariente, Giovanni Pepe, Emanuele Principi, Leonardo Gabrielli, Stefano Squartini:
Domain-Adversarial Training and Trainable Parallel Front-End for the DCASE 2020 Task 4 Sound Event Detection Challenge. DCASE 2020: 26-30 - [c7]Samuele Cornell, Michel Olvera, Manuel Pariente, Giovanni Pepe, Emanuele Principi, Leonardo Gabrielli, Stefano Squartini:
Task-Aware Separation for the DCASE 2020 Task 4 Sound Event Detection and Separation Challenge. DCASE 2020: 31-35 - [c6]Manuel Pariente, Samuele Cornell, Antoine Deleforge, Emmanuel Vincent:
Filterbank Design for End-to-end Speech Separation. ICASSP 2020: 6364-6368 - [c5]Samuele Cornell, Emanuele Principi, Stefano Squartini:
A Novel Adversarial Training Scheme for Deep Neural Network based Speech Enhancement. IJCNN 2020: 1-8 - [c4]Marco Severini, Emanuele Principi, Samuele Cornell, Leonardo Gabrielli, Stefano Squartini:
Who Cried When: Infant Cry Diarization with Dilated Fully-Convolutional Neural Networks. IJCNN 2020: 1-8 - [c3]Manuel Pariente, Samuele Cornell, Joris Cosentino, Sunit Sivasankaran, Efthymios Tzinis, Jens Heitkaemper, Michel Olvera, Fabian-Robert Stöter, Mathieu Hu, Juan M. Martín-Doñas, David Ditter, Ariel Frank, Antoine Deleforge, Emmanuel Vincent:
Asteroid: The PyTorch-Based Audio Source Separation Toolkit for Researchers. INTERSPEECH 2020: 2637-2641 - [c2]Samuele Cornell, Maurizio Omologo, Stefano Squartini, Emmanuel Vincent:
Detecting and Counting Overlapping Speakers in Distant Speech Scenarios. INTERSPEECH 2020: 3107-3111 - [i4]Manuel Pariente, Samuele Cornell, Joris Cosentino, Sunit Sivasankaran, Efthymios Tzinis, Jens Heitkaemper, Michel Olvera, Fabian-Robert Stöter, Mathieu Hu, Juan M. Martín-Doñas, David Ditter, Ariel Frank, Antoine Deleforge, Emmanuel Vincent:
Asteroid: the PyTorch-based audio source separation toolkit for researchers. CoRR abs/2005.04132 (2020) - [i3]Cem Subakan, Mirco Ravanelli, Samuele Cornell, Mirko Bronzi, Jianyuan Zhong:
Attention is All You Need in Speech Separation. CoRR abs/2010.13154 (2020)
2010 – 2019
- 2019
- [c1]Alessandro Terenzi, Valeria Bruschi, Samuele Cornell, Andrea Castellani, Stefania Cecchi:
A Multiband Structure based on Hammerstein Model for Nonlinear Audio System Identification. ISPA 2019: 9-14 - [i2]Manuel Pariente, Samuele Cornell, Antoine Deleforge, Emmanuel Vincent:
Filterbank design for end-to-end speech separation. CoRR abs/1910.10400 (2019) - [i1]Md. Sahidullah, Jose Patino, Samuele Cornell, Ruiqing Yin, Sunit Sivasankaran, Hervé Bredin, Pavel Korshunov, Alessio Brutti, Romain Serizel, Emmanuel Vincent, Nicholas W. D. Evans, Sébastien Marcel, Stefano Squartini, Claude Barras:
The Speed Submission to DIHARD II: Contributions & Lessons Learned. CoRR abs/1911.02388 (2019)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-11-14 22:01 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint