default search action
Michael I. Mandel
Person information
- affiliation: City University of New York, Graduate Center, NY, USA
- affiliation: Ohio State University, Columbus, OH, USA
- affiliation (PhD 2010): Columbia University, New York, NY, USA
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j11]Ali Raza Syed, Enis Berk Çoban, Dara Pir, Michael I. Mandel:
Data-Centric Methods for Environmental Sound Classification With Limited Labels. IEEE ACM Trans. Audio Speech Lang. Process. 32: 4288-4297 (2024) - [c53]Enis Berk Çoban, Megan Perra, Michael I. Mandel:
Towards High Resolution Weather Monitoring With Sound Data. ICASSP 2024: 1306-1310 - [i14]Enis Berk Çoban, Michael I. Mandel, Johanna Devaney:
What do MLLMs hear? Examining reasoning with text and sound components in Multimodal Large Language Models. CoRR abs/2406.04615 (2024) - 2023
- [c52]Ali Raza Syed, Michael I. Mandel:
Estimating Shapley Values of Training Utterances for Automatic Speech Recognition Models. ICASSP 2023: 1-5 - [i13]Enis Berk Çoban, Megan Perra, Michael I. Mandel:
Towards High Resolution Weather Monitoring with Sound Data. CoRR abs/2309.16867 (2023) - 2022
- [c51]Enis Berk Çoban, Megan Perra, Dara Pir, Michael I. Mandel:
EDANSA-2019: The Ecoacoustic Dataset from Arctic North Slope Alaska. DCASE 2022 - [c50]Viet Anh Trinh, Hassan Salami Kavaki, Michael I. Mandel:
Importantaug: A Data Augmentation Agent for Speech. ICASSP 2022: 8592-8596 - 2021
- [j10]Viet Anh Trinh, Michael I. Mandel:
Directly Comparing the Listening Strategies of Humans and Machines. IEEE ACM Trans. Audio Speech Lang. Process. 29: 312-323 (2021) - [c49]Zhaoheng Ni, Yong Xu, Meng Yu, Bo Wu, Shi-Xiong Zhang, Dong Yu, Michael I. Mandel:
WPD++: An Improved Neural Beamformer for Simultaneous Speech Separation and Dereverberation. SLT 2021: 817-824 - [c48]Enis Berk Çoban, Ali Raza Syed, Dara Pir, Michael I. Mandel:
Towards Large Scale Ecoacoustic Monitoring with Small Amounts of Labeled Data. WASPAA 2021: 181-185 - [i12]Viet Anh Trinh, Hassan Salami Kavaki, Michael I. Mandel:
ImportantAug: a data augmentation agent for speech. CoRR abs/2112.07156 (2021) - 2020
- [c47]Soumi Maiti, Michael I. Mandel:
Speaker Independence of Neural Vocoders and Their Effect on Parametric Resynthesis Speech Enhancement. ICASSP 2020: 206-210 - [c46]Enis Berk Çoban, Dara Pir, Richard So, Michael I. Mandel:
Transfer Learning from Youtube Soundtracks to Tag Arctic Ecoacoustic Recordings. ICASSP 2020: 726-730 - [c45]Zhaoheng Ni, Michael I. Mandel:
Mask-Dependent Phase Estimation for Monaural Speaker Separation. ICASSP 2020: 7269-7273 - [c44]Viet Anh Trinh, Michael I. Mandel:
Large Scale Evaluation of Importance Maps in Automatic Speech Recognition. INTERSPEECH 2020: 1166-1170 - [c43]Hassan Salami Kavaki, Michael I. Mandel:
Identifying Important Time-Frequency Locations in Continuous Speech Utterances. INTERSPEECH 2020: 1639-1643 - [i11]Shinji Watanabe, Michael I. Mandel, Jon Barker, Emmanuel Vincent:
CHiME-6 Challenge: Tackling Multispeaker Speech Recognition for Unsegmented Recordings. CoRR abs/2004.09249 (2020) - [i10]Viet Anh Trinh, Michael I. Mandel:
Large scale evaluation of importance maps in automatic speech recognition. CoRR abs/2005.10929 (2020) - [i9]Zhaoheng Ni, Yong Xu, Meng Yu, Bo Wu, Shi-Xiong Zhang, Dong Yu, Michael I. Mandel:
WPD++: An Improved Neural Beamformer for Simultaneous Speech Separation and Dereverberation. CoRR abs/2011.09162 (2020) - [i8]Félix Grèzes, Zhaoheng Ni, Viet Anh Trinh, Michael I. Mandel:
Enhancement of Spatial Clustering-Based Time-Frequency Masks using LSTM Neural Networks. CoRR abs/2012.01576 (2020) - [i7]Zhaoheng Ni, Félix Grèzes, Viet Anh Trinh, Michael I. Mandel:
Improved MVDR Beamforming Using LSTM Speech Models to Clean Spatial Clustering Masks. CoRR abs/2012.02191 (2020) - [i6]Félix Grèzes, Zhaoheng Ni, Viet Anh Trinh, Michael I. Mandel:
Combining Spatial Clustering with LSTM Speech Models for Multichannel Speech Enhancement. CoRR abs/2012.03388 (2020)
2010 – 2019
- 2019
- [c42]Soumi Maiti, Michael I. Mandel:
Speech Denoising by Parametric Resynthesis. ICASSP 2019: 6995-6999 - [c41]Soumi Maiti, Michael I. Mandel:
Parametric Resynthesis With Neural Vocoders. WASPAA 2019: 303-307 - [e2]Michael I. Mandel, Justin Salamon, Daniel P. W. Ellis:
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events 2019 (DCASE 2019), New York University, NY, USA, October 2019. 2019, ISBN 978-0-578-59596-2 [contents] - [i5]Soumi Maiti, Michael I. Mandel:
Speech denoising by parametric resynthesis. CoRR abs/1904.01537 (2019) - [i4]Soumi Maiti, Michael I. Mandel:
Parametric Resynthesis with neural vocoders. CoRR abs/1906.06762 (2019) - [i3]Zhaoheng Ni, Michael I. Mandel:
Onssen: an open-source speech separation and enhancement library. CoRR abs/1911.00982 (2019) - [i2]Soumi Maiti, Michael I. Mandel:
Speaker independence of neural vocoders and their effect on parametric resynthesis speech enhancement. CoRR abs/1911.06266 (2019) - 2018
- [c40]Soumi Maiti, Joey Ching, Michael I. Mandel:
Large Vocabulary Concatenative Resynthesis. INTERSPEECH 2018: 1190-1194 - [c39]Ali Raza Syed, Viet Anh Trinh, Michael I. Mandel:
Concatenative Resynthesis with Improved Training Signals for Speech Enhancement. INTERSPEECH 2018: 1195-1199 - [c38]Viet Anh Trinh, Brian McFee, Michael I. Mandel:
Bubble Cooperative Networks for Identifying Important Speech Cues. INTERSPEECH 2018: 1616-1620 - [c37]Zhaoheng Ni, Rutuja Ubale, Yao Qian, Michael I. Mandel, Su-Youn Yoon, Abhinav Misra, David Suendermann-Oeft:
Unusable Spoken Response Detection with BLSTM Neural Networks. ISCSLP 2018: 255-259 - 2017
- [c36]Zhaoheng Ni, Ahmet Cem Yuksel, Xiuyan Ni, Michael I. Mandel, Lei Xie:
Confused or not Confused?: Disentangling Brain Activity from EEG Data Using Bidirectional LSTM Recurrent Neural Networks. BCB 2017: 241-246 - [c35]Hussein Ghaly, Michael I. Mandel:
Analyzing Human and Machine Performance In Resolving Ambiguous Spoken Sentences. SCNLP@EMNLP 2017 2017: 18-26 - [c34]Johanna Devaney, Michael I. Mandel:
An evaluation of score-informed methods for estimating fundamental frequency and power from polyphonic audio. ICASSP 2017: 181-185 - [c33]Ali Raza Syed, Andrew Rosenberg, Michael I. Mandel:
Active learning for low-resource speech recognition: Impact of selection size and language modeling data. ICASSP 2017: 5315-5319 - [c32]Soumi Maiti, Michael I. Mandel:
Concatenative Resynthesis Using Twin Networks. INTERSPEECH 2017: 3647-3651 - [p2]Michael I. Mandel, Jon P. Barker:
Multichannel Spatial Clustering Using Model-Based Source Separation. New Era for Robust Speech Recognition, Exploiting Deep Learning 2017: 51-77 - [p1]Xiong Xiao, Shinji Watanabe, Hakan Erdogan, Michael I. Mandel, Liang Lu, John R. Hershey, Michael L. Seltzer, Guoguo Chen, Yu Zhang, Dong Yu:
Discriminative Beamforming with Phase-Aware Neural Networks for Speech Enhancement and Recognition. New Era for Robust Speech Recognition, Exploiting Deep Learning 2017: 79-104 - 2016
- [c31]Xiong Xiao, Shinji Watanabe, Hakan Erdogan, Liang Lu, John R. Hershey, Michael L. Seltzer, Guoguo Chen, Yu Zhang, Michael I. Mandel, Dong Yu:
Deep beamforming networks for multi-channel speech recognition. ICASSP 2016: 5745-5749 - [c30]Michael I. Mandel:
Directly Comparing the Listening Strategies of Humans and Machines. INTERSPEECH 2016: 660-664 - [c29]Hakan Erdogan, John R. Hershey, Shinji Watanabe, Michael I. Mandel, Jonathan Le Roux:
Improved MVDR Beamforming Using Single-Channel Mask Prediction Networks. INTERSPEECH 2016: 1981-1985 - [c28]Michael I. Mandel, Jon Barker:
Multichannel Spatial Clustering for Robust Far-Field Automatic Speech Recognition in Mismatched Conditions. INTERSPEECH 2016: 1991-1995 - [e1]Michael I. Mandel, Johanna Devaney, Douglas Turnbull, George Tzanetakis:
Proceedings of the 17th International Society for Music Information Retrieval Conference, ISMIR 2016, New York City, United States, August 7-11, 2016. 2016, ISBN 978-0-692-75506-8 [contents] - 2015
- [c27]Deblin Bagchi, Michael I. Mandel, Zhongqiu Wang, Yanzhang He, Andrew R. Plummer, Eric Fosler-Lussier:
Combining spectral feature mapping and multi-channel model-based source separation for noise-robust automatic speech recognition. ASRU 2015: 496-503 - [c26]Michael I. Mandel, Nicoleta Roman:
Enforcing consistency in spectral masks using Markov random fields. EUSIPCO 2015: 2028-2032 - [c25]Michael I. Mandel, Young Suk Cho:
Audio super-resolution using concatenative resynthesis. WASPAA 2015: 1-5 - [c24]Sreyas Srimath Tirumala, Michael I. Mandel:
Exciting estimated clean spectra for speech resynthesis. WASPAA 2015: 1-5 - 2014
- [c23]Michael I. Mandel, Young Suk Cho, Yuxuan Wang:
Learning a concatenative resynthesis system for noise suppression. GlobalSIP 2014: 582-586 - [c22]Michael I. Mandel, Arun Narayanan:
Analysis-by-synthesis feature estimation for robust automatic speech recognition using spectral masks. ICASSP 2014: 2509-2513 - [c21]Michael I. Mandel, Sarah E. Yoho, Eric W. Healy:
Generalizing time-frequency importance functions across noises, talkers, and phonemes. INTERSPEECH 2014: 2016-2020 - 2013
- [j9]Lilong Jiang, Michael I. Mandel, Arnab Nandi:
GestureQuery: A Multitouch Database Query Interface. Proc. VLDB Endow. 6(12): 1342-1345 (2013) - [j8]Arnab Nandi, Lilong Jiang, Michael I. Mandel:
Gestural Query Specification. Proc. VLDB Endow. 7(4): 289-300 (2013) - [c20]Arnab Nandi, Michael I. Mandel:
The interactive join: recognizing gestures for database queries. CHI Extended Abstracts 2013: 1203-1208 - [c19]Nicoleta Roman, Michael I. Mandel:
Classification based binaural dereverberation. INTERSPEECH 2013: 3249-3253 - [c18]Michael I. Mandel:
Learning an intelligibility map of individual utterances. WASPAA 2013: 1-4 - 2012
- [j7]Hugo Larochelle, Michael I. Mandel, Razvan Pascanu, Yoshua Bengio:
Learning Algorithms for the Classification Restricted Boltzmann Machine. J. Mach. Learn. Res. 13: 643-669 (2012) - [c17]Johanna Devaney, Michael I. Mandel, Ichiro Fujinaga:
A Study of Intonation in Three-Part Singing using the Automatic Music Performance Analysis and Comparison Toolkit (AMPACT). ISMIR 2012: 511-516 - 2011
- [j6]Ron J. Weiss, Michael I. Mandel, Daniel P. W. Ellis:
Combining localization cues and source model constraints for binaural source separation. Speech Commun. 53(5): 606-621 (2011) - [j5]Michael I. Mandel, Razvan Pascanu, Douglas Eck, Yoshua Bengio, Luca Maria Aiello, Rossano Schifanella, Filippo Menczer:
Contextual tag inference. ACM Trans. Multim. Comput. Commun. Appl. 7(Supplement): 32 (2011) - [c16]Johanna Devaney, Michael I. Mandel, Ichiro Fujinaga:
Characterizing singing voice fundamental frequency trajectories. WASPAA 2011: 73-76 - [i1]Michael I. Mandel, Razvan Pascanu, Hugo Larochelle, Yoshua Bengio:
Autotagging music with conditional restricted Boltzmann machines. CoRR abs/1103.2832 (2011) - 2010
- [j4]Michael I. Mandel, Ron J. Weiss, Daniel P. W. Ellis:
Model-Based Expectation-Maximization Source Separation and Localization. IEEE Trans. Speech Audio Process. 18(2): 382-394 (2010) - [j3]Michael I. Mandel, Scott Bressler, Barbara G. Shinn-Cunningham, Daniel P. W. Ellis:
Evaluating Source Separation Algorithms With Reverberant Speech. IEEE Trans. Speech Audio Process. 18(7): 1872-1883 (2010) - [c15]Michael I. Mandel, Douglas Eck, Yoshua Bengio:
Learning Tags that Vary Within a Song. ISMIR 2010: 399-404 - [c14]James Bergstra, Michael I. Mandel, Douglas Eck:
Scalable Genre and Tag Prediction with Spectral Covariance. ISMIR 2010: 507-512
2000 – 2009
- 2009
- [c13]Edith Law, Kris West, Michael I. Mandel, Mert Bay, J. Stephen Downie:
Evaluation of Algorithms Using Games: The Case of Music Tagging. ISMIR 2009: 387-392 - [c12]Johanna Devaney, Michael I. Mandel, Daniel P. W. Ellis:
Improving MIDI-audio alignment with acoustic features. WASPAA 2009: 45-48 - [c11]Michael I. Mandel, Daniel P. W. Ellis:
The Ideal Interaural Parameter Mask: A bound on binaural separation systems. WASPAA 2009: 85-88 - 2008
- [j2]Thomas S. Huang, Charlie K. Dagli, Shyamsundar Rajaram, Edward Y. Chang, Michael I. Mandel, Graham E. Poliner, Daniel P. W. Ellis:
Active Learning for Interactive Multimedia Retrieval. Proc. IEEE 96(4): 648-667 (2008) - [c10]Daniel P. W. Ellis, Courtenay V. Cotton, Michael I. Mandel:
Cross-correlation of beat-synchronous representations for music similarity. ICASSP 2008: 57-60 - [c9]Ron J. Weiss, Michael I. Mandel, Daniel P. W. Ellis:
Source separation based on binaural cues and source model constraints. INTERSPEECH 2008: 419-422 - [c8]Michael I. Mandel, Daniel P. W. Ellis:
Multiple-Instance Learning for Music Information Retrieval. ISMIR 2008: 577-582 - 2007
- [c7]Michael I. Mandel, Daniel P. W. Ellis:
A Web-Based Game for Collecting Music Metadata. ISMIR 2007: 365-366 - 2006
- [j1]Michael I. Mandel, Graham E. Poliner, Daniel P. W. Ellis:
Support vector machine active learning for music retrieval. Multim. Syst. 12(1): 3-13 (2006) - [c6]Michael I. Mandel, Daniel P. W. Ellis:
A probability model for interaural phase difference. SAPA@INTERSPEECH 2006: 1-6 - [c5]Michael I. Mandel, Daniel P. W. Ellis, Tony Jebara:
An EM Algorithm for Localizing Multiple Sound Sources in Reverberant Environments. NIPS 2006: 953-960 - 2005
- [c4]Michael I. Mandel, Dan Ellis:
Song-Level Features and Support Vector Machines for Music Classification. ISMIR 2005: 594-599 - 2004
- [c3]Gita Sukthankar, Michael I. Mandel, Katia Sycara-Cyranski, Jessica K. Hodgins:
Modeling Physical Capabilities of Humanoid Agents Using Motion Capture Dat. AAMAS 2004: 344-351 - [c2]Erik B. Sudderth, Michael I. Mandel, William T. Freeman, Alan S. Willsky:
Visual Hand Tracking Using Nonparametric Belief Propagation. CVPR Workshops 2004: 189 - [c1]Erik B. Sudderth, Michael I. Mandel, William T. Freeman, Alan S. Willsky:
Distributed Occlusion Reasoning for Tracking with Nonparametric Belief Propagation. NIPS 2004: 1369-1376
Coauthor Index
aka: Dan Ellis
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-11-20 22:00 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint