default search action
Ming Sun 0007
Person information
- affiliation: Amazon Alexa
Other persons with the same name
- Ming Sun — disambiguation page
- Ming Sun 0001 — Carnegie Mellon University, School of Computer Science, Pittsburgh, PA, USA
- Ming Sun 0002 — University of Missouri, Department of Electrical and Computer Engineering, Columbia, MO, USA
- Ming Sun 0003 — Qiqihar University, College of Computer and Control Engineering, China (and 2 more)
- Ming Sun 0004 — Heriot Watt University, School of the Built Environment, Institute for Building and Urban Design, Edinburgh, UK
- Ming Sun 0005 — Arizona State University, Tempe, AZ, USA
- Ming Sun 0006 — Nankai University, School of Computer Science and Control Engineering, Tianjin, China
- Ming Sun 0008 — SenseTime Research, Beijing, China (and 1 more)
- Ming Sun 0009 — Nanjing University of Posts and Telecommunications, Nanjing, China
- Ming Sun 0010 — Institute of Information Engineering, Chinese Academy of Sciences, China
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2022
- [c31]Qin Zhang, Qingming Tang, Chieh-Chi Kao, Ming Sun, Yang Liu, Chao Wang:
Wikitag: Wikipedia-Based Knowledge Embeddings Towards Improved Acoustic Event Classification. ICASSP 2022: 136-140 - [c30]Arman Zharmagambetov, Qingming Tang, Chieh-Chi Kao, Qin Zhang, Ming Sun, Viktor Rozgic, Jasha Droppo, Chao Wang:
Improved Representation Learning For Acoustic Event Classification Using Tree-Structured Ontology. ICASSP 2022: 321-325 - [c29]Meng Feng, Chieh-Chi Kao, Qingming Tang, Ming Sun, Viktor Rozgic, Spyros Matsoukas, Chao Wang:
Federated Self-Supervised Learning for Acoustic Event Classification. ICASSP 2022: 481-485 - [c28]Rahil Parikh, Harshavardhan Sundar, Ming Sun, Chao Wang, Spyros Matsoukas:
Impact of Acoustic Event Tagging on Scene Classification in a Multi-Task Learning Framework. INTERSPEECH 2022: 4192-4196 - [i15]Meng Feng, Chieh-Chi Kao, Qingming Tang, Ming Sun, Viktor Rozgic, Spyros Matsoukas, Chao Wang:
Federated Self-Supervised Learning for Acoustic Event Classification. CoRR abs/2203.11997 (2022) - [i14]Rahil Parikh, Harshavardhan Sundar, Ming Sun, Chao Wang, Spyros Matsoukas:
Impact of Acoustic Event Tagging on Scene Classification in a Multi-Task Learning Framework. CoRR abs/2206.13476 (2022) - 2021
- [c27]Anthea Cheung, Qingming Tang, Chieh-Chi Kao, Ming Sun, Chao Wang:
Improved Student Model Training for Acoustic Event Detection Models. DCASE 2021: 181-185 - [c26]Hsin-Ping Huang, Krishna C. Puvvada, Ming Sun, Chao Wang:
Unsupervised and Semi-Supervised Few-Shot Acoustic Event Classification. ICASSP 2021: 331-335 - [c25]Ho-Hsiang Wu, Chieh-Chi Kao, Qingming Tang, Ming Sun, Brian McFee, Juan Pablo Bello, Chao Wang:
Multi-Task Self-Supervised Pre-Training for Music Classification. ICASSP 2021: 556-560 - [i13]Ho-Hsiang Wu, Chieh-Chi Kao, Qingming Tang, Ming Sun, Brian McFee, Juan Pablo Bello, Chao Wang:
Multi-Task Self-Supervised Pre-Training for Music Classification. CoRR abs/2102.03229 (2021) - 2020
- [c24]Bowen Shi, Ming Sun, Krishna C. Puvvada, Chieh-Chi Kao, Spyros Matsoukas, Chao Wang:
Few-Shot Acoustic Event Detection Via Meta Learning. ICASSP 2020: 76-80 - [c23]Chieh-Chi Kao, Ming Sun, Weiran Wang, Chao Wang:
A Comparison of Pooling Methods on LSTM Models for Rare Acoustic Event Classification. ICASSP 2020: 316-320 - [c22]Harshavardhan Sundar, Weiran Wang, Ming Sun, Chao Wang:
Raw Waveform Based End-to-end Deep Convolutional Network for Spatial Localization of Multiple Acoustic Sources. ICASSP 2020: 4642-4646 - [c21]Chieh-Chi Kao, Bowen Shi, Ming Sun, Chao Wang:
A Joint Framework for Audio Tagging and Weakly Supervised Acoustic Event Detection Using DenseNet with Global Average Pooling. INTERSPEECH 2020: 846-850 - [c20]Chun-Chieh Chang, Chieh-Chi Kao, Ming Sun, Chao Wang:
Intra-Utterance Similarity Preserving Knowledge Distillation for Audio Tagging. INTERSPEECH 2020: 851-855 - [c19]Yixin Gao, Noah D. Stein, Chieh-Chi Kao, Yunliang Cai, Ming Sun, Tao Zhang, Shiv Naga Prasad Vitaladevuni:
On Front-End Gain Invariant Modeling for Wake Word Spotting. INTERSPEECH 2020: 991-995 - [c18]Weimin Wang, Weiran Wang, Ming Sun, Chao Wang:
Acoustic Scene Analysis with Multi-Head Attention Networks. INTERSPEECH 2020: 1191-1195 - [i12]Chieh-Chi Kao, Ming Sun, Weiran Wang, Chao Wang:
A Comparison of Pooling Methods on LSTM Models for Rare Acoustic Event Classification. CoRR abs/2002.06279 (2020) - [i11]Bowen Shi, Ming Sun, Krishna C. Puvvada, Chieh-Chi Kao, Spyros Matsoukas, Chao Wang:
Few-shot acoustic event detection via meta-learning. CoRR abs/2002.09143 (2020) - [i10]Chieh-Chi Kao, Bowen Shi, Ming Sun, Chao Wang:
A Joint Framework for Audio Tagging and Weakly Supervised Acoustic Event Detection Using DenseNet with Global Average Pooling. CoRR abs/2008.03350 (2020) - [i9]Chun-Chieh Chang, Chieh-Chi Kao, Ming Sun, Chao Wang:
Intra-Utterance Similarity Preserving Knowledge Distillation for Audio Tagging. CoRR abs/2009.01759 (2020) - [i8]Yixin Gao, Noah D. Stein, Chieh-Chi Kao, Yunliang Cai, Ming Sun, Tao Zhang, Shiv Vitaladevuni:
On Front-end Gain Invariant Modeling for Wake Word Spotting. CoRR abs/2010.06676 (2020)
2010 – 2019
- 2019
- [c17]Bowen Shi, Ming Sun, Chieh-Chi Kao, Viktor Rozgic, Spyros Matsoukas, Chao Wang:
Semi-supervised Acoustic Event Detection Based on Tri-training. ICASSP 2019: 750-754 - [c16]Vipul Arora, Ming Sun, Chao Wang:
Deep Embeddings for Rare Audio Event Detection with Imbalanced Data. ICASSP 2019: 3297-3301 - [c15]Qingming Tang, Ming Sun, Chieh-Chi Kao, Viktor Rozgic, Chao Wang:
Hierarchical Residual-pyramidal Model for Large Context Based Media Presence Detection. ICASSP 2019: 3312-3316 - [c14]Srinivas Parthasarathy, Viktor Rozgic, Ming Sun, Chao Wang:
Improving Emotion Classification through Variational Inference of Latent Variables. ICASSP 2019: 7410-7414 - [c13]Yuriy Mishchenko, Yusuf Goren, Ming Sun, Chris Beauchene, Spyros Matsoukas, Oleg Rybakov, Shiv Naga Prasad Vitaladevuni:
Low-Bit Quantization and Quantization-Aware Training for Small-Footprint Keyword Spotting. ICMLA 2019: 706-711 - [c12]Chieh-Chi Kao, Ming Sun, Yixin Gao, Shiv Vitaladevuni, Chao Wang:
Sub-Band Convolutional Neural Networks for Small-Footprint Spoken Term Classification. INTERSPEECH 2019: 2195-2199 - [c11]Bowen Shi, Ming Sun, Chieh-Chi Kao, Viktor Rozgic, Spyros Matsoukas, Chao Wang:
Compression of Acoustic Event Detection Models with Quantized Distillation. INTERSPEECH 2019: 3639-3643 - [c10]Courtney Mansfield, Ming Sun, Yuzong Liu, Ankur Gandhe, Björn Hoffmeister:
Neural Text Normalization with Subword Units. NAACL-HLT (2) 2019: 190-196 - [i7]Bowen Shi, Ming Sun, Chieh-Chi Kao, Viktor Rozgic, Spyros Matsoukas, Chao Wang:
Semi-supervised Acoustic Event Detection based on tri-training. CoRR abs/1904.12926 (2019) - [i6]Bowen Shi, Ming Sun, Chieh-Chi Kao, Viktor Rozgic, Spyros Matsoukas, Chao Wang:
Compression of Acoustic Event Detection Models with Low-rank Matrix Factorization and Quantization Training. CoRR abs/1905.00855 (2019) - [i5]Bowen Shi, Ming Sun, Chieh-Chi Kao, Viktor Rozgic, Spyros Matsoukas, Chao Wang:
Compression of Acoustic Event Detection Models With Quantized Distillation. CoRR abs/1907.00873 (2019) - [i4]Chieh-Chi Kao, Ming Sun, Yixin Gao, Shiv Vitaladevuni, Chao Wang:
Sub-band Convolutional Neural Networks for Small-footprint Spoken Term Classification. CoRR abs/1907.01448 (2019) - [i3]Weimin Wang, Weiran Wang, Ming Sun, Chao Wang:
Acoustic scene analysis with multi-head attention networks. CoRR abs/1909.08961 (2019) - 2018
- [c9]Jinxi Guo, Ken'ichi Kumatani, Ming Sun, Minhua Wu, Anirudh Raju, Nikko Strom, Arindam Mandal:
Time-Delayed Bottleneck Highway Networks Using a DFT Feature for Keyword Spotting. ICASSP 2018: 5489-5493 - [c8]Minhua Wu, Sankaran Panchapagesan, Ming Sun, Jiacheng Gu, Ryan Thomas, Shiv Naga Prasad Vitaladevuni, Björn Hoffmeister, Arindam Mandal:
Monophone-Based Background Modeling for Two-Stage On-Device Wake Word Detection. ICASSP 2018: 5494-5498 - [c7]Chieh-Chi Kao, Weiran Wang, Ming Sun, Chao Wang:
R-CRNN: Region-based Convolutional Recurrent Neural Network for Audio Event Detection. INTERSPEECH 2018: 1358-1362 - [i2]Chieh-Chi Kao, Weiran Wang, Ming Sun, Chao Wang:
R-CRNN: Region-based Convolutional Recurrent Neural Network for Audio Event Detection. CoRR abs/1808.06627 (2018) - 2017
- [c6]Ming Sun, Andreas Schwarz, Minhua Wu, Nikko Strom, Spyros Matsoukas, Shiv Vitaladevuni:
An Empirical Study of Cross-Lingual Transfer Learning Techniques for Small-Footprint Keyword Spotting. ICMLA 2017: 255-260 - [c5]Ming Sun, David Snyder, Yixin Gao, Varun K. Nagaraja, Mike Rodehorst, Sankaran Panchapagesan, Nikko Strom, Spyros Matsoukas, Shiv Vitaladevuni:
Compressed Time Delay Neural Network for Small-Footprint Keyword Spotting. INTERSPEECH 2017: 3607-3611 - [i1]Ming Sun, Anirudh Raju, George Tucker, Sankaran Panchapagesan, Gengshen Fu, Arindam Mandal, Spyros Matsoukas, Nikko Strom, Shiv Vitaladevuni:
Max-Pooling Loss Training of Long Short-Term Memory Networks for Small-Footprint Keyword Spotting. CoRR abs/1705.02411 (2017) - 2016
- [c4]Sankaran Panchapagesan, Ming Sun, Aparna Khare, Spyros Matsoukas, Arindam Mandal, Björn Hoffmeister, Shiv Vitaladevuni:
Multi-Task Learning and Weighted Cross-Entropy for DNN-Based Keyword Spotting. INTERSPEECH 2016: 760-764 - [c3]George Tucker, Minhua Wu, Ming Sun, Sankaran Panchapagesan, Gengshen Fu, Shiv Vitaladevuni:
Model Compression Applied to Small-Footprint Keyword Spotting. INTERSPEECH 2016: 1878-1882 - [c2]Ming Sun, Anirudh Raju, George Tucker, Sankaran Panchapagesan, Gengshen Fu, Arindam Mandal, Spyros Matsoukas, Nikko Strom, Shiv Vitaladevuni:
Max-pooling loss training of long short-term memory networks for small-footprint keyword spotting. SLT 2016: 474-480 - 2015
- [c1]Ming Sun, Varun K. Nagaraja, Björn Hoffmeister, Shiv Vitaladevuni:
Model Shrinking for Embedded Keyword Spotting. ICMLA 2015: 369-374
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2025-02-05 21:36 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint