default search action
Kwangyoun Kim
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c21]Suwon Shon, Kwangyoun Kim, Prashant Sridhar, Yi-Te Hsu, Shinji Watanabe, Karen Livescu:
Generative Context-Aware Fine-Tuning of Self-Supervised Speech Models. ICASSP 2024: 11156-11160 - [c20]Jiyang Tang, Kwangyoun Kim, Suwon Shon, Felix Wu, Prashant Sridhar:
Improving ASR Contextual Biasing with Guided Attention. ICASSP 2024: 12096-12100 - [i18]Jiyang Tang, Kwangyoun Kim, Suwon Shon, Felix Wu, Prashant Sridhar, Shinji Watanabe:
Improving ASR Contextual Biasing with Guided Attention. CoRR abs/2401.08835 (2024) - [i17]Suwon Shon, Kwangyoun Kim, Yi-Te Hsu, Prashant Sridhar, Shinji Watanabe, Karen Livescu:
DiscreteSLU: A Large Language Model with Self-Supervised Discrete Speech Units for Spoken Language Understanding. CoRR abs/2406.09345 (2024) - [i16]Justin Lovelace, Soham Ray, Kwangyoun Kim, Kilian Q. Weinberger, Felix Wu:
Sample-Efficient Diffusion for Text-To-Speech Synthesis. CoRR abs/2409.03717 (2024) - 2023
- [c19]Yifan Peng, Kwangyoun Kim, Felix Wu, Prashant Sridhar, Shinji Watanabe:
Structured Pruning of Self-Supervised Pre-Trained Models for Speech Recognition and Understanding. ICASSP 2023: 1-5 - [c18]Suwon Shon, Felix Wu, Kwangyoun Kim, Prashant Sridhar, Karen Livescu, Shinji Watanabe:
Context-Aware Fine-Tuning of Self-Supervised Speech Models. ICASSP 2023: 1-5 - [c17]Felix Wu, Kwangyoun Kim, Shinji Watanabe, Kyu Jeong Han, Ryan McDonald, Kilian Q. Weinberger, Yoav Artzi:
Wav2Seq: Pre-Training Speech-to-Text Encoder-Decoder Models Using Pseudo Languages. ICASSP 2023: 1-5 - [c16]Yifan Peng, Kwangyoun Kim, Felix Wu, Brian Yan, Siddhant Arora, William Chen, Jiyang Tang, Suwon Shon, Prashant Sridhar, Shinji Watanabe:
A Comparative Study on E-Branchformer vs Conformer in Speech Recognition, Translation, and Understanding Tasks. INTERSPEECH 2023: 2208-2212 - [i15]Yifan Peng, Kwangyoun Kim, Felix Wu, Prashant Sridhar, Shinji Watanabe:
Structured Pruning of Self-Supervised Pre-trained Models for Speech Recognition and Understanding. CoRR abs/2302.14132 (2023) - [i14]Yifan Peng, Kwangyoun Kim, Felix Wu, Brian Yan, Siddhant Arora, William Chen, Jiyang Tang, Suwon Shon, Prashant Sridhar, Shinji Watanabe:
A Comparative Study on E-Branchformer vs Conformer in Speech Recognition, Translation, and Understanding Tasks. CoRR abs/2305.11073 (2023) - [i13]Suwon Shon, Kwangyoun Kim, Prashant Sridhar, Yi-Te Hsu, Shinji Watanabe, Karen Livescu:
Generative Context-aware Fine-tuning of Self-supervised Speech Models. CoRR abs/2312.09895 (2023) - 2022
- [c15]Felix Wu, Kwangyoun Kim, Jing Pan, Kyu Jeong Han, Kilian Q. Weinberger, Yoav Artzi:
Performance-Efficiency Trade-Offs in Unsupervised Pre-Training for Speech Recognition. ICASSP 2022: 7667-7671 - [c14]Jing Pan, Tao Lei, Kwangyoun Kim, Kyu Jeong Han, Shinji Watanabe:
SRU++: Pioneering Fast Recurrence with Attention for Speech Recognition. ICASSP 2022: 7872-7876 - [c13]Kwangyoun Kim, Felix Wu, Yifan Peng, Jing Pan, Prashant Sridhar, Kyu Jeong Han, Shinji Watanabe:
E-Branchformer: Branchformer with Enhanced Merging for Speech Recognition. SLT 2022: 84-91 - [i12]Felix Wu, Kwangyoun Kim, Shinji Watanabe, Kyu Jeong Han, Ryan McDonald, Kilian Q. Weinberger, Yoav Artzi:
Wav2Seq: Pre-training Speech-to-Text Encoder-Decoder Models Using Pseudo Languages. CoRR abs/2205.01086 (2022) - [i11]Kwangyoun Kim, Felix Wu, Yifan Peng, Jing Pan, Prashant Sridhar, Kyu Jeong Han, Shinji Watanabe:
E-Branchformer: Branchformer with Enhanced merging for speech recognition. CoRR abs/2210.00077 (2022) - [i10]Suwon Shon, Felix Wu, Kwangyoun Kim, Prashant Sridhar, Karen Livescu, Shinji Watanabe:
Context-aware Fine-tuning of Self-supervised Speech Models. CoRR abs/2212.08542 (2022) - 2021
- [j1]Kyungmin Lee, Hyunwhan Joe, Hyeontaek Lim, Kwangyoun Kim, Sungsoo Kim, Chang Woo Han, Hong-Gee Kim:
Sequential routing framework: Fully capsule network-based speech recognition. Comput. Speech Lang. 70: 101228 (2021) - [c12]Ashutosh Gupta, Ankur Kumar, Dhananjaya Gowda, Kwangyoun Kim, Sachin Singh, Shatrughan Singh, Chanwoo Kim:
Neural Utterance Confidence Measure for RNN-Transducers and Two Pass Models. ICASSP 2021: 6398-6402 - [c11]Kwangyoun Kim, Felix Wu, Prashant Sridhar, Kyu Jeong Han, Shinji Watanabe:
Multi-Mode Transformer Transducer with Stochastic Future Context. Interspeech 2021: 1827-1831 - [i9]Kwangyoun Kim, Felix Wu, Prashant Sridhar, Kyu Jeong Han, Shinji Watanabe:
Multi-mode Transformer Transducer with Stochastic Future Context. CoRR abs/2106.09760 (2021) - [i8]Felix Wu, Kwangyoun Kim, Jing Pan, Kyu Jeong Han, Kilian Q. Weinberger, Yoav Artzi:
Performance-Efficiency Trade-offs in Unsupervised Pre-training for Speech Recognition. CoRR abs/2109.06870 (2021) - [i7]Jing Pan, Tao Lei, Kwangyoun Kim, Kyu Jeong Han, Shinji Watanabe:
SRU++: Pioneering Fast Recurrence with Attention for Speech Recognition. CoRR abs/2110.05571 (2021) - 2020
- [c10]Chanwoo Kim, Kwangyoun Kim, Sathish Reddy Indurthi:
Small Energy Masking for Improved Neural Network Training for End-To-End Speech Recognition. ICASSP 2020: 7684-7688 - [c9]Dhananjaya Gowda, Ankur Kumar, Kwangyoun Kim, Hejung Yang, Abhinav Garg, Sachin Singh, Jiyeon Kim, Mehul Kumar, Sichen Jin, Shatrughan Singh, Chanwoo Kim:
Utterance Invariant Training for Hybrid Two-Pass End-to-End Speech Recognition. INTERSPEECH 2020: 2827-2831 - [c8]Abhinav Garg, Gowtham P. Vadisetti, Dhananjaya Gowda, Sichen Jin, Aditya Jayasimha, Youngho Han, Jiyeon Kim, Junmo Park, Kwangyoun Kim, Sooyeon Kim, Young-Yoon Lee, Kyungbo Min, Chanwoo Kim:
Streaming On-Device End-to-End ASR System for Privacy-Sensitive Voice-Typing. INTERSPEECH 2020: 3371-3375 - [i6]Kwangyoun Kim, Kyungmin Lee, Dhananjaya Gowda, Junmo Park, Sungsoo Kim, Sichen Jin, Young-Yoon Lee, Jinsu Yeo, Daehyun Kim, Seokyeong Jung, Jungin Lee, Myoungji Han, Chanwoo Kim:
Attention based on-device streaming speech recognition with large speech corpus. CoRR abs/2001.00577 (2020) - [i5]Chanwoo Kim, Kwangyoun Kim, Sathish Reddy Indurthi:
Small energy masking for improved neural network training for end-to-end speech recognition. CoRR abs/2002.06312 (2020) - [i4]Kyungmin Lee, Hyunwhan Joe, Hyeontaek Lim, Kwangyoun Kim, Sungsoo Kim, Chang Woo Han, Hong-Gee Kim:
Sequential Routing Framework: Fully Capsule Network-based Speech Recognition. CoRR abs/2007.11747 (2020)
2010 – 2019
- 2019
- [c7]Abhinav Garg, Dhananjaya Gowda, Ankur Kumar, Kwangyoun Kim, Mehul Kumar, Chanwoo Kim:
Improved Multi-Stage Training of Online Attention-Based Encoder-Decoder Models. ASRU 2019: 70-77 - [c6]Chanwoo Kim, Minkyoo Shin, Shatrughan Singh, Larry Heck, Dhananjaya Gowda, Sungsoo Kim, Kwangyoun Kim, Mehul Kumar, Jiyeon Kim, Kyungmin Lee, Changwoo Han, Abhinav Garg, Eunhyang Kim:
End-to-End Training of a Large Vocabulary End-to-End Speech Recognition System. ASRU 2019: 562-569 - [c5]Kwangyoun Kim, Seokyeong Jung, Jungin Lee, Myoungji Han, Chanwoo Kim, Kyungmin Lee, Dhananjaya Gowda, Junmo Park, Sungsoo Kim, Sichen Jin, Young-Yoon Lee, Jinsu Yeo, Daehyun Kim:
Attention Based On-Device Streaming Speech Recognition with Large Speech Corpus. ASRU 2019: 956-963 - [c4]Chanwoo Kim, Mehul Kumar, Kwangyoun Kim, Dhananjaya Gowda:
Power-Law Nonlinearity with Maximally Uniform Distribution Criterion for Improved Neural Network Training in Automatic Speech Recognition. ASRU 2019: 988-995 - [c3]Dhananjaya Gowda, Abhinav Garg, Kwangyoun Kim, Mehul Kumar, Chanwoo Kim:
Multi-Task Multi-Resolution Char-to-BPE Cross-Attention Decoder for End-to-End Speech Recognition. INTERSPEECH 2019: 2783-2787 - [i3]Chanwoo Kim, Sungsoo Kim, Kwangyoun Kim, Mehul Kumar, Jiyeon Kim, Kyungmin Lee, Changwoo Han, Abhinav Garg, Eunhyang Kim, Minkyoo Shin, Shatrughan Singh, Larry Heck, Dhananjaya Gowda:
end-to-end training of a large vocabulary end-to-end speech recognition system. CoRR abs/1912.11040 (2019) - [i2]Chanwoo Kim, Mehul Kumar, Kwangyoun Kim, Dhananjaya Gowda:
power-law nonlinearity with maximally uniform distribution criterion for improved neural network training in automatic speech recognition. CoRR abs/1912.11041 (2019) - [i1]Abhinav Garg, Dhananjaya Gowda, Ankur Kumar, Kwangyoun Kim, Mehul Kumar, Chanwoo Kim:
Improved Multi-Stage Training of Online Attention-based Encoder-Decoder Models. CoRR abs/1912.12384 (2019) - 2012
- [c2]Younghyun Lee, Kwangyoun Kim, David K. Han, Hanseok Ko:
Acoustic and visual signal based violence detection system for indoor security application. ICCE 2012: 737-738 - 2011
- [c1]Kwangyoun Kim, Hanseok Ko:
Hierarchical approach for abnormal acoustic event classification in an elevator. AVSS 2011: 89-94
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-10 22:16 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint