Kwangyoun Kim

2020 – today

2024
[c21]
  - https://dblp.org/rec/conf/icassp/ShonKSH0L24
Suwon Shon, Kwangyoun Kim, Prashant Sridhar, Yi-Te Hsu, Shinji Watanabe, Karen Livescu:
Generative Context-Aware Fine-Tuning of Self-Supervised Speech Models. ICASSP 2024: 11156-11160
[c20]
  - https://dblp.org/rec/conf/icassp/TangKSWS24
Jiyang Tang, Kwangyoun Kim, Suwon Shon, Felix Wu, Prashant Sridhar:
Improving ASR Contextual Biasing with Guided Attention. ICASSP 2024: 12096-12100
[i18]
  - https://dblp.org/rec/journals/corr/abs-2401-08835
Jiyang Tang, Kwangyoun Kim, Suwon Shon, Felix Wu, Prashant Sridhar, Shinji Watanabe:
Improving ASR Contextual Biasing with Guided Attention. CoRR abs/2401.08835 (2024)
[i17]
  - https://dblp.org/rec/journals/corr/abs-2406-09345
Suwon Shon, Kwangyoun Kim, Yi-Te Hsu, Prashant Sridhar, Shinji Watanabe, Karen Livescu:
DiscreteSLU: A Large Language Model with Self-Supervised Discrete Speech Units for Spoken Language Understanding. CoRR abs/2406.09345 (2024)
[i16]
  - https://dblp.org/rec/journals/corr/abs-2409-03717
Justin Lovelace, Soham Ray, Kwangyoun Kim, Kilian Q. Weinberger, Felix Wu:
Sample-Efficient Diffusion for Text-To-Speech Synthesis. CoRR abs/2409.03717 (2024)
2023
[c19]
  - https://dblp.org/rec/conf/icassp/PengKWSW23
Yifan Peng, Kwangyoun Kim, Felix Wu, Prashant Sridhar, Shinji Watanabe:
Structured Pruning of Self-Supervised Pre-Trained Models for Speech Recognition and Understanding. ICASSP 2023: 1-5
[c18]
  - https://dblp.org/rec/conf/icassp/ShonWKSLW23
Suwon Shon, Felix Wu, Kwangyoun Kim, Prashant Sridhar, Karen Livescu, Shinji Watanabe:
Context-Aware Fine-Tuning of Self-Supervised Speech Models. ICASSP 2023: 1-5
[c17]
  - https://dblp.org/rec/conf/icassp/WuKWHMWA23
Felix Wu, Kwangyoun Kim, Shinji Watanabe, Kyu Jeong Han, Ryan McDonald, Kilian Q. Weinberger, Yoav Artzi:
Wav2Seq: Pre-Training Speech-to-Text Encoder-Decoder Models Using Pseudo Languages. ICASSP 2023: 1-5
[c16]
  - https://dblp.org/rec/conf/interspeech/PengKWYACTSS023
Yifan Peng, Kwangyoun Kim, Felix Wu, Brian Yan, Siddhant Arora, William Chen, Jiyang Tang, Suwon Shon, Prashant Sridhar, Shinji Watanabe:
A Comparative Study on E-Branchformer vs Conformer in Speech Recognition, Translation, and Understanding Tasks. INTERSPEECH 2023: 2208-2212
[i15]
  - https://dblp.org/rec/journals/corr/abs-2302-14132
Yifan Peng, Kwangyoun Kim, Felix Wu, Prashant Sridhar, Shinji Watanabe:
Structured Pruning of Self-Supervised Pre-trained Models for Speech Recognition and Understanding. CoRR abs/2302.14132 (2023)
[i14]
  - https://dblp.org/rec/journals/corr/abs-2305-11073
Yifan Peng, Kwangyoun Kim, Felix Wu, Brian Yan, Siddhant Arora, William Chen, Jiyang Tang, Suwon Shon, Prashant Sridhar, Shinji Watanabe:
A Comparative Study on E-Branchformer vs Conformer in Speech Recognition, Translation, and Understanding Tasks. CoRR abs/2305.11073 (2023)
[i13]
  - https://dblp.org/rec/journals/corr/abs-2312-09895
Suwon Shon, Kwangyoun Kim, Prashant Sridhar, Yi-Te Hsu, Shinji Watanabe, Karen Livescu:
Generative Context-aware Fine-tuning of Self-supervised Speech Models. CoRR abs/2312.09895 (2023)
2022
[c15]
  - https://dblp.org/rec/conf/icassp/WuKPHWA22
Felix Wu, Kwangyoun Kim, Jing Pan, Kyu Jeong Han, Kilian Q. Weinberger, Yoav Artzi:
Performance-Efficiency Trade-Offs in Unsupervised Pre-Training for Speech Recognition. ICASSP 2022: 7667-7671
[c14]
  - https://dblp.org/rec/conf/icassp/PanLKHW22
Jing Pan, Tao Lei, Kwangyoun Kim, Kyu Jeong Han, Shinji Watanabe:
SRU++: Pioneering Fast Recurrence with Attention for Speech Recognition. ICASSP 2022: 7872-7876
[c13]
  - https://dblp.org/rec/conf/slt/KimWPPSHW22
Kwangyoun Kim, Felix Wu, Yifan Peng, Jing Pan, Prashant Sridhar, Kyu Jeong Han, Shinji Watanabe:
E-Branchformer: Branchformer with Enhanced Merging for Speech Recognition. SLT 2022: 84-91
[i12]
  - https://dblp.org/rec/journals/corr/abs-2205-01086
Felix Wu, Kwangyoun Kim, Shinji Watanabe, Kyu Jeong Han, Ryan McDonald, Kilian Q. Weinberger, Yoav Artzi:
Wav2Seq: Pre-training Speech-to-Text Encoder-Decoder Models Using Pseudo Languages. CoRR abs/2205.01086 (2022)
[i11]
  - https://dblp.org/rec/journals/corr/abs-2210-00077
Kwangyoun Kim, Felix Wu, Yifan Peng, Jing Pan, Prashant Sridhar, Kyu Jeong Han, Shinji Watanabe:
E-Branchformer: Branchformer with Enhanced merging for speech recognition. CoRR abs/2210.00077 (2022)
[i10]
  - https://dblp.org/rec/journals/corr/abs-2212-08542
Suwon Shon, Felix Wu, Kwangyoun Kim, Prashant Sridhar, Karen Livescu, Shinji Watanabe:
Context-aware Fine-tuning of Self-supervised Speech Models. CoRR abs/2212.08542 (2022)
2021
[j1]
  - https://dblp.org/rec/journals/csl/LeeJLKKHK21
Kyungmin Lee, Hyunwhan Joe, Hyeontaek Lim, Kwangyoun Kim, Sungsoo Kim, Chang Woo Han, Hong-Gee Kim:
Sequential routing framework: Fully capsule network-based speech recognition. Comput. Speech Lang. 70: 101228 (2021)
[c12]
  - https://dblp.org/rec/conf/icassp/GuptaKGKSSK21
Ashutosh Gupta, Ankur Kumar, Dhananjaya Gowda, Kwangyoun Kim, Sachin Singh, Shatrughan Singh, Chanwoo Kim:
Neural Utterance Confidence Measure for RNN-Transducers and Two Pass Models. ICASSP 2021: 6398-6402
[c11]
  - https://dblp.org/rec/conf/interspeech/KimWSHW21
Kwangyoun Kim, Felix Wu, Prashant Sridhar, Kyu Jeong Han, Shinji Watanabe:
Multi-Mode Transformer Transducer with Stochastic Future Context. Interspeech 2021: 1827-1831
[i9]
  - https://dblp.org/rec/journals/corr/abs-2106-09760
Kwangyoun Kim, Felix Wu, Prashant Sridhar, Kyu Jeong Han, Shinji Watanabe:
Multi-mode Transformer Transducer with Stochastic Future Context. CoRR abs/2106.09760 (2021)
[i8]
  - https://dblp.org/rec/journals/corr/abs-2109-06870
Felix Wu, Kwangyoun Kim, Jing Pan, Kyu Jeong Han, Kilian Q. Weinberger, Yoav Artzi:
Performance-Efficiency Trade-offs in Unsupervised Pre-training for Speech Recognition. CoRR abs/2109.06870 (2021)
[i7]
  - https://dblp.org/rec/journals/corr/abs-2110-05571
Jing Pan, Tao Lei, Kwangyoun Kim, Kyu Jeong Han, Shinji Watanabe:
SRU++: Pioneering Fast Recurrence with Attention for Speech Recognition. CoRR abs/2110.05571 (2021)
2020
[c10]
  - https://dblp.org/rec/conf/icassp/KimKI20
Chanwoo Kim, Kwangyoun Kim, Sathish Reddy Indurthi:
Small Energy Masking for Improved Neural Network Training for End-To-End Speech Recognition. ICASSP 2020: 7684-7688
[c9]
  - https://dblp.org/rec/conf/interspeech/GowdaKKYGSKKJSK20
Dhananjaya Gowda, Ankur Kumar, Kwangyoun Kim, Hejung Yang, Abhinav Garg, Sachin Singh, Jiyeon Kim, Mehul Kumar, Sichen Jin, Shatrughan Singh, Chanwoo Kim:
Utterance Invariant Training for Hybrid Two-Pass End-to-End Speech Recognition. INTERSPEECH 2020: 2827-2831
[c8]
  - https://dblp.org/rec/conf/interspeech/GargVGJJHKPKKLM20
Abhinav Garg, Gowtham P. Vadisetti, Dhananjaya Gowda, Sichen Jin, Aditya Jayasimha, Youngho Han, Jiyeon Kim, Junmo Park, Kwangyoun Kim, Sooyeon Kim, Young-Yoon Lee, Kyungbo Min, Chanwoo Kim:
Streaming On-Device End-to-End ASR System for Privacy-Sensitive Voice-Typing. INTERSPEECH 2020: 3371-3375
[i6]
  - https://dblp.org/rec/journals/corr/abs-2001-00577
Kwangyoun Kim, Kyungmin Lee, Dhananjaya Gowda, Junmo Park, Sungsoo Kim, Sichen Jin, Young-Yoon Lee, Jinsu Yeo, Daehyun Kim, Seokyeong Jung, Jungin Lee, Myoungji Han, Chanwoo Kim:
Attention based on-device streaming speech recognition with large speech corpus. CoRR abs/2001.00577 (2020)
[i5]
  - https://dblp.org/rec/journals/corr/abs-2002-06312
Chanwoo Kim, Kwangyoun Kim, Sathish Reddy Indurthi:
Small energy masking for improved neural network training for end-to-end speech recognition. CoRR abs/2002.06312 (2020)
[i4]
  - https://dblp.org/rec/journals/corr/abs-2007-11747
Kyungmin Lee, Hyunwhan Joe, Hyeontaek Lim, Kwangyoun Kim, Sungsoo Kim, Chang Woo Han, Hong-Gee Kim:
Sequential Routing Framework: Fully Capsule Network-based Speech Recognition. CoRR abs/2007.11747 (2020)

2010 – 2019

2019
[c7]
  - https://dblp.org/rec/conf/asru/GargGKKKK19
Abhinav Garg, Dhananjaya Gowda, Ankur Kumar, Kwangyoun Kim, Mehul Kumar, Chanwoo Kim:
Improved Multi-Stage Training of Online Attention-Based Encoder-Decoder Models. ASRU 2019: 70-77
[c6]
  - https://dblp.org/rec/conf/asru/KimSSHGKKKKLHGK19
Chanwoo Kim, Minkyoo Shin, Shatrughan Singh, Larry Heck, Dhananjaya Gowda, Sungsoo Kim, Kwangyoun Kim, Mehul Kumar, Jiyeon Kim, Kyungmin Lee, Changwoo Han, Abhinav Garg, Eunhyang Kim:
End-to-End Training of a Large Vocabulary End-to-End Speech Recognition System. ASRU 2019: 562-569
[c5]
  - https://dblp.org/rec/conf/asru/KimJLHKLGPKJLYK19
Kwangyoun Kim, Seokyeong Jung, Jungin Lee, Myoungji Han, Chanwoo Kim, Kyungmin Lee, Dhananjaya Gowda, Junmo Park, Sungsoo Kim, Sichen Jin, Young-Yoon Lee, Jinsu Yeo, Daehyun Kim:
Attention Based On-Device Streaming Speech Recognition with Large Speech Corpus. ASRU 2019: 956-963
[c4]
  - https://dblp.org/rec/conf/asru/KimKKG19
Chanwoo Kim, Mehul Kumar, Kwangyoun Kim, Dhananjaya Gowda:
Power-Law Nonlinearity with Maximally Uniform Distribution Criterion for Improved Neural Network Training in Automatic Speech Recognition. ASRU 2019: 988-995
[c3]
  - https://dblp.org/rec/conf/interspeech/GowdaGKKK19
Dhananjaya Gowda, Abhinav Garg, Kwangyoun Kim, Mehul Kumar, Chanwoo Kim:
Multi-Task Multi-Resolution Char-to-BPE Cross-Attention Decoder for End-to-End Speech Recognition. INTERSPEECH 2019: 2783-2787
[i3]
  - https://dblp.org/rec/journals/corr/abs-1912-11040
Chanwoo Kim, Sungsoo Kim, Kwangyoun Kim, Mehul Kumar, Jiyeon Kim, Kyungmin Lee, Changwoo Han, Abhinav Garg, Eunhyang Kim, Minkyoo Shin, Shatrughan Singh, Larry Heck, Dhananjaya Gowda:
end-to-end training of a large vocabulary end-to-end speech recognition system. CoRR abs/1912.11040 (2019)
[i2]
  - https://dblp.org/rec/journals/corr/abs-1912-11041
Chanwoo Kim, Mehul Kumar, Kwangyoun Kim, Dhananjaya Gowda:
power-law nonlinearity with maximally uniform distribution criterion for improved neural network training in automatic speech recognition. CoRR abs/1912.11041 (2019)
[i1]
  - https://dblp.org/rec/journals/corr/abs-1912-12384
Abhinav Garg, Dhananjaya Gowda, Ankur Kumar, Kwangyoun Kim, Mehul Kumar, Chanwoo Kim:
Improved Multi-Stage Training of Online Attention-based Encoder-Decoder Models. CoRR abs/1912.12384 (2019)
2012
[c2]
  - https://dblp.org/rec/conf/iccel/LeeKHK12
Younghyun Lee, Kwangyoun Kim, David K. Han, Hanseok Ko:
Acoustic and visual signal based violence detection system for indoor security application. ICCE 2012: 737-738
2011
[c1]
  - https://dblp.org/rec/conf/avss/KimK11
Kwangyoun Kim, Hanseok Ko:
Hierarchical approach for abnormal acoustic event classification in an elevator. AVSS 2011: 89-94
