default search action
Jee-Weon Jung
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j2]Jaesung Huh, Joon Son Chung, Arsha Nagrani, Andrew Brown, Jee-weon Jung, Daniel Garcia-Romero, Andrew Zisserman:
The VoxCeleb Speaker Recognition Challenge: A Retrospective. IEEE ACM Trans. Audio Speech Lang. Process. 32: 3850-3866 (2024) - [c52]Siddhant Arora, Ankita Pasad, Chung-Ming Chien, Jionghao Han, Roshan S. Sharma, Jee-weon Jung, Hira Dhamyal, William Chen, Suwon Shon, Hung-yi Lee, Karen Livescu, Shinji Watanabe:
On the Evaluation of Speech Foundation Models for Spoken Language Understanding. ACL (Findings) 2024: 11923-11938 - [c51]Shih-Lun Wu, Xuankai Chang, Gordon Wichern, Jee-Weon Jung, François G. Germain, Jonathan Le Roux, Shinji Watanabe:
Improving Audio Captioning Models with Fine-Grained Audio Features, Text Embedding Supervision, and LLM Mix-Up Augmentation. ICASSP 2024: 316-320 - [c50]Kwanghee Choi, Jee-Weon Jung, Shinji Watanabe:
Understanding Probe Behaviors Through Variational Bounds of Mutual Information. ICASSP 2024: 5655-5659 - [c49]Wangyou Zhang, Jee-weon Jung, Yanmin Qian:
Improving Design of Input Condition Invariant Speech Enhancement. ICASSP 2024: 10696-10700 - [c48]Xuankai Chang, Brian Yan, Kwanghee Choi, Jee-Weon Jung, Yichen Lu, Soumi Maiti, Roshan S. Sharma, Jiatong Shi, Jinchuan Tian, Shinji Watanabe, Yuya Fujita, Takashi Maekaku, Pengcheng Guo, Yao-Fei Cheng, Pavel Denisov, Kohei Saijo, Hsiu-Hsuan Wang:
Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study. ICASSP 2024: 11481-11485 - [c47]Samuele Cornell, Jee-Weon Jung, Shinji Watanabe, Stefano Squartini:
One Model to Rule Them All ? Towards End-to-End Joint Speaker Diarization and Speech Recognition. ICASSP 2024: 11856-11860 - [c46]Jee-Weon Jung, Roshan S. Sharma, William Chen, Bhiksha Raj, Shinji Watanabe:
AugSumm: Towards Generalizable Speech Summarization Using Synthetic Labels from Large Language Models. ICASSP 2024: 12071-12075 - [c45]Doyeop Kwak, Jaemin Jung, Kihyun Nam, Youngjoon Jang, Jee-Weon Jung, Shinji Watanabe, Joon Son Chung:
VoxMM: Rich Transcription of Conversations in the Wild. ICASSP 2024: 12551-12555 - [c44]Soumi Maiti, Yifan Peng, Shukjae Choi, Jee-Weon Jung, Xuankai Chang, Shinji Watanabe:
VoxtLM: Unified Decoder-Only Models for Consolidating Speech Recognition, Synthesis and Speech, Text Continuation Tasks. ICASSP 2024: 13326-13330 - [c43]Siddhant Arora, Hayato Futami, Jee-weon Jung, Yifan Peng, Roshan S. Sharma, Yosuke Kashiwagi, Emiru Tsunoo, Karen Livescu, Shinji Watanabe:
UniverSLU: Universal Spoken Language Understanding for Diverse Tasks with Natural Language Instructions. NAACL-HLT 2024: 2754-2774 - [c42]Hye-jin Shim, Jee-weon Jung, Tomi Kinnunen, Nicholas W. D. Evans, Jean-François Bonastre, Itshak Lapidot:
a-DCF: an architecture agnostic metric with application to spoofing-robust speaker verification. Odyssey 2024: 158-164 - [i62]Jee-weon Jung, Roshan S. Sharma, William Chen, Bhiksha Raj, Shinji Watanabe:
AugSumm: towards generalizable speech summarization using synthetic labels from large language model. CoRR abs/2401.06806 (2024) - [i61]Wangyou Zhang, Jee-weon Jung, Shinji Watanabe, Yanmin Qian:
Improving Design of Input Condition Invariant Speech Enhancement. CoRR abs/2401.14271 (2024) - [i60]Yifan Peng, Jinchuan Tian, William Chen, Siddhant Arora, Brian Yan, Yui Sudo, Muhammad Shakeel, Kwanghee Choi, Jiatong Shi, Xuankai Chang, Jee-weon Jung, Shinji Watanabe:
OWSM v3.1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer. CoRR abs/2401.16658 (2024) - [i59]Jee-weon Jung, Wangyou Zhang, Jiatong Shi, Zakaria Aldeneh, Takuya Higuchi, Barry-John Theobald, Ahmed Hussen Abdelaziz, Shinji Watanabe:
ESPnet-SPK: full pipeline speaker embedding toolkit with reproducible recipes, self-supervised front-ends, and off-the-shelf models. CoRR abs/2401.17230 (2024) - [i58]Zakaria Aldeneh, Takuya Higuchi, Jee-weon Jung, Skyler Seto, Tatiana Likhomanenko, Stephen Shum, Ahmed Hussen Abdelaziz, Shinji Watanabe, Barry-John Theobald:
Can you Remove the Downstream Model for Speaker Recognition with Self-Supervised Speech Features? CoRR abs/2402.00340 (2024) - [i57]Minsu Kim, Jee-weon Jung, Hyeongseop Rha, Soumi Maiti, Siddhant Arora, Xuankai Chang, Shinji Watanabe, Yong Man Ro:
TMT: Tri-Modal Translation between Speech, Image, and Text by Processing Different Modalities as Different Languages. CoRR abs/2402.16021 (2024) - [i56]Hye-jin Shim, Jee-weon Jung, Tomi Kinnunen, Nicholas W. D. Evans, Jean-François Bonastre, Itshak Lapidot:
a-DCF: an architecture agnostic metric with application to spoofing-robust speaker verification. CoRR abs/2403.01355 (2024) - [i55]Wangyou Zhang, Kohei Saijo, Jee-weon Jung, Chenda Li, Shinji Watanabe, Yanmin Qian:
Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enhancement. CoRR abs/2406.04269 (2024) - [i54]Jee-weon Jung, Xin Wang, Nicholas W. D. Evans, Shinji Watanabe, Hye-jin Shim, Hemlata Tak, Sidhhant Arora, Junichi Yamagishi, Joon Son Chung:
To what extent can ASV systems naturally defend against spoofing attacks? CoRR abs/2406.05339 (2024) - [i53]Siddhant Arora, Ankita Pasad, Chung-Ming Chien, Jionghao Han, Roshan S. Sharma, Jee-weon Jung, Hira Dhamyal, William Chen, Suwon Shon, Hung-yi Lee, Karen Livescu, Shinji Watanabe:
On the Evaluation of Speech Foundation Models for Spoken Language Understanding. CoRR abs/2406.10083 (2024) - [i52]Kihyun Nam, Hee-Soo Heo, Jee-weon Jung, Joon Son Chung:
Disentangled Representation Learning for Environment-agnostic Speaker Recognition. CoRR abs/2406.14559 (2024) - [i51]Hye-jin Shim, Md. Sahidullah, Jee-weon Jung, Shinji Watanabe, Tomi Kinnunen:
Beyond Silence: Bias Analysis through Loss and Asymmetric Approach in Audio Anti-Spoofing. CoRR abs/2406.17246 (2024) - [i50]Xin Wang, Héctor Delgado, Hemlata Tak, Jee-weon Jung, Hye-jin Shim, Massimiliano Todisco, Ivan Kukanov, Xuechen Liu, Md. Sahidullah, Tomi Kinnunen, Nicholas W. D. Evans, Kong Aik Lee, Junichi Yamagishi:
ASVspoof 5: Crowdsourced Speech Data, Deepfakes, and Adversarial Attacks at Scale. CoRR abs/2408.08739 (2024) - [i49]Jaesung Huh, Joon Son Chung, Arsha Nagrani, Andrew Brown, Jee-weon Jung, Daniel Garcia-Romero, Andrew Zisserman:
The VoxCeleb Speaker Recognition Challenge: A Retrospective. CoRR abs/2408.14886 (2024) - [i48]Jee-weon Jung, Wangyou Zhang, Soumi Maiti, Yihan Wu, Xin Wang, Ji-Hoon Kim, Yuta Matsunaga, Seyun Um, Jinchuan Tian, Hye-jin Shim, Nicholas W. D. Evans, Joon Son Chung, Shinnosuke Takamichi, Shinji Watanabe:
Text-To-Speech Synthesis In The Wild. CoRR abs/2409.08711 (2024) - [i47]Zakaria Aldeneh, Takuya Higuchi, Jee-weon Jung, Li-Wei Chen, Stephen Shum, Ahmed Hussen Abdelaziz, Shinji Watanabe, Tatiana Likhomanenko, Barry-John Theobald:
Speaker-IPL: Unsupervised Learning of Speaker Characteristics with i-Vector based Pseudo-Labels. CoRR abs/2409.10791 (2024) - [i46]Jiatong Shi, Jinchuan Tian, Yihan Wu, Jee-weon Jung, Jia Qi Yip, Yoshiki Masuyama, William Chen, Yuning Wu, Yuxun Tang, Massa Baali, Dareen Alharthi, Dong Zhang, Ruifan Deng, Tejes Srivastava, Haibin Wu, Alexander H. Liu, Bhiksha Raj, Qin Jin, Ruihua Song, Shinji Watanabe:
ESPnet-Codec: Comprehensive Training and Evaluation of Neural Codecs for Audio, Music, and Speech. CoRR abs/2409.15897 (2024) - [i45]Jee-weon Jung, Yihan Wu, Xin Wang, Ji-Hoon Kim, Soumi Maiti, Yuta Matsunaga, Hye-jin Shim, Jinchuan Tian, Nicholas W. D. Evans, Joon Son Chung, Wangyou Zhang, Seyun Um, Shinnosuke Takamichi, Shinji Watanabe:
SpoofCeleb: Speech Deepfake Detection and SASV In The Wild. CoRR abs/2409.17285 (2024) - 2023
- [c41]Yifan Peng, Jinchuan Tian, Brian Yan, Dan Berrebbi, Xuankai Chang, Xinjian Li, Jiatong Shi, Siddhant Arora, William Chen, Roshan S. Sharma, Wangyou Zhang, Yui Sudo, Muhammad Shakeel, Jee-Weon Jung, Soumi Maiti, Shinji Watanabe:
Reproducing Whisper-Style Training Using An Open-Source Toolkit And Publicly Available Data. ASRU 2023: 1-8 - [c40]Hee-Soo Heo, Youngki Kwon, Bong-Jin Lee, You Jin Kim, Jee-Weon Jung:
High-Resolution Embedding Extractor for Speaker Diarisation. ICASSP 2023: 1-5 - [c39]Jee-Weon Jung, Hee-Soo Heo, Bong-Jin Lee, Jaesung Huh, Andrew Brown, Youngki Kwon, Shinji Watanabe, Joon Son Chung:
In Search of Strong Embedding Extractors for Speaker Diarisation. ICASSP 2023: 1-5 - [c38]You Jin Kim, Hee-Soo Heo, Jee-Weon Jung, Youngki Kwon, Bong-Jin Lee, Joon Son Chung:
Advancing the Dimensionality Reduction of Speaker Embeddings for Speaker Diarisation: Disentangling Noise and Informing Speech Activity. ICASSP 2023: 1-5 - [c37]Youngki Kwon, Hee-Soo Heo, Bong-Jin Lee, You Jin Kim, Jee-Weon Jung:
Absolute Decision Corrupts Absolutely: Conservative Online Speaker Diarisation. ICASSP 2023: 1-5 - [c36]Hye-jin Shim, Jee-weon Jung, Tomi Kinnunen:
Multi-Dataset Co-Training with Sharpness-Aware Optimization for Audio Anti-spoofing. INTERSPEECH 2023: 3804-3808 - [c35]Sung Hwan Mun, Hye-jin Shim, Hemlata Tak, Xin Wang, Xuechen Liu, Md. Sahidullah, Myeonghun Jeong, Min Hyun Han, Massimiliano Todisco, Kong Aik Lee, Junichi Yamagishi, Nicholas W. D. Evans, Tomi Kinnunen, Nam Soo Kim, Jee-weon Jung:
Towards Single Integrated Spoofing-aware Speaker Verification Embeddings. INTERSPEECH 2023: 3989-3993 - [c34]Hee-Soo Heo, Jee-weon Jung, Jingu Kang, Youngki Kwon, Bong-Jin Lee, You Jin Kim, Joon Son Chung:
Curriculum Learning for Self-supervised Speaker Verification. INTERSPEECH 2023: 4693-4697 - [c33]Jee-weon Jung, Soonshin Seo, Hee-Soo Heo, Geonmin Kim, You Jin Kim, Youngki Kwon, Minjae Lee, Bong-Jin Lee:
Encoder-decoder Multimodal Speaker Change Detection. INTERSPEECH 2023: 5311-5315 - [c32]Kihyun Nam, Youkyum Kim, Jaesung Huh, Hee-Soo Heo, Jee-weon Jung, Joon Son Chung:
Disentangled Representation Learning for Multilingual Speaker Recognition. INTERSPEECH 2023: 5316-5320 - [i44]Jaesung Huh, Andrew Brown, Jee-weon Jung, Joon Son Chung, Arsha Nagrani, Daniel Garcia-Romero, Andrew Zisserman:
VoxSRC 2022: The Fourth VoxCeleb Speaker Recognition Challenge. CoRR abs/2302.10248 (2023) - [i43]Sung Hwan Mun, Hye-jin Shim, Hemlata Tak, Xin Wang, Xuechen Liu, Md. Sahidullah, Myeonghun Jeong, Min Hyun Han, Massimiliano Todisco, Kong Aik Lee, Junichi Yamagishi, Nicholas W. D. Evans, Tomi Kinnunen, Nam Soo Kim, Jee-weon Jung:
Towards single integrated spoofing-aware speaker verification embeddings. CoRR abs/2305.19051 (2023) - [i42]Hye-jin Shim, Jee-weon Jung, Tomi Kinnunen:
Multi-Dataset Co-Training with Sharpness-Aware Optimization for Audio Anti-spoofing. CoRR abs/2305.19953 (2023) - [i41]Jee-weon Jung, Soonshin Seo, Hee-Soo Heo, Geonmin Kim, You Jin Kim, Youngki Kwon, Minjae Lee, Bong-Jin Lee:
Encoder-decoder multimodal speaker change detection. CoRR abs/2306.00680 (2023) - [i40]Soumi Maiti, Yifan Peng, Shukjae Choi, Jee-weon Jung, Xuankai Chang, Shinji Watanabe:
Voxtlm: unified decoder-only models for consolidating speech recognition/synthesis and speech/text continuation tasks. CoRR abs/2309.07937 (2023) - [i39]Yifan Peng, Jinchuan Tian, Brian Yan, Dan Berrebbi, Xuankai Chang, Xinjian Li, Jiatong Shi, Siddhant Arora, William Chen, Roshan S. Sharma, Wangyou Zhang, Yui Sudo, Muhammad Shakeel, Jee-weon Jung, Soumi Maiti, Shinji Watanabe:
Reproducing Whisper-Style Training Using an Open-Source Toolkit and Publicly Available Data. CoRR abs/2309.13876 (2023) - [i38]Xuankai Chang, Brian Yan, Kwanghee Choi, Jee-Weon Jung, Yichen Lu, Soumi Maiti, Roshan S. Sharma, Jiatong Shi, Jinchuan Tian, Shinji Watanabe, Yuya Fujita, Takashi Maekaku, Pengcheng Guo, Yao-Fei Cheng, Pavel Denisov, Kohei Saijo, Hsiu-Hsuan Wang:
Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study. CoRR abs/2309.15800 (2023) - [i37]Shih-Lun Wu, Xuankai Chang, Gordon Wichern, Jee-weon Jung, François G. Germain, Jonathan Le Roux, Shinji Watanabe:
Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Supervision, and LLM Mix-up Augmentation. CoRR abs/2309.17352 (2023) - [i36]Samuele Cornell, Jee-weon Jung, Shinji Watanabe, Stefano Squartini:
One model to rule them all ? Towards End-to-End Joint Speaker Diarization and Speech Recognition. CoRR abs/2310.01688 (2023) - [i35]Siddhant Arora, Hayato Futami, Jee-weon Jung, Yifan Peng, Roshan S. Sharma, Yosuke Kashiwagi, Emiru Tsunoo, Shinji Watanabe:
UniverSLU: Universal Spoken Language Understanding for Diverse Classification and Sequence Generation Tasks with a Single Network. CoRR abs/2310.02973 (2023) - [i34]Kwanghee Choi, Jee-weon Jung, Shinji Watanabe:
Understanding Probe Behaviors through Variational Bounds of Mutual Information. CoRR abs/2312.10019 (2023) - 2022
- [c31]Hye-jin Shim, Jee-weon Jung, Ju-ho Kim, Ha-Jin Yu:
Attentive Max Feature Map and Joint Training for Acoustic Scene Classification. ICASSP 2022: 1036-1040 - [c30]Jee-weon Jung, Hee-Soo Heo, Hemlata Tak, Hye-jin Shim, Joon Son Chung, Bong-Jin Lee, Ha-Jin Yu, Nicholas W. D. Evans:
AASIST: Audio Anti-Spoofing Using Integrated Spectro-Temporal Graph Attention Networks. ICASSP 2022: 6367-6371 - [c29]Youngki Kwon, Hee-Soo Heo, Jee-Weon Jung, You Jin Kim, Bong-Jin Lee, Joon Son Chung:
Multi-Scale Speaker Embedding-Based Graph Attention Networks For Speaker Diarisation. ICASSP 2022: 8367-8371 - [c28]Jee-weon Jung, You Jin Kim, Hee-Soo Heo, Bong-Jin Lee, Youngki Kwon, Joon Son Chung:
Pushing the limits of raw waveform speaker recognition. INTERSPEECH 2022: 2228-2232 - [c27]Jee-weon Jung, Hemlata Tak, Hye-jin Shim, Hee-Soo Heo, Bong-Jin Lee, Soo-Whan Chung, Ha-Jin Yu, Nicholas W. D. Evans, Tomi Kinnunen:
SASV 2022: The First Spoofing-Aware Speaker Verification Challenge. INTERSPEECH 2022: 2893-2897 - [c26]Hemlata Tak, Massimiliano Todisco, Xin Wang, Jee-weon Jung, Junichi Yamagishi, Nicholas W. D. Evans:
Automatic Speaker Verification Spoofing and Deepfake Detection Using Wav2vec 2.0 and Data Augmentation. Odyssey 2022: 112-119 - [c25]Hye-jin Shim, Hemlata Tak, Xuechen Liu, Hee-Soo Heo, Jee-weon Jung, Joon Son Chung, Soo-Whan Chung, Ha-Jin Yu, Bong-Jin Lee, Massimiliano Todisco, Héctor Delgado, Kong Aik Lee, Md. Sahidullah, Tomi Kinnunen, Nicholas W. D. Evans:
Baseline Systems for the First Spoofing-Aware Speaker Verification Challenge: Score and Embedding Fusion. Odyssey 2022: 330-337 - [c24]Sung Hwan Mun, Jee-weon Jung, Min Hyun Han, Nam Soo Kim:
Frequency and Multi-Scale Selective Kernel Attention for Speaker Verification. SLT 2022: 548-554 - [i33]Jee-weon Jung, Hemlata Tak, Hye-jin Shim, Hee-Soo Heo, Bong-Jin Lee, Soo-Whan Chung, Hong-Goo Kang, Ha-Jin Yu, Nicholas W. D. Evans, Tomi Kinnunen:
SASV Challenge 2022: A Spoofing Aware Speaker Verification Challenge Evaluation Plan. CoRR abs/2201.10283 (2022) - [i32]Hemlata Tak, Massimiliano Todisco, Xin Wang, Jee-weon Jung, Junichi Yamagishi, Nicholas W. D. Evans:
Automatic speaker verification spoofing and deepfake detection using wav2vec 2.0 and data augmentation. CoRR abs/2202.12233 (2022) - [i31]Jee-weon Jung, You Jin Kim, Hee-Soo Heo, Bong-Jin Lee, Youngki Kwon, Joon Son Chung:
Pushing the limits of raw waveform speaker recognition. CoRR abs/2203.08488 (2022) - [i30]Sung Hwan Mun, Jee-weon Jung, Nam Soo Kim:
Selective Kernel Attention for Robust Speaker Verification. CoRR abs/2204.01005 (2022) - [i29]Hye-jin Shim, Hemlata Tak, Xuechen Liu, Hee-Soo Heo, Jee-weon Jung, Joon Son Chung, Soo-Whan Chung, Ha-Jin Yu, Bong-Jin Lee, Massimiliano Todisco, Héctor Delgado, Kong Aik Lee, Md. Sahidullah, Tomi Kinnunen, Nicholas W. D. Evans:
Baseline Systems for the First Spoofing-Aware Speaker Verification Challenge: Score and Embedding Fusion. CoRR abs/2204.09976 (2022) - [i28]Jee-weon Jung, Hee-Soo Heo, Bong-Jin Lee, Jaesong Lee, Hye-jin Shim, Youngki Kwon, Joon Son Chung, Shinji Watanabe:
Large-scale learning of generalised representations for speaker recognition. CoRR abs/2210.10985 (2022) - [i27]Jee-weon Jung, Hee-Soo Heo, Bong-Jin Lee, Jaesung Huh, Andrew Brown, Youngki Kwon, Shinji Watanabe, Joon Son Chung:
In search of strong embedding extractors for speaker diarisation. CoRR abs/2210.14682 (2022) - [i26]Kihyun Nam, Youkyum Kim, Hee Soo Heo, Jee-weon Jung, Joon Son Chung:
Disentangled representation learning for multilingual speaker recognition. CoRR abs/2211.00437 (2022) - [i25]Hee-Soo Heo, Youngki Kwon, Bong-Jin Lee, You Jin Kim, Jee-weon Jung:
High-resolution embedding extractor for speaker diarisation. CoRR abs/2211.04060 (2022) - [i24]Youngki Kwon, Hee-Soo Heo, Bong-Jin Lee, You Jin Kim, Jee-weon Jung:
Absolute decision corrupts absolutely: conservative online speaker diarisation. CoRR abs/2211.04768 (2022) - 2021
- [c23]Jee-weon Jung, Hye-jin Shim, Ju-ho Kim, Ha-Jin Yu:
DCASENET: An Integrated Pretrained Deep Neural Network for Detecting and Classifying Acoustic Scenes and Events. ICASSP 2021: 621-625 - [c22]Jee-weon Jung, Hee-Soo Heo, Ha-Jin Yu, Joon Son Chung:
Graph Attention Networks for Speaker Verification. ICASSP 2021: 6149-6153 - [c21]Hemlata Tak, Jee-weon Jung, Jose Patino, Massimiliano Todisco, Nicholas W. D. Evans:
Graph Attention Networks for Anti-Spoofing. Interspeech 2021: 2356-2360 - [c20]Jee-weon Jung, Hee-Soo Heo, Youngki Kwon, Joon Son Chung, Bong-Jin Lee:
Three-Class Overlapped Speech Detection Using a Convolutional Recurrent Neural Network. Interspeech 2021: 3086-3090 - [c19]Youngki Kwon, Jee-weon Jung, Hee-Soo Heo, You Jin Kim, Bong-Jin Lee, Joon Son Chung:
Adapting Speaker Embeddings for Speaker Diarisation. Interspeech 2021: 3101-3105 - [i23]Jee-weon Jung, Hee-Soo Heo, Youngki Kwon, Joon Son Chung, Bong-Jin Lee:
Three-class Overlapped Speech Detection using a Convolutional Recurrent Neural Network. CoRR abs/2104.02878 (2021) - [i22]Youngki Kwon, Jee-weon Jung, Hee-Soo Heo, You Jin Kim, Bong-Jin Lee, Joon Son Chung:
Adapting Speaker Embeddings for Speaker Diarisation. CoRR abs/2104.02879 (2021) - [i21]Hemlata Tak, Jee-weon Jung, Jose Patino, Massimiliano Todisco, Nicholas W. D. Evans:
Graph Attention Networks for Anti-Spoofing. CoRR abs/2104.03654 (2021) - [i20]Ju-ho Kim, Hye-jin Shim, Jee-weon Jung, Ha-Jin Yu:
Learning Metrics from Mean Teacher: A Supervised Learning Method for Improving the Generalization of Speaker Verification System. CoRR abs/2104.06604 (2021) - [i19]Hye-jin Shim, Ju-ho Kim, Jee-weon Jung, Ha-Jin Yu:
Attentive Max Feature Map for Acoustic Scene Classification with Joint Learning considering the Abstraction of Classes. CoRR abs/2104.07213 (2021) - [i18]Hemlata Tak, Jee-weon Jung, Jose Patino, Madhu R. Kamble, Massimiliano Todisco, Nicholas W. D. Evans:
End-to-End Spectro-Temporal Graph Attention Networks for Speaker Verification Anti-Spoofing and Speech Deepfake Detection. CoRR abs/2107.12710 (2021) - [i17]Jee-weon Jung, Hee-Soo Heo, Hemlata Tak, Hye-jin Shim, Joon Son Chung, Bong-Jin Lee, Ha-Jin Yu, Nicholas W. D. Evans:
AASIST: Audio Anti-Spoofing using Integrated Spectro-Temporal Graph Attention Networks. CoRR abs/2110.01200 (2021) - [i16]Youngki Kwon, Hee-Soo Heo, Jee-weon Jung, You Jin Kim, Bong-Jin Lee, Joon Son Chung:
Multi-scale speaker embedding-based graph attention networks for speaker diarisation. CoRR abs/2110.03361 (2021) - [i15]You Jin Kim, Hee-Soo Heo, Jee-weon Jung, Youngki Kwon, Bong-Jin Lee, Joon Son Chung:
Disentangled dimensionality reduction for noise-robust speaker diarisation. CoRR abs/2110.03380 (2021) - 2020
- [j1]Jee-Weon Jung, Hee-Soo Heo, Hye-Jin Shim, Ha-Jin Yu:
Knowledge Distillation in Acoustic Scene Classification. IEEE Access 8: 166870-166879 (2020) - [c18]Ju-ho Kim, Jee-Weon Jung, Hye-Jin Shim, Ha-Jin Yu:
Audio Tag Representation Guided Dual Attention Network for Acoustic Scene Classification. DCASE 2020: 76-80 - [c17]Hye-jin Shim, Hee-Soo Heo, Jee-weon Jung, Ha-Jin Yu:
Self-Supervised Pre-Training with Acoustic Configurations for Replay Spoofing Detection. INTERSPEECH 2020: 1091-1095 - [c16]Jee-weon Jung, Hye-jin Shim, Ju-ho Kim, Seung-bin Kim, Ha-Jin Yu:
Acoustic Scene Classification Using Audio Tagging. INTERSPEECH 2020: 1176-1180 - [c15]Jee-weon Jung, Seung-bin Kim, Hye-jin Shim, Ju-ho Kim, Ha-Jin Yu:
Improved RawNet with Feature Map Scaling for Text-Independent Speaker Verification Using Raw Waveforms. INTERSPEECH 2020: 1496-1500 - [c14]Seung-bin Kim, Jee-weon Jung, Hye-jin Shim, Ju-ho Kim, Ha-Jin Yu:
Segment Aggregation for Short Utterances Speaker Verification Using Raw Waveforms. INTERSPEECH 2020: 1521-1525 - [c13]Jee-Weon Jung, Ju-ho Kim, Hye-Jin Shim, Seung-bin Kim, Ha-Jin Yu:
Selective Deep Speaker Embedding Enhancement for Speaker Verification. Odyssey 2020: 171-178 - [i14]Jee-weon Jung, Hye-jin Shim, Hee-Soo Heo, Ha-Jin Yu:
A study on the role of subsidiary information in replay attack spoofing detection. CoRR abs/2001.11688 (2020) - [i13]Jee-weon Jung, Seung-bin Kim, Hye-jin Shim, Ju-ho Kim, Ha-Jin Yu:
Improved RawNet with Filter-wise Rescaling for Text-independent Speaker Verification using Raw Waveforms. CoRR abs/2004.00526 (2020) - [i12]Seung-bin Kim, Jee-weon Jung, Hye-jin Shim, Ju-ho Kim, Ha-Jin Yu:
Segment Aggregation for short utterances speaker verification using raw waveforms. CoRR abs/2005.03329 (2020) - [i11]Hye-jin Shim, Jee-weon Jung, Ju-ho Kim, Seung-bin Kim, Ha-Jin Yu:
Integrated Replay Spoofing-aware Text-independent Speaker Verification. CoRR abs/2006.05599 (2020) - [i10]Hye-jin Shim, Jee-weon Jung, Ju-ho Kim, Ha-Jin Yu:
Capturing scattered discriminative information using a deep architecture in acoustic scene classification. CoRR abs/2007.04631 (2020) - [i9]Jee-weon Jung, Hee-Soo Heo, Ha-Jin Yu, Joon Son Chung:
Graph Attention Networks for Speaker Verification. CoRR abs/2010.11543 (2020)
2010 – 2019
- 2019
- [c12]Jee-Weon Jung, Hee-Soo Heo, Hye-Jin Shim, Ha-Jin Yu:
Short Utterance Compensation in Speaker Verification via Cosine-Based Teacher-Student Learning of Speaker Embeddings. ASRU 2019: 335-341 - [c11]Jee-weon Jung, HeeSoo Heo, Hye-jin Shim, Ha-Jin Yu:
Distilling the Knowledge of Specialist Deep Neural Networks in Acoustic Scene Classification. DCASE 2019: 114-118 - [c10]Hee-Soo Heo, Jee-weon Jung, Hye-jin Shim, Ha-Jin Yu:
Acoustic Scene Classification Using Teacher-Student Learning with Soft-Labels. INTERSPEECH 2019: 614-618 - [c9]Jee-weon Jung, Hye-jin Shim, Hee-Soo Heo, Ha-Jin Yu:
Replay Attack Detection with Complementary High-Resolution Information Using End-to-End DNN for the ASVspoof 2019 Challenge. INTERSPEECH 2019: 1083-1087 - [c8]Jee-weon Jung, Hee-Soo Heo, Ju-ho Kim, Hye-jin Shim, Ha-Jin Yu:
RawNet: Advanced End-to-End Deep Neural Network Using Raw Waveforms for Text-Independent Speaker Verification. INTERSPEECH 2019: 1268-1272 - [c7]Hee-Soo Heo, Jee-weon Jung, Il-Ho Yang, Sung-Hyun Yoon, Hye-jin Shim, Ha-Jin Yu:
End-to-End Losses Based on Speaker Basis Vectors and All-Speaker Hard Negative Mining for Speaker Verification. INTERSPEECH 2019: 4035-4039 - [i8]Hee-Soo Heo, Jee-weon Jung, Il-Ho Yang, Sung-Hyun Yoon, Hye-jin Shim, Ha-Jin Yu:
End-to-end losses based on speaker basis vectors and all-speaker hard negative mining for speaker verification. CoRR abs/1902.02455 (2019) - [i7]Jee-weon Jung, Hee-Soo Heo, Ju-ho Kim, Hye-jin Shim, Ha-Jin Yu:
RawNet: Advanced end-to-end deep neural network using raw waveforms for text-independent speaker verification. CoRR abs/1904.08104 (2019) - [i6]Jee-weon Jung, Hye-jin Shim, Hee-Soo Heo, Ha-Jin Yu:
Replay attack detection with complementary high-resolution information using end-to-end DNN for the ASVspoof 2019 Challenge. CoRR abs/1904.10134 (2019) - [i5]Hee-Soo Heo, Jee-weon Jung, Hye-jin Shim, Ha-Jin Yu:
Acoustic scene classification using teacher-student learning with soft-labels. CoRR abs/1904.10135 (2019) - [i4]Hee-Soo Heo, Jee-weon Jung, Hye-jin Shim, Il-Ho Yang, Ha-Jin Yu:
Cosine similarity-based adversarial process. CoRR abs/1907.00542 (2019) - [i3]Hye-jin Shim, Hee-Soo Heo, Jee-weon Jung, Ha-Jin Yu:
Self-supervised pre-training with acoustic configurations for replay spoofing detection. CoRR abs/1910.09778 (2019) - 2018
- [c6]Jee-weon Jung, Hee-Soo Heo, Hye-jin Shim, Ha-Jin Yu:
DNN based multi-level feature ensemble for acoustic scene classification. DCASE 2018: 118-122 - [c5]Jee-weon Jung, Hee-Soo Heo, Il-Ho Yang, Hye-jin Shim, Ha-Jin Yu:
A Complete End-to-End Speaker Verification System Using Deep Neural Networks: From Raw Signals to Verification Result. ICASSP 2018: 5349-5353 - [c4]Jee-weon Jung, Hee-Soo Heo, Il-Ho Yang, Hye-jin Shim, Ha-Jin Yu:
Avoiding Speaker Overfitting in End-to-End DNNs Using Raw Waveform for Text-Independent Speaker Verification. INTERSPEECH 2018: 3583-3587 - [c3]Hye-jin Shim, Jee-weon Jung, Hee-Soo Heo, Sung-Hyun Yoon, Ha-Jin Yu:
Replay Spoofing Detection System for Automatic Speaker Verification Using Multi-Task Learning of Noise Classes. TAAI 2018: 172-176 - [i2]Hye-Jin Shim, Jee-Weon Jung, Hee-Soo Heo, Sung-Hyun Yoon, Ha-Jin Yu:
Replay attack spoofing detection system using replay noise by multi-task learning. CoRR abs/1808.09638 (2018) - [i1]Jee-weon Jung, Hee-Soo Heo, Hye-jin Shim, Ha-Jin Yu:
Short utterance compensation in speaker verification via cosine-based teacher-student learning of speaker embeddings. CoRR abs/1810.10884 (2018) - 2017
- [c2]Jee-Weon Jung, Hee-Soo Heo, Il-Ho Yang, Sung-Hyun Yoon, Hye-Jin Shim, Ha-Jin Yu:
DNN-Based Audio Scene Classification for DCASE2017: Dual Input Features, Balancing Cost, and Stochastic Data Duplication. DCASE 2017: 59-63 - [c1]Hee-Soo Heo, Jee-weon Jung, Il-Ho Yang, Sung-Hyun Yoon, Ha-Jin Yu:
Joint Training of Expanded End-to-End DNN for Text-Dependent Speaker Verification. INTERSPEECH 2017: 1532-1536
Coauthor Index
aka: Hye-jin Shim
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-11-14 00:51 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint