Minsu Kim 0001
Person information
- affiliation: Korea Advanced Institute of Science and Technology (KAIST), School of Electrical Engineering, Integrated Vision and Language Laboratory, Daejeon, South Korea
- affiliation: Yonsei University, School of Electrical and Electronic Engineering, Seoul, South Korea
Other persons with the same name
- Minsu Kim (aka: Min-su Kim, Min-Su Kim) — disambiguation page
- Minsu Kim 0002 — Samsung Electronics Company Ltd, Network Business, Suwon, South Korea (and 1 more)
- Minsu Kim 0003 — Virginia Tech, Bradley Department of Electrical and Computer Engineering, Arlington, VA, USA
- Minsu Kim 0004 — Korea Advanced Institute of Science and Technology (KAIST), Department of Electrical Engineering, Daejeon, South Korea
2020 – today
- 2024
- [j4] Minsu Kim, Jeongsoo Choi, Dahun Kim, Yong Man Ro: Textless Unit-to-Unit Training for Many-to-Many Multilingual Speech-to-Speech Translation. IEEE ACM Trans. Audio Speech Lang. Process. 32: 3934-3946 (2024)
- [j3] Jeong Hun Yeo, Minsu Kim, Jeongsoo Choi, Dae Hoe Kim, Yong Man Ro: AKVSR: Audio Knowledge Empowered Visual Speech Recognition by Compressing Audio Knowledge of a Pretrained Model. IEEE Trans. Multim. 26: 6462-6474 (2024)
- [c25] Se Jin Park, Chae Won Kim, Hyeongseop Rha, Minsu Kim, Joanna Hong, Jeong Hun Yeo, Yong Man Ro: Let's Go Real Talk: Spoken Dialogue Model for Face-to-Face Conversation. ACL (1) 2024: 16334-16348
- [c24] Jeongsoo Choi, Se Jin Park, Minsu Kim, Yong Man Ro: AV2AV: Direct Audio-Visual Speech to Audio-Visual Speech Translation with Unified Audio-Visual Speech Representation. CVPR 2024: 27315-27327
- [c23] Jeong Hun Yeo, Seunghee Han, Minsu Kim, Yong Man Ro: Where Visual Speech Meets Language: VSP-LLM Framework for Efficient and Context-Aware Visual Speech Processing. EMNLP (Findings) 2024: 11391-11406
- [c22] Se Jin Park, Minsu Kim, Jeongsoo Choi, Yong Man Ro: Exploring Phonetic Context-Aware Lip-Sync for Talking Face Generation. ICASSP 2024: 4325-4329
- [c21] Minsu Kim, Jeongsoo Choi, Soumi Maiti, Jeong Hun Yeo, Shinji Watanabe, Yong Man Ro: Towards Practical and Efficient Image-to-Speech Captioning with Vision-Language Pre-Training and Multi-Modal Tokens. ICASSP 2024: 7970-7974
- [c20] Jeongsoo Choi, Minsu Kim, Se Jin Park, Yong Man Ro: Text-Driven Talking Face Synthesis by Reprogramming Audio-Driven Models. ICASSP 2024: 8065-8069
- [c19] Jeong Hun Yeo, Minsu Kim, Shinji Watanabe, Yong Man Ro: Visual Speech Recognition for Languages with Limited Labeled Data Using Automatic Labels from Whisper. ICASSP 2024: 10471-10475
- [c18] Minsu Kim, Jeong Hun Yeo, Se Jin Park, Hyeongseop Rha, Yong Man Ro: Efficient Training for Multilingual Visual Speech Recognition: Pre-training with Discretized Visual Speech Representation. ACM Multimedia 2024: 1311-1320
- [i27] Minsu Kim, Jeong Hun Yeo, Jeongsoo Choi, Se Jin Park, Yong Man Ro: Multilingual Visual Speech Recognition with a Single Model by Learning with Discrete Visual Speech Units. CoRR abs/2401.09802 (2024)
- [i26] Jeong Hun Yeo, Seunghee Han, Minsu Kim, Yong Man Ro: Where Visual Speech Meets Language: VSP-LLM Framework for Efficient and Context-Aware Visual Speech Processing. CoRR abs/2402.15151 (2024)
- [i25] Minsu Kim, Jee-weon Jung, Hyeongseop Rha, Soumi Maiti, Siddhant Arora, Xuankai Chang, Shinji Watanabe, Yong Man Ro: TMT: Tri-Modal Translation between Speech, Image, and Text by Processing Different Modalities as Different Languages. CoRR abs/2402.16021 (2024)
- [i24] Se Jin Park, Chae Won Kim, Hyeongseop Rha, Minsu Kim, Joanna Hong, Jeong Hun Yeo, Yong Man Ro: Let's Go Real Talk: Spoken Dialogue Model for Face-to-Face Conversation. CoRR abs/2406.07867 (2024)
- 2023
- [c17] Minsu Kim, Chae Won Kim, Yong Man Ro: Deep Visual Forced Alignment: Learning to Align Transcription with Talking Face Video. AAAI 2023: 8273-8281
- [c16] Joanna Hong, Minsu Kim, Jeongsoo Choi, Yong Man Ro: Watch or Listen: Robust Audio-Visual Speech Recognition with Visual Corruption Modeling and Reliability Scoring. CVPR 2023: 18783-18794
- [c15] Minsu Kim, Joanna Hong, Yong Man Ro: Lip-to-Speech Synthesis in the Wild with Multi-Task Learning. ICASSP 2023: 1-5
- [c14] Jeong Hun Yeo, Minsu Kim, Yong Man Ro: Multi-Temporal Lip-Audio Memory for Visual Speech Recognition. ICASSP 2023: 1-5
- [c13] Minsu Kim, Jeong Hun Yeo, Jeongsoo Choi, Yong Man Ro: Lip Reading for Low-resource Languages by Learning and Combining General Speech Knowledge and Language-specific Knowledge. ICCV 2023: 15313-15325
- [c12] Jeongsoo Choi, Minsu Kim, Yong Man Ro: Intelligible Lip-to-Speech Synthesis with Speech Units. INTERSPEECH 2023: 4349-4353
- [i23] Minsu Kim, Hyung-Il Kim, Yong Man Ro: Prompt Tuning of Deep Neural Networks for Speaker-adaptive Visual Speech Recognition. CoRR abs/2302.08102 (2023)
- [i22] Minsu Kim, Joanna Hong, Yong Man Ro: Lip-to-Speech Synthesis in the Wild with Multi-task Learning. CoRR abs/2302.08841 (2023)
- [i21] Joanna Hong, Minsu Kim, Jeongsoo Choi, Yong Man Ro: Watch or Listen: Robust Audio-Visual Speech Recognition with Visual Corruption Modeling and Reliability Scoring. CoRR abs/2303.08536 (2023)
- [i20] Minsu Kim, Chae Won Kim, Yong Man Ro: Deep Visual Forced Alignment: Learning to Align Transcription with Talking Face Video. CoRR abs/2303.08670 (2023)
- [i19] Jeong Hun Yeo, Minsu Kim, Yong Man Ro: Multi-Temporal Lip-Audio Memory for Visual Speech Recognition. CoRR abs/2305.04542 (2023)
- [i18] Se Jin Park, Minsu Kim, Jeongsoo Choi, Yong Man Ro: Exploring Phonetic Context in Lip Movement for Authentic Talking Face Generation. CoRR abs/2305.19556 (2023)
- [i17] Jeongsoo Choi, Minsu Kim, Yong Man Ro: Intelligible Lip-to-Speech Synthesis with Speech Units. CoRR abs/2305.19603 (2023)
- [i16] Jeongsoo Choi, Minsu Kim, Se Jin Park, Yong Man Ro: Reprogramming Audio-driven Talking Face Synthesis into Text-driven. CoRR abs/2306.16003 (2023)
- [i15] Minsu Kim, Jeongsoo Choi, Dahun Kim, Yong Man Ro: Many-to-Many Spoken Language Translation via Unified Speech and Text Representation Learning with Unit-to-Unit Translation. CoRR abs/2308.01831 (2023)
- [i14] Jeong Hun Yeo, Minsu Kim, Jeongsoo Choi, Dae Hoe Kim, Yong Man Ro: AKVSR: Audio Knowledge Empowered Visual Speech Recognition by Compressing Audio Knowledge of a Pretrained Model. CoRR abs/2308.07593 (2023)
- [i13] Minsu Kim, Jeong Hun Yeo, Jeongsoo Choi, Yong Man Ro: Lip Reading for Low-resource Languages by Learning and Combining General Speech Knowledge and Language-specific Knowledge. CoRR abs/2308.09311 (2023)
- [i12] Minsu Kim, Jeongsoo Choi, Soumi Maiti, Jeong Hun Yeo, Shinji Watanabe, Yong Man Ro: Towards Practical and Efficient Image-to-Speech Captioning with Vision-Language Pre-training and Multi-modal Tokens. CoRR abs/2309.08531 (2023)
- [i11] Jeong Hun Yeo, Minsu Kim, Shinji Watanabe, Yong Man Ro: Visual Speech Recognition for Low-resource Languages with Automatic Labels From Whisper Model. CoRR abs/2309.08535 (2023)
- [i10] Se Jin Park, Joanna Hong, Minsu Kim, Yong Man Ro: DF-3DFace: One-to-Many Speech Synchronized 3D Face Animation with Diffusion. CoRR abs/2310.05934 (2023)
- [i9] Jeongsoo Choi, Se Jin Park, Minsu Kim, Yong Man Ro: AV2AV: Direct Audio-Visual Speech to Audio-Visual Speech Translation with Unified Audio-Visual Speech Representation. CoRR abs/2312.02512 (2023)
- 2022
- [j2] Minsu Kim, Joanna Hong, Se Jin Park, Yong Man Ro: CroMM-VSR: Cross-Modal Memory Augmented Visual Speech Recognition. IEEE Trans. Multim. 24: 4342-4355 (2022)
- [c11] Minsu Kim, Jeong Hun Yeo, Yong Man Ro: Distinguishing Homophenes Using Multi-Head Visual-Audio Memory for Lip Reading. AAAI 2022: 1174-1182
- [c10] Se Jin Park, Minsu Kim, Joanna Hong, Jeongsoo Choi, Yong Man Ro: SyncTalkFace: Talking Face Generation with Precise Lip-Syncing via Audio-Lip Memory. AAAI 2022: 2062-2070
- [c9] Joanna Hong, Minsu Kim, Yong Man Ro: VisageSynTalk: Unseen Speaker Video-to-Speech Synthesis via Speech-Visage Feature Selection. ECCV (36) 2022: 452-468
- [c8] Minsu Kim, Hyunjun Kim, Yong Man Ro: Speaker-Adaptive Lip Reading with User-Dependent Padding. ECCV (36) 2022: 576-593
- [c7] Joanna Hong, Minsu Kim, Daehun Yoo, Yong Man Ro: Visual Context-driven Audio Feature Enhancement for Robust End-to-End Audio-Visual Speech Recognition. INTERSPEECH 2022: 2838-2842
- [i8] Minsu Kim, Joanna Hong, Se Jin Park, Yong Man Ro: Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video. CoRR abs/2204.01265 (2022)
- [i7] Minsu Kim, Jeong Hun Yeo, Yong Man Ro: Distinguishing Homophenes Using Multi-Head Visual-Audio Memory for Lip Reading. CoRR abs/2204.01725 (2022)
- [i6] Minsu Kim, Joanna Hong, Yong Man Ro: Lip to Speech Synthesis with Visual Context Attentional GAN. CoRR abs/2204.01726 (2022)
- [i5] Joanna Hong, Minsu Kim, Yong Man Ro: VisageSynTalk: Unseen Speaker Video-to-Speech Synthesis via Speech-Visage Feature Selection. CoRR abs/2206.07458 (2022)
- [i4] Joanna Hong, Minsu Kim, Daehun Yoo, Yong Man Ro: Visual Context-driven Audio Feature Enhancement for Robust End-to-End Audio-Visual Speech Recognition. CoRR abs/2207.06020 (2022)
- [i3] Minsu Kim, Hyunjun Kim, Yong Man Ro: Speaker-adaptive Lip Reading with User-dependent Padding. CoRR abs/2208.04498 (2022)
- [i2] Minsu Kim, Youngjoon Yu, Sungjune Park, Yong Man Ro: Meta Input: How to Leverage Off-the-Shelf Deep Neural Networks. CoRR abs/2210.13186 (2022)
- [i1] Se Jin Park, Minsu Kim, Joanna Hong, Jeongsoo Choi, Yong Man Ro: SyncTalkFace: Talking Face Generation with Precise Lip-Syncing via Audio-Lip Memory. CoRR abs/2211.00924 (2022)
- 2021
- [j1] Joanna Hong, Minsu Kim, Se Jin Park, Yong Man Ro: Speech Reconstruction With Reminiscent Sound Via Visual Voice Memory. IEEE ACM Trans. Audio Speech Lang. Process. 29: 3654-3667 (2021)
- [c6] Minsu Kim, Joanna Hong, Se Jin Park, Yong Man Ro: Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video. ICCV 2021: 296-306
- [c5] Junho Kim, Minsu Kim, Yong Man Ro: Interpretation of Lesional Detection via Counterfactual Generation. ICIP 2021: 96-100
- [c4] Minsu Kim, Joanna Hong, Yong Man Ro: Lip to Speech Synthesis with Visual Context Attentional GAN. NeurIPS 2021: 2758-2770
- 2020
- [c3] Minsu Kim, Hong Joo Lee, Sangmin Lee, Yong Man Ro: Robust Video Facial Authentication With Unsupervised Mode Disentanglement. ICIP 2020: 1321-1325
- [c2] Junho Kim, Minsu Kim, Jung Uk Kim, Hong Joo Lee, Sangmin Lee, Joanna Hong, Yong Man Ro: Learning Style Correlation for Elaborate Few-Shot Classification. ICIP 2020: 1791-1795
- [c1] Minsu Kim, Joanna Hong, Junho Kim, Hong Joo Lee, Yong Man Ro: Unsupervised Disentangling of Viewpoint and Residues Variations by Substituting Representations for Robust Face Recognition. ICPR 2020: 8952-8959
last updated on 2024-12-26 01:55 CET by the dblp team