Joanna Hong
2020 – today
- 2024
- [c14] Se Jin Park, Chae Won Kim, Hyeongseop Rha, Minsu Kim, Joanna Hong, Jeong Hun Yeo, Yong Man Ro: Let's Go Real Talk: Spoken Dialogue Model for Face-to-Face Conversation. ACL (1) 2024: 16334-16348
- [i12] Se Jin Park, Chae Won Kim, Hyeongseop Rha, Minsu Kim, Joanna Hong, Jeong Hun Yeo, Yong Man Ro: Let's Go Real Talk: Spoken Dialogue Model for Face-to-Face Conversation. CoRR abs/2406.07867 (2024)
- 2023
- [c13] Joanna Hong, Minsu Kim, Jeongsoo Choi, Yong Man Ro: Watch or Listen: Robust Audio-Visual Speech Recognition with Visual Corruption Modeling and Reliability Scoring. CVPR 2023: 18783-18794
- [c12] Joanna Hong, Se Jin Park, Yong Man Ro: Intuitive Multilingual Audio-Visual Speech Recognition with a Single-Trained Model. EMNLP (Findings) 2023: 4886-4890
- [c11] Minsu Kim, Joanna Hong, Yong Man Ro: Lip-to-Speech Synthesis in the Wild with Multi-Task Learning. ICASSP 2023: 1-5
- [c10] Jeongsoo Choi, Joanna Hong, Yong Man Ro: DiffV2S: Diffusion-based Video-to-Speech Synthesis with Vision-guided Speaker Embedding. ICCV 2023: 7778-7787
- [i11] Minsu Kim, Joanna Hong, Yong Man Ro: Lip-to-Speech Synthesis in the Wild with Multi-task Learning. CoRR abs/2302.08841 (2023)
- [i10] Joanna Hong, Minsu Kim, Jeongsoo Choi, Yong Man Ro: Watch or Listen: Robust Audio-Visual Speech Recognition with Visual Corruption Modeling and Reliability Scoring. CoRR abs/2303.08536 (2023)
- [i9] Jeongsoo Choi, Joanna Hong, Yong Man Ro: DiffV2S: Diffusion-based Video-to-Speech Synthesis with Vision-guided Speaker Embedding. CoRR abs/2308.07787 (2023)
- [i8] Se Jin Park, Joanna Hong, Minsu Kim, Yong Man Ro: DF-3DFace: One-to-Many Speech Synchronized 3D Face Animation with Diffusion. CoRR abs/2310.05934 (2023)
- [i7] Joanna Hong, Se Jin Park, Yong Man Ro: Intuitive Multilingual Audio-Visual Speech Recognition with a Single-Trained Model. CoRR abs/2310.14946 (2023)
- 2022
- [j2] Minsu Kim, Joanna Hong, Se Jin Park, Yong Man Ro: CroMM-VSR: Cross-Modal Memory Augmented Visual Speech Recognition. IEEE Trans. Multim. 24: 4342-4355 (2022)
- [c9] Se Jin Park, Minsu Kim, Joanna Hong, Jeongsoo Choi, Yong Man Ro: SyncTalkFace: Talking Face Generation with Precise Lip-Syncing via Audio-Lip Memory. AAAI 2022: 2062-2070
- [c8] Joanna Hong, Minsu Kim, Yong Man Ro: VisageSynTalk: Unseen Speaker Video-to-Speech Synthesis via Speech-Visage Feature Selection. ECCV (36) 2022: 452-468
- [c7] Joanna Hong, Minsu Kim, Daehun Yoo, Yong Man Ro: Visual Context-driven Audio Feature Enhancement for Robust End-to-End Audio-Visual Speech Recognition. INTERSPEECH 2022: 2838-2842
- [i6] Minsu Kim, Joanna Hong, Se Jin Park, Yong Man Ro: Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video. CoRR abs/2204.01265 (2022)
- [i5] Minsu Kim, Joanna Hong, Yong Man Ro: Lip to Speech Synthesis with Visual Context Attentional GAN. CoRR abs/2204.01726 (2022)
- [i4] Joanna Hong, Minsu Kim, Yong Man Ro: VisageSynTalk: Unseen Speaker Video-to-Speech Synthesis via Speech-Visage Feature Selection. CoRR abs/2206.07458 (2022)
- [i3] Joanna Hong, Minsu Kim, Daehun Yoo, Yong Man Ro: Visual Context-driven Audio Feature Enhancement for Robust End-to-End Audio-Visual Speech Recognition. CoRR abs/2207.06020 (2022)
- [i2] Se Jin Park, Minsu Kim, Joanna Hong, Jeongsoo Choi, Yong Man Ro: SyncTalkFace: Talking Face Generation with Precise Lip-Syncing via Audio-Lip Memory. CoRR abs/2211.00924 (2022)
- 2021
- [j1] Joanna Hong, Minsu Kim, Se Jin Park, Yong Man Ro: Speech Reconstruction With Reminiscent Sound Via Visual Voice Memory. IEEE ACM Trans. Audio Speech Lang. Process. 29: 3654-3667 (2021)
- [c6] Minsu Kim, Joanna Hong, Se Jin Park, Yong Man Ro: Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video. ICCV 2021: 296-306
- [c5] Minsu Kim, Joanna Hong, Yong Man Ro: Lip to Speech Synthesis with Visual Context Attentional GAN. NeurIPS 2021: 2758-2770
- 2020
- [c4] Joanna Hong, Jung Uk Kim, Sangmin Lee, Yong Man Ro: Comprehensive Facial Expression Synthesis Using Human-Interpretable Language. ICIP 2020: 1641-1645
- [c3] Junho Kim, Minsu Kim, Jung Uk Kim, Hong Joo Lee, Sangmin Lee, Joanna Hong, Yong Man Ro: Learning Style Correlation for Elaborate Few-Shot Classification. ICIP 2020: 1791-1795
- [c2] Minsu Kim, Joanna Hong, Junho Kim, Hong Joo Lee, Yong Man Ro: Unsupervised Disentangling of Viewpoint and Residues Variations by Substituting Representations for Robust Face Recognition. ICPR 2020: 8952-8959
- [c1] Joanna Hong, Hong Joo Lee, Yelin Kim, Yong Man Ro: Face Tells Detailed Expression: Generating Comprehensive Facial Expression Sentence Through Facial Action Units. MMM (2) 2020: 100-111
- [i1] Joanna Hong, Jung Uk Kim, Sangmin Lee, Yong Man Ro: Comprehensive Facial Expression Synthesis using Human-Interpretable Language. CoRR abs/2007.08154 (2020)
last updated on 2025-01-21 00:24 CET by the dblp team
all metadata released as open data under CC0 1.0 license