Tao Jin 0004
Person information
- affiliation: Zhejiang University, Hangzhou, China
Other persons with the same name
- Tao Jin — disambiguation page
- Tao Jin 0001 — Tsinghua University, Beijing, China
- Tao Jin 0002 — University of Virginia, VA, USA
- Tao Jin 0003 — University of Pittsburgh, Pittsburgh, PA, USA
- Tao Jin 0005 — China University of Petroleum, Qingdao, China
- Tao Jin 0006 — Fuzhou University, Fuzhou, China (and 1 more)
Journal Articles
- 2024
- [j5] Min Tan, Ruirui Wang, Ankur Purwar, Tao Jin, Jun Yu, Alex C. Kot:
GTADT: Gated tone-sensitive acne grading via augmented domain transfer. Multim. Tools Appl. 83(8): 24875-24897 (2024)
- [j4] Linjun Li, Tao Jin, Wang Lin, Hao Jiang, Wenwen Pan, Jian Wang, Shuwen Xiao, Yan Xia, Weihao Jiang, Zhou Zhao:
Multi-Granularity Relational Attention Network for Audio-Visual Question Answering. IEEE Trans. Circuits Syst. Video Technol. 34(8): 7080-7094 (2024)
- 2023
- [j3] Min Tan, Tao Jin, Danhui Ye, Kuiwen Xu, Xiaoling Gu, Jun Yu:
Electromagnetic Imaging Boosted Visual Object Recognition Under Difficult Visual Conditions. IEEE Trans. Geosci. Remote. Sens. 61: 1-12 (2023)
- 2022
- [j2] Tao Jin, Zhou Zhao, Peng Wang, Jun Yu, Fei Wu:
Interaction augmented transformer with decoupled decoding for video captioning. Neurocomputing 492: 496-507 (2022)
- 2019
- [j1] Tao Jin, Yingming Li, Zhongfei Zhang:
Recurrent convolutional video captioning with global and local attention. Neurocomputing 370: 118-127 (2019)
Conference and Workshop Papers
- 2024
- [c23] Zirun Guo, Tao Jin, Zhou Zhao:
Multimodal Prompt Learning with Missing Modalities for Sentiment Analysis and Emotion Recognition. ACL (1) 2024: 1726-1736
- [c22] Tao Jin, Wang Lin, Ye Wang, Linjun Li, Xize Cheng, Zhou Zhao:
Rethinking the Multimodal Correlation of Multimodal Sequential Learning via Generalizable Attentional Results Alignment. ACL (1) 2024: 5247-5265
- [c21] Xize Cheng, Rongjie Huang, Linjun Li, Zehan Wang, Tao Jin, Aoxiong Yin, Feiyang Chen, Xinyu Duan, Baoxing Huai, Zhou Zhao:
TransFace: Unit-Based Audio-Visual Speech Synthesizer for Talking Head Translation. ACL (Findings) 2024: 9973-9986
- [c20] Songju Lei, Xize Cheng, Mengjiao Lyu, Jianqiao Hu, Jintao Tan, Runlin Liu, Lingyu Xiong, Tao Jin, Xiandong Li, Zhou Zhao:
Uni-Dubbing: Zero-Shot Speech Synthesis from Visual Articulation. ACL (1) 2024: 10082-10099
- [c19] Jimin Xu, Tianbao Wang, Tao Jin, Shengyu Zhang, Dongjie Fu, Zhe Wang, Jiangjing Lyu, Chengfei Lv, Chaoyue Niu, Zhou Yu, Zhou Zhao, Fei Wu:
MPOD123: One Image to 3D Content Generation Using Mask-Enhanced Progressive Outline-to-Detail Optimization. CVPR 2024: 10682-10692
- [c18] Zehan Wang, Ziang Zhang, Xize Cheng, Rongjie Huang, Luping Liu, Zhenhui Ye, Haifeng Huang, Yang Zhao, Tao Jin, Peng Gao, Zhou Zhao:
FreeBind: Free Lunch in Unified Multimodal Space via Knowledge Fusion. ICML 2024
- [c17] Wang Lin, Jingyuan Chen, Jiaxin Shi, Yichen Zhu, Chen Liang, Junzhong Miao, Tao Jin, Zhou Zhao, Fei Wu, Shuicheng Yan, Hanwang Zhang:
Non-confusing Generation of Customized Concepts in Diffusion Models. ICML 2024
- [c16] Ye Wang, Jiahao Xun, Minjie Hong, Jieming Zhu, Tao Jin, Wang Lin, Haoyuan Li, Linjun Li, Yan Xia, Zhou Zhao, Zhenhua Dong:
EAGER: Two-Stream Generative Recommender with Behavior-Semantic Collaboration. KDD 2024: 3245-3254
- 2023
- [c15] Xize Cheng, Tao Jin, Linjun Li, Wang Lin, Xinyu Duan, Zhou Zhao:
OpenSR: Open-Modality Speech Recognition via Maintaining Multi-Modality Alignment. ACL (1) 2023: 6592-6607
- [c14] Ye Wang, Tao Jin, Wang Lin, Xize Cheng, Linjun Li, Zhou Zhao:
Semantic-conditioned Dual Adaptation for Cross-domain Query-based Visual Segmentation. ACL (Findings) 2023: 9797-9815
- [c13] Ye Wang, Wang Lin, Shengyu Zhang, Tao Jin, Linjun Li, Xize Cheng, Zhou Zhao:
Weakly-Supervised Spoken Video Grounding via Semantic Interaction Learning. ACL (1) 2023: 10914-10932
- [c12] Linjun Li, Tao Jin, Xize Cheng, Ye Wang, Wang Lin, Rongjie Huang, Zhou Zhao:
Contrastive Token-Wise Meta-Learning for Unseen Performer Visual Temporal-Aligned Translation. ACL (Findings) 2023: 10993-11007
- [c11] Wang Lin, Tao Jin, Wenwen Pan, Linjun Li, Xize Cheng, Ye Wang, Zhou Zhao:
TAVT: Towards Transferable Audio-Visual Text Generation. ACL (1) 2023: 14983-14999
- [c10] Wang Lin, Tao Jin, Ye Wang, Wenwen Pan, Linjun Li, Xize Cheng, Zhou Zhao:
Exploring Group Video Captioning with Efficient Relational Approximation. ICCV 2023: 15235-15244
- [c9] Xize Cheng, Tao Jin, Rongjie Huang, Linjun Li, Wang Lin, Zehan Wang, Ye Wang, Huadai Liu, Aoxiong Yin, Zhou Zhao:
MixSpeech: Cross-Modality Self-Learning with Audio-Visual Stream Mixup for Visual Speech Translation and Recognition. ICCV 2023: 15689-15699
- [c8] Tao Jin, Xize Cheng, Linjun Li, Wang Lin, Ye Wang, Zhou Zhao:
Rethinking Missing Modality Learning from a Decoding Perspective. ACM Multimedia 2023: 4431-4439
- 2022
- [c7] Tao Jin, Zhou Zhao, Meng Zhang, Xingshan Zeng:
Prior Knowledge and Memory Enriched Transformer for Sign Language Translation. ACL (Findings) 2022: 3766-3775
- [c6] Tao Jin, Zhou Zhao, Meng Zhang, Xingshan Zeng:
MC-SLT: Towards Low-Resource Signer-Adaptive Sign Language Translation. ACM Multimedia 2022: 4939-4947
- 2021
- [c5] Tao Jin, Zhou Zhao:
Contrastive Disentangled Meta-Learning for Signer-Independent Sign Language Translation. ACM Multimedia 2021: 5065-5073
- [c4] Tao Jin, Zhou Zhao:
Generalizable Multi-linear Attention Network. NeurIPS 2021: 9049-9060
- 2020
- [c3] Tao Jin, Siyu Huang, Yingming Li, Zhongfei Zhang:
Dual Low-Rank Multimodal Fusion. EMNLP (Findings) 2020: 377-387
- [c2] Tao Jin, Siyu Huang, Ming Chen, Yingming Li, Zhongfei Zhang:
SBAT: Video Captioning with Sparse Boundary-Aware Transformer. IJCAI 2020: 630-636
- 2019
- [c1] Tao Jin, Siyu Huang, Yingming Li, Zhongfei Zhang:
Low-Rank HOCA: Efficient High-Order Cross-Modal Attention for Video Captioning. EMNLP/IJCNLP (1) 2019: 2001-2011
Informal and Other Publications
- 2024
- [i11] Zehan Wang, Ziang Zhang, Xize Cheng, Rongjie Huang, Luping Liu, Zhenhui Ye, Haifeng Huang, Yang Zhao, Tao Jin, Peng Gao, Zhou Zhao:
FreeBind: Free Lunch in Unified Multimodal Space via Knowledge Fusion. CoRR abs/2405.04883 (2024)
- [i10] Wang Lin, Jingyuan Chen, Jiaxin Shi, Yichen Zhu, Chen Liang, Junzhong Miao, Tao Jin, Zhou Zhao, Fei Wu, Shuicheng Yan, Hanwang Zhang:
Non-confusing Generation of Customized Concepts in Diffusion Models. CoRR abs/2405.06914 (2024)
- [i9] Ye Wang, Jiahao Xun, Mingjie Hong, Jieming Zhu, Tao Jin, Wang Lin, Haoyuan Li, Linjun Li, Yan Xia, Zhou Zhao, Zhenhua Dong:
EAGER: Two-Stream Generative Recommender with Behavior-Semantic Collaboration. CoRR abs/2406.14017 (2024)
- [i8] Zirun Guo, Tao Jin, Zhou Zhao:
Multimodal Prompt Learning with Missing Modalities for Sentiment Analysis and Emotion Recognition. CoRR abs/2407.05374 (2024)
- 2023
- [i7] Xize Cheng, Linjun Li, Tao Jin, Rongjie Huang, Wang Lin, Zehan Wang, Huangdai Liu, Ye Wang, Aoxiong Yin, Zhou Zhao:
MixSpeech: Cross-Modality Self-Learning with Audio-Visual Stream Mixup for Visual Speech Translation and Recognition. CoRR abs/2303.05309 (2023)
- [i6] Xize Cheng, Tao Jin, Linjun Li, Wang Lin, Xinyu Duan, Zhou Zhao:
OpenSR: Open-Modality Speech Recognition via Maintaining Multi-Modality Alignment. CoRR abs/2306.06410 (2023)
- [i5] Zehan Wang, Ziang Zhang, Luping Liu, Yang Zhao, Haifeng Huang, Tao Jin, Zhou Zhao:
Extending Multi-modal Contrastive Representations. CoRR abs/2310.08884 (2023)
- [i4] Haifeng Huang, Zehan Wang, Rongjie Huang, Luping Liu, Xize Cheng, Yang Zhao, Tao Jin, Zhou Zhao:
Chat-3D v2: Bridging 3D Scene and Large Language Models with Object Identifiers. CoRR abs/2312.08168 (2023)
- [i3] Xize Cheng, Rongjie Huang, Linjun Li, Tao Jin, Zehan Wang, Aoxiong Yin, Minglei Li, Xinyu Duan, Changpeng Yang, Zhou Zhao:
TransFace: Unit-Based Audio-Visual Speech Synthesizer for Talking Head Translation. CoRR abs/2312.15197 (2023)
- 2020
- [i2] Tao Jin, Siyu Huang, Ming Chen, Yingming Li, Zhongfei Zhang:
SBAT: Video Captioning with Sparse Boundary-Aware Transformer. CoRR abs/2007.11888 (2020)
- 2019
- [i1] Tao Jin, Siyu Huang, Yingming Li, Zhongfei Zhang:
Low-Rank HOCA: Efficient High-Order Cross-Modal Attention for Video Captioning. CoRR abs/1911.00212 (2019)
last updated on 2024-10-07 22:21 CEST by the dblp team
all metadata released as open data under CC0 1.0 license