default search action
Longteng Guo
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2025
- [j3]Jing Liu, Sihan Chen, Xingjian He, Longteng Guo, Xinxin Zhu, Weining Wang, Jinhui Tang:
VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset. IEEE Trans. Pattern Anal. Mach. Intell. 47(2): 708-724 (2025) - 2024
- [j2]Wenxuan Wang, Xingjian He, Yisi Zhang, Longteng Guo, Jiachen Shen, Jiangyun Li, Jing Liu:
CM-MaskSD: Cross-Modality Masked Self-Distillation for Referring Image Segmentation. IEEE Trans. Multim. 26: 6906-6916 (2024) - [c19]Junyi Chen, Longteng Guo, Jia Sun, Shuai Shao, Zehuan Yuan, Liang Lin, Dongyu Zhang:
EVE: Efficient Vision-Language Pre-training with Masked Prediction and Modality-Aware MoE. AAAI 2024: 1110-1119 - [c18]Erdong Hu, Longteng Guo, Tongtian Yue, Zijia Zhao, Shuning Xue, Jing Liu:
OneDiff: A Generalist Model for Image Difference Captioning. ACCV (3) 2024: 114-130 - [c17]Wenxuan Wang, Tongtian Yue, Yisi Zhang, Longteng Guo, Xingjian He, Xinlong Wang, Jing Liu:
Unveiling Parts Beyond Objects: Towards Finer-Granularity Referring Expression Segmentation. CVPR 2024: 12998-13008 - [c16]Tongtian Yue, Jie Cheng, Longteng Guo, Xingyuan Dai, Zijia Zhao, Xingjian He, Gang Xiong, Yisheng Lv, Jing Liu:
SC- Tune: Unleashing Self-Consistent Referential Comprehension in Large Vision Language Models. CVPR 2024: 13073-13083 - [c15]Dongze Hao, Qunbo Wang, Longteng Guo, Jie Jiang, Jing Liu:
Self-Bootstrapped Visual-Language Model for Knowledge Selection and Question Answering. EMNLP 2024: 1857-1868 - [c14]Shichen Lu, Longteng Guo, Wenxuan Wang, Zijia Zhao, Tongtian Yue, Jing Liu, Si Liu:
Collaborative Training of Tiny-Large Vision Language Models. ACM Multimedia 2024: 4928-4937 - [c13]Mingzhen Sun, Weining Wang, Yanyuan Qiao, Jiahui Sun, Zihan Qin, Longteng Guo, Xinxin Zhu, Jing Liu:
MM-LDM: Multi-Modal Latent Diffusion Model for Sounding Video Generation. ACM Multimedia 2024: 10853-10861 - [i24]Dongze Hao, Jian Jia, Longteng Guo, Qunbo Wang, Te Yang, Yan Li, Yanhua Cheng, Bo Wang, Quan Chen, Han Li, Jing Liu:
Knowledge Condensation and Reasoning for Knowledge-based VQA. CoRR abs/2403.10037 (2024) - [i23]Tongtian Yue, Jie Cheng, Longteng Guo, Xingyuan Dai, Zijia Zhao, Xingjian He, Gang Xiong, Yisheng Lv, Jing Liu:
SC-Tune: Unleashing Self-Consistent Referential Comprehension in Large Vision Language Models. CoRR abs/2403.13263 (2024) - [i22]Yanyuan Qiao, Zheng Yu, Longteng Guo, Sihan Chen, Zijia Zhao, Mingzhen Sun, Qi Wu, Jing Liu:
VL-Mamba: Exploring State Space Models for Multimodal Learning. CoRR abs/2403.13600 (2024) - [i21]Dongze Hao, Qunbo Wang, Longteng Guo, Jie Jiang, Jing Liu:
Boter: Bootstrapping Knowledge Selection and Question Answering for Knowledge-based VQA. CoRR abs/2404.13947 (2024) - [i20]Zijia Zhao, Haoyu Lu, Yuqi Huo, Yifan Du, Tongtian Yue, Longteng Guo, Bingning Wang, Weipeng Chen, Jing Liu:
Needle In A Video Haystack: A Scalable Synthetic Framework for Benchmarking Video MLLMs. CoRR abs/2406.09367 (2024) - [i19]Erdong Hu, Longteng Guo, Tongtian Yue, Zijia Zhao, Shuning Xue, Jing Liu:
OneDiff: A Generalist Model for Image Difference Captioning. CoRR abs/2407.05645 (2024) - [i18]Mingzhen Sun, Weining Wang, Yanyuan Qiao, Jiahui Sun, Zihan Qin, Longteng Guo, Xinxin Zhu, Jing Liu:
MM-LDM: Multi-Modal Latent Diffusion Model for Sounding Video Generation. CoRR abs/2410.01594 (2024) - [i17]Tongtian Yue, Longteng Guo, Jie Cheng, Xuange Gao, Jing Liu:
Ada-K Routing: Boosting the Efficiency of MoE-based LLMs. CoRR abs/2410.10456 (2024) - [i16]Zijia Zhao, Longteng Guo, Tongtian Yue, Erdong Hu, Shuai Shao, Zehuan Yuan, Hua Huang, Jing Liu:
ChatSearch: a Dataset and a Generative Retrieval Model for General Conversational Image Retrieval. CoRR abs/2410.18715 (2024) - [i15]Tongtian Yue, Shuning Xue, Xuange Gao, Yepeng Tang, Longteng Guo, Jie Jiang, Jing Liu:
EEGPT: Unleashing the Potential of EEG Generalist Foundation Model by Autoregressive Pre-training. CoRR abs/2410.19779 (2024) - 2023
- [c12]Shichen Lu, Longteng Guo, Xingjian He, Xinxin Zhu, Jing Liu, Si Liu:
CSDNet: Contrastive Similarity Distillation Network for Multi-lingual Image-Text Retrieval. ICIG (3) 2023: 385-395 - [c11]Zikang Liu, Sihan Chen, Longteng Guo, Handong Li, Xingjian He, Jing Liu:
Enhancing Vision-Language Pre-Training with Jointly Learned Questioner and Dense Captioner. ACM Multimedia 2023: 5120-5131 - [c10]Zijia Zhao, Longteng Guo, Xingjian He, Shuai Shao, Zehuan Yuan, Jing Liu:
MAMO: Fine-Grained Vision-Language Representations Learning with Masked Multimodal Modeling. SIGIR 2023: 1528-1538 - [i14]Sihan Chen, Xingjian He, Longteng Guo, Xinxin Zhu, Weining Wang, Jinhui Tang, Jing Liu:
VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset. CoRR abs/2304.08345 (2023) - [i13]Zikang Liu, Sihan Chen, Longteng Guo, Handong Li, Xingjian He, Jing Liu:
Enhancing Vision-Language Pre-Training with Jointly Learned Questioner and Dense Captioner. CoRR abs/2305.11769 (2023) - [i12]Zijia Zhao, Longteng Guo, Tongtian Yue, Sihan Chen, Shuai Shao, Xinxin Zhu, Zehuan Yuan, Jing Liu:
ChatBridge: Bridging Modalities with Large Language Model as a Language Catalyst. CoRR abs/2305.16103 (2023) - [i11]Junyi Chen, Longteng Guo, Jia Sun, Shuai Shao, Zehuan Yuan, Liang Lin, Dongyu Zhang:
EVE: Efficient Vision-Language Pre-training with Masked Prediction and Modality-Aware MoE. CoRR abs/2308.11971 (2023) - [i10]Wenxuan Wang, Tongtian Yue, Yisi Zhang, Longteng Guo, Xingjian He, Xinlong Wang, Jing Liu:
Unveiling Parts Beyond Objects: Towards Finer-Granularity Referring Expression Segmentation. CoRR abs/2312.08007 (2023) - 2022
- [i9]Zijia Zhao, Longteng Guo, Xingjian He, Shuai Shao, Zehuan Yuan, Jing Liu:
MAMO: Masked Multimodal Modeling for Fine-Grained Vision-Language Representation Learning. CoRR abs/2210.04183 (2022) - 2021
- [c9]Longteng Guo, Danni Ai, Hong Song, Jian Yang:
Multi-scale Landmark Localization Network for 3D Facial Point Clouds. ICDSP 2021: 86-93 - [c8]Wenzhu Wu, Weining Wang, Longteng Guo, Jing Liu:
Keypoint Context Aggregation for Human Pose Estimation. ICIG (2) 2021: 386-396 - [c7]Sihan Chen, Xinxin Zhu, Dongze Hao, Wei Liu, Jiawei Liu, Zijia Zhao, Longteng Guo, Jing Liu:
MM21 Pre-training for Video Understanding Challenge: Video Captioning with Pretraining Techniques. ACM Multimedia 2021: 4853-4857 - [i8]Longteng Guo, Jing Liu, Xinxin Zhu, Hanqing Lu:
Fast Sequence Generation with Multi-Agent Reinforcement Learning. CoRR abs/2101.09698 (2021) - [i7]Wei Liu, Sihan Chen, Longteng Guo, Xinxin Zhu, Jing Liu:
CPTR: Full Transformer Network for Image Captioning. CoRR abs/2101.10804 (2021) - [i6]Jing Liu, Xinxin Zhu, Fei Liu, Longteng Guo, Zijia Zhao, Mingzhen Sun, Weining Wang, Hanqing Lu, Shiyu Zhou, Jiajun Zhang, Jinqiao Wang:
OPT: Omni-Perception Pre-Trainer for Cross-Modal Understanding and Generation. CoRR abs/2107.00249 (2021) - 2020
- [j1]Longteng Guo, Jing Liu, Shichen Lu, Hanqing Lu:
Show, Tell, and Polish: Ruminant Decoding for Image Captioning. IEEE Trans. Multim. 22(8): 2149-2162 (2020) - [c6]Longteng Guo, Jing Liu, Xinxin Zhu, Peng Yao, Shichen Lu, Hanqing Lu:
Normalized and Geometry-Aware Self-Attention Network for Image Captioning. CVPR 2020: 10324-10333 - [c5]Peng Yao, Jiangyun Li, Longteng Guo, Jing Liu:
Modeling Local and Global Contexts for Image Captioning. ICME 2020: 1-6 - [c4]Longteng Guo, Jing Liu, Xinxin Zhu, Xingjian He, Jie Jiang, Hanqing Lu:
Non-Autoregressive Image Captioning with Counterfactuals-Critical Multi-Agent Learning. IJCAI 2020: 767-773 - [i5]Longteng Guo, Jing Liu, Xinxin Zhu, Peng Yao, Shichen Lu, Hanqing Lu:
Normalized and Geometry-Aware Self-Attention Network for Image Captioning. CoRR abs/2003.08897 (2020) - [i4]Longteng Guo, Jing Liu, Xinxin Zhu, Xingjian He, Jie Jiang, Hanqing Lu:
Non-Autoregressive Image Captioning with Counterfactuals-Critical Multi-Agent Learning. CoRR abs/2005.04690 (2020) - [i3]Xinxin Zhu, Weining Wang, Longteng Guo, Jing Liu:
AutoCaption: Image Captioning with Neural Architecture Search. CoRR abs/2012.09742 (2020)
2010 – 2019
- 2019
- [c3]Longteng Guo, Jing Liu, Peng Yao, Jiangwei Li, Hanqing Lu:
MSCap: Multi-Style Image Captioning With Unpaired Stylized Text. CVPR 2019: 4204-4213 - [c2]Longteng Guo, Jing Liu, Jinhui Tang, Jiangwei Li, Wei Luo, Hanqing Lu:
Aligning Linguistic Words and Visual Semantic Units for Image Captioning. ACM Multimedia 2019: 765-773 - [i2]Longteng Guo, Jing Liu, Jinhui Tang, Jiangwei Li, Wei Luo, Hanqing Lu:
Aligning Linguistic Words and Visual Semantic Units for Image Captioning. CoRR abs/1908.02127 (2019) - [i1]Xinxin Zhu, Longteng Guo, Peng Yao, Jing Liu, Shichen Lu, Zheng Yu, Wei Liu, Hanqing Lu:
Multi-View Features and Hybrid Reward Strategies for Vatex Video Captioning Challenge 2019. CoRR abs/1910.11102 (2019) - 2017
- [c1]Longteng Guo, Jing Liu, Yuhang Wang, Zhonghua Luo, Wei Wen, Hanqing Lu:
Sketch-based Image Retrieval using Generative Adversarial Networks. ACM Multimedia 2017: 1267-1268
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2025-01-27 00:48 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint