Linjie Li
2020 – today
- 2024
- [j6] Zhenyu Wu, Juchuan Guo, Yichen Liu, Linjie Li, Yang Ji: An Iterative Resampling Deep Decoupling Domain Adaptation method for class-imbalance bearing fault diagnosis under variant working conditions. Expert Syst. Appl. 252: 124240 (2024)
- [j5] Chunyuan Li, Zhe Gan, Zhengyuan Yang, Jianwei Yang, Linjie Li, Lijuan Wang, Jianfeng Gao: Multimodal Foundation Models: From Specialists to General-Purpose Assistants. Found. Trends Comput. Graph. Vis. 16(1-2): 1-214 (2024)
- [c46] Jaemin Cho, Linjie Li, Zhengyuan Yang, Zhe Gan, Lijuan Wang, Mohit Bansal: Diagnostic Benchmark and Iterative Inpainting for Layout-Guided Image Generation. CVPR Workshops 2024: 5280-5289
- [c45] Tan Wang, Linjie Li, Kevin Lin, Yuanhao Zhai, Chung-Ching Lin, Zhengyuan Yang, Hanwang Zhang, Zicheng Liu, Lijuan Wang: Disco: Disentangled Control for Realistic Human Dance Generation. CVPR 2024: 9326-9336
- [c44] Chaoyi Zhang, Kevin Lin, Zhengyuan Yang, Jianfeng Wang, Linjie Li, Chung-Ching Lin, Zicheng Liu, Lijuan Wang: MM-Narrator: Narrating Long-form Videos with Multimodal In-Context Learning. CVPR 2024: 13647-13657
- [c43] Jielin Qiu, Jiacheng Zhu, William Han, Aditesh Kumar, Karthik Mittal, Claire Jin, Zhengyuan Yang, Linjie Li, Jianfeng Wang, Ding Zhao, Bo Li, Lijuan Wang: MMSum: A Dataset for Multimodal Summarization and Thumbnail Generation of Videos. CVPR 2024: 21909-21921
- [c42] Yuanhao Zhai, Kevin Lin, Linjie Li, Chung-Ching Lin, Jianfeng Wang, Zhengyuan Yang, David S. Doermann, Junsong Yuan, Zicheng Liu, Lijuan Wang: IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation. ECCV (15) 2024: 134-152
- [c41] Zhengyuan Yang, Jianfeng Wang, Linjie Li, Kevin Lin, Chung-Ching Lin, Zicheng Liu, Lijuan Wang: Idea2Img: Iterative Self-refinement with GPT-4V for Automatic Image Design and Generation. ECCV (38) 2024: 167-184
- [c40] Linjie Li, Haitao Chang, Yongjia Xu, Shuo Zhao, Panfeng Huang, Zhengxiong Liu: Enhancing Human-to-Robot Skill Transfer: A Framework Integrating Movement and Variable Impedance Based on EMG. ICIT 2024: 1-6
- [c39] Fuxiao Liu, Kevin Lin, Linjie Li, Jianfeng Wang, Yaser Yacoob, Lijuan Wang: Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning. ICLR 2024
- [c38] Peter West, Ximing Lu, Nouha Dziri, Faeze Brahman, Linjie Li, Jena D. Hwang, Liwei Jiang, Jillian Fisher, Abhilasha Ravichander, Khyathi Raghavi Chandu, Benjamin Newman, Pang Wei Koh, Allyson Ettinger, Yejin Choi: The Generative AI Paradox: "What It Can Create, It May Not Understand". ICLR 2024
- [c37] Weihao Yu, Zhengyuan Yang, Linjie Li, Jianfeng Wang, Kevin Lin, Zicheng Liu, Xinchao Wang, Lijuan Wang: MM-Vet: Evaluating Large Multimodal Models for Integrated Capabilities. ICML 2024
- [c36] Jie An, Zhengyuan Yang, Jianfeng Wang, Linjie Li, Zicheng Liu, Lijuan Wang, Jiebo Luo: Bring Metric Functions into Diffusion Models. IJCAI 2024: 578-586
- [c35] Jie An, Zhengyuan Yang, Linjie Li, Jianfeng Wang, Kevin Lin, Zicheng Liu, Lijuan Wang, Jiebo Luo: OpenLEAF: A Novel Benchmark for Open-Domain Interleaved Image-Text Generation. ACM Multimedia 2024: 11137-11145
- [i67] Alex Jinpeng Wang, Linjie Li, Kevin Qinghong Lin, Jianfeng Wang, Kevin Lin, Zhengyuan Yang, Lijuan Wang, Mike Zheng Shou: COSMO: COntrastive Streamlined MultimOdal Model with Interleaved Pre-Training. CoRR abs/2401.00849 (2024)
- [i66] Jie An, Zhengyuan Yang, Jianfeng Wang, Linjie Li, Zicheng Liu, Lijuan Wang, Jiebo Luo: Bring Metric Functions into Diffusion Models. CoRR abs/2401.02414 (2024)
- [i65] Linjie Li, Six Liu, Zhenyu Wu, Ji Yang: TaE: Task-aware Expandable Representation for Long Tail Class Incremental Learning. CoRR abs/2402.05797 (2024)
- [i64] Jielin Qiu, William Han, Winfred Wang, Zhengyuan Yang, Linjie Li, Jianfeng Wang, Christos Faloutsos, Lei Li, Lijuan Wang: Entity6K: A Large Open-Domain Evaluation Dataset for Real-World Entity Recognition. CoRR abs/2403.12339 (2024)
- [i63] An Yan, Zhengyuan Yang, Junda Wu, Wanrong Zhu, Jianwei Yang, Linjie Li, Kevin Lin, Jianfeng Wang, Julian J. McAuley, Jianfeng Gao, Lijuan Wang: List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs. CoRR abs/2404.16375 (2024)
- [i62] Alex Jinpeng Wang, Linjie Li, Yiqi Lin, Min Li, Lijuan Wang, Mike Zheng Shou: Leveraging Visual Tokens for Extended Text Contexts in Multi-Modal Learning. CoRR abs/2406.02547 (2024)
- [i61] Yuanhao Zhai, Kevin Lin, Zhengyuan Yang, Linjie Li, Jianfeng Wang, Chung-Ching Lin, David S. Doermann, Junsong Yuan, Lijuan Wang: Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillation. CoRR abs/2406.06890 (2024)
- [i60] Xuehai He, Weixi Feng, Kaizhi Zheng, Yujie Lu, Wanrong Zhu, Jiachen Li, Yue Fan, Jianfeng Wang, Linjie Li, Zhengyuan Yang, Kevin Lin, William Yang Wang, Lijuan Wang, Xin Eric Wang: MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos. CoRR abs/2406.08407 (2024)
- [i59] Kevin Qinghong Lin, Linjie Li, Difei Gao, Qinchen Wu, Mingyi Yan, Zhengyuan Yang, Lijuan Wang, Mike Zheng Shou: VideoGUI: A Benchmark for GUI Automation from Instructional Videos. CoRR abs/2406.10227 (2024)
- [i58] Khyathi Raghavi Chandu, Linjie Li, Anas Awadalla, Ximing Lu, Jae Sung Park, Jack Hessel, Lijuan Wang, Yejin Choi: Certainly Uncertain: A Benchmark and Metric for Multimodal Epistemic and Aleatoric Awareness. CoRR abs/2407.01942 (2024)
- [i57] Yuanhao Zhai, Kevin Lin, Linjie Li, Chung-Ching Lin, Jianfeng Wang, Zhengyuan Yang, David S. Doermann, Junsong Yuan, Zicheng Liu, Lijuan Wang: IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation. CoRR abs/2407.10937 (2024)
- [i56] Weihao Yu, Zhengyuan Yang, Linfeng Ren, Linjie Li, Jianfeng Wang, Kevin Lin, Chung-Ching Lin, Zicheng Liu, Lijuan Wang, Xinchao Wang: MM-Vet v2: A Challenging Benchmark to Evaluate Large Multimodal Models for Integrated Capabilities. CoRR abs/2408.00765 (2024)
- [i55] Peng Xia, Siwei Han, Shi Qiu, Yiyang Zhou, Zhaoyang Wang, Wenhao Zheng, Zhaorun Chen, Chenhang Cui, Mingyu Ding, Linjie Li, Lijuan Wang, Huaxiu Yao: MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models. CoRR abs/2410.10139 (2024)
- [i54] Kaizhi Zheng, Xiaotong Chen, Xuehai He, Jing Gu, Linjie Li, Zhengyuan Yang, Kevin Lin, Jianfeng Wang, Lijuan Wang, Xin Eric Wang: EditRoom: LLM-parameterized Graph Diffusion for Composable 3D Room Layout Editing. CoRR abs/2410.12836 (2024)
- [i53] Yuyang Zhao, Chung-Ching Lin, Kevin Lin, Zhiwen Yan, Linjie Li, Zhengyuan Yang, Jianfeng Wang, Gim Hee Lee, Lijuan Wang: GenXD: Generating Any 3D and 4D Scenes. CoRR abs/2411.02319 (2024)
- [i52] Qin Liu, Jianfeng Wang, Zhengyuan Yang, Linjie Li, Kevin Lin, Marc Niethammer, Lijuan Wang: LiVOS: Light Video Object Segmentation with Gated Linear Matching. CoRR abs/2411.02818 (2024)
- [i51] Kevin Qinghong Lin, Linjie Li, Difei Gao, Zhengyuan Yang, Shiwei Wu, Zechen Bai, Weixian Lei, Lijuan Wang, Mike Zheng Shou: ShowUI: One Vision-Language-Action Model for GUI Visual Agent. CoRR abs/2411.17465 (2024)
- [i50] Xiyao Wang, Zhengyuan Yang, Linjie Li, Hongjin Lu, Yuancheng Xu, Chung-Ching Lin, Kevin Lin, Furong Huang, Lijuan Wang: Scaling Inference-Time Search with Vision Value Model for Improved Visual Comprehension. CoRR abs/2412.03704 (2024)
- 2023
- [c34] Shengming Yin, Chenfei Wu, Huan Yang, Jianfeng Wang, Xiaodong Wang, Minheng Ni, Zhengyuan Yang, Linjie Li, Shuguang Liu, Fan Yang, Jianlong Fu, Ming Gong, Lijuan Wang, Zicheng Liu, Houqiang Li, Nan Duan: NUWA-XL: Diffusion over Diffusion for eXtremely Long Video Generation. ACL (1) 2023: 1309-1320
- [c33] Chung-Ching Lin, Jiang Wang, Kun Luo, Kevin Lin, Linjie Li, Lijuan Wang, Zicheng Liu: Adaptive Human Matting for Dynamic Videos. CVPR 2023: 10229-10238
- [c32] Zhengyuan Yang, Jianfeng Wang, Zhe Gan, Linjie Li, Kevin Lin, Chenfei Wu, Nan Duan, Zicheng Liu, Ce Liu, Michael Zeng, Lijuan Wang: ReCo: Region-Controlled Text-to-Image Generation. CVPR 2023: 14246-14255
- [c31] Xueyan Zou, Zi-Yi Dou, Jianwei Yang, Zhe Gan, Linjie Li, Chunyuan Li, Xiyang Dai, Harkirat Behl, Jianfeng Wang, Lu Yuan, Nanyun Peng, Lijuan Wang, Yong Jae Lee, Jianfeng Gao: Generalized Decoding for Pixel, Image, and Language. CVPR 2023: 15116-15127
- [c30] Tsu-Jui Fu, Linjie Li, Zhe Gan, Kevin Lin, William Yang Wang, Lijuan Wang, Zicheng Liu: An Empirical Study of End-to-End Video-Language Transformers with Masked Visual Modeling. CVPR 2023: 22898-22909
- [c29] Linjie Li, Zhe Gan, Kevin Lin, Chung-Ching Lin, Zicheng Liu, Ce Liu, Lijuan Wang: LAVENDER: Unifying Video-Language Understanding as Masked Language Modeling. CVPR 2023: 23119-23129
- [c28] Yi-Lin Sung, Linjie Li, Kevin Lin, Zhe Gan, Mohit Bansal, Lijuan Wang: An Empirical Study of Multimodal Model Merging. EMNLP (Findings) 2023: 1563-1575
- [c27] Tan Wang, Kevin Lin, Linjie Li, Chung-Ching Lin, Zhengyuan Yang, Hanwang Zhang, Zicheng Liu, Lijuan Wang: Equivariant Similarity for Vision-Language Foundation Models. ICCV 2023: 11964-11974
- [c26] Xiaodong Wang, Chenfei Wu, Shengming Yin, Minheng Ni, Jianfeng Wang, Linjie Li, Zhengyuan Yang, Fan Yang, Lijuan Wang, Zicheng Liu, Yuejian Fang, Nan Duan: Learning 3D Photography Videos via Self-supervised Diffusion on Single Images. IJCAI 2023: 1506-1514
- [c25] Xueyan Zou, Jianwei Yang, Hao Zhang, Feng Li, Linjie Li, Jianfeng Wang, Lijuan Wang, Jianfeng Gao, Yong Jae Lee: Segment Everything Everywhere All at Once. NeurIPS 2023
- [i49] Xiaodong Wang, Chenfei Wu, Shengming Yin, Minheng Ni, Jianfeng Wang, Linjie Li, Zhengyuan Yang, Fan Yang, Lijuan Wang, Zicheng Liu, Yuejian Fang, Nan Duan: Learning 3D Photography Videos via Self-supervised Diffusion on Single Images. CoRR abs/2302.10781 (2023)
- [i48] Zhengyuan Yang, Linjie Li, Jianfeng Wang, Kevin Lin, Ehsan Azarnasab, Faisal Ahmed, Zicheng Liu, Ce Liu, Michael Zeng, Lijuan Wang: MM-REACT: Prompting ChatGPT for Multimodal Reasoning and Action. CoRR abs/2303.11381 (2023)
- [i47] Shengming Yin, Chenfei Wu, Huan Yang, Jianfeng Wang, Xiaodong Wang, Minheng Ni, Zhengyuan Yang, Linjie Li, Shuguang Liu, Fan Yang, Jianlong Fu, Gong Ming, Lijuan Wang, Zicheng Liu, Houqiang Li, Nan Duan: NUWA-XL: Diffusion over Diffusion for eXtremely Long Video Generation. CoRR abs/2303.12346 (2023)
- [i46] Tan Wang, Kevin Lin, Linjie Li, Chung-Ching Lin, Zhengyuan Yang, Hanwang Zhang, Zicheng Liu, Lijuan Wang: Equivariant Similarity for Vision-Language Foundation Models. CoRR abs/2303.14465 (2023)
- [i45] Chung-Ching Lin, Jiang Wang, Kun Luo, Kevin Lin, Linjie Li, Lijuan Wang, Zicheng Liu: Adaptive Human Matting for Dynamic Videos. CoRR abs/2304.06018 (2023)
- [i44] Jaemin Cho, Linjie Li, Zhengyuan Yang, Zhe Gan, Lijuan Wang, Mohit Bansal: Diagnostic Benchmark and Iterative Inpainting for Layout-Guided Image Generation. CoRR abs/2304.06671 (2023)
- [i43] Xueyan Zou, Jianwei Yang, Hao Zhang, Feng Li, Linjie Li, Jianfeng Gao, Yong Jae Lee: Segment Everything Everywhere All at Once. CoRR abs/2304.06718 (2023)
- [i42] Yi-Lin Sung, Linjie Li, Kevin Lin, Zhe Gan, Mohit Bansal, Lijuan Wang: An Empirical Study of Multimodal Model Merging. CoRR abs/2304.14933 (2023)
- [i41] Jielin Qiu, Jiacheng Zhu, William Han, Aditesh Kumar, Karthik Mittal, Claire Jin, Zhengyuan Yang, Linjie Li, Jianfeng Wang, Bo Li, Ding Zhao, Lijuan Wang: MultiSum: A Dataset for Multimodal Summarization and Thumbnail Generation of Videos. CoRR abs/2306.04216 (2023)
- [i40] Fuxiao Liu, Kevin Lin, Linjie Li, Jianfeng Wang, Yaser Yacoob, Lijuan Wang: Aligning Large Multi-Modal Model with Robust Instruction Tuning. CoRR abs/2306.14565 (2023)
- [i39] Tan Wang, Linjie Li, Kevin Lin, Chung-Ching Lin, Zhengyuan Yang, Hanwang Zhang, Zicheng Liu, Lijuan Wang: DisCo: Disentangled Control for Referring Human Dance Generation in Real World. CoRR abs/2307.00040 (2023)
- [i38] Xin Yuan, Linjie Li, Jianfeng Wang, Zhengyuan Yang, Kevin Lin, Zicheng Liu, Lijuan Wang: Spatial-Frequency U-Net for Denoising Diffusion Probabilistic Models. CoRR abs/2307.14648 (2023)
- [i37] Weihao Yu, Zhengyuan Yang, Linjie Li, Jianfeng Wang, Kevin Lin, Zicheng Liu, Xinchao Wang, Lijuan Wang: MM-Vet: Evaluating Large Multimodal Models for Integrated Capabilities. CoRR abs/2308.02490 (2023)
- [i36] Chunyuan Li, Zhe Gan, Zhengyuan Yang, Jianwei Yang, Linjie Li, Lijuan Wang, Jianfeng Gao: Multimodal Foundation Models: From Specialists to General-Purpose Assistants. CoRR abs/2309.10020 (2023)
- [i35] Zhengyuan Yang, Linjie Li, Kevin Lin, Jianfeng Wang, Chung-Ching Lin, Zicheng Liu, Lijuan Wang: The Dawn of LMMs: Preliminary Explorations with GPT-4V(ision). CoRR abs/2309.17421 (2023)
- [i34] Jie An, Zhengyuan Yang, Linjie Li, Jianfeng Wang, Kevin Lin, Zicheng Liu, Lijuan Wang, Jiebo Luo: OpenLEAF: Open-Domain Interleaved Image-Text Generation and Evaluation. CoRR abs/2310.07749 (2023)
- [i33] Zhengyuan Yang, Jianfeng Wang, Linjie Li, Kevin Lin, Chung-Ching Lin, Zicheng Liu, Lijuan Wang: Idea2Img: Iterative Self-Refinement with GPT-4V(ision) for Automatic Image Design and Generation. CoRR abs/2310.08541 (2023)
- [i32] Kevin Lin, Zhengyuan Yang, Linjie Li, Jianfeng Wang, Lijuan Wang: DEsignBench: Exploring and Benchmarking DALL-E 3 for Imagining Visual Design. CoRR abs/2310.15144 (2023)
- [i31] Kevin Lin, Faisal Ahmed, Linjie Li, Chung-Ching Lin, Ehsan Azarnasab, Zhengyuan Yang, Jianfeng Wang, Lin Liang, Zicheng Liu, Yumao Lu, Ce Liu, Lijuan Wang: MM-VID: Advancing Video Understanding with GPT-4V(ision). CoRR abs/2310.19773 (2023)
- [i30] Peter West, Ximing Lu, Nouha Dziri, Faeze Brahman, Linjie Li, Jena D. Hwang, Liwei Jiang, Jillian Fisher, Abhilasha Ravichander, Khyathi Raghavi Chandu, Benjamin Newman, Pang Wei Koh, Allyson Ettinger, Yejin Choi: The Generative AI Paradox: "What It Can Create, It May Not Understand". CoRR abs/2311.00059 (2023)
- [i29] An Yan, Zhengyuan Yang, Wanrong Zhu, Kevin Lin, Linjie Li, Jianfeng Wang, Jianwei Yang, Yiwu Zhong, Julian J. McAuley, Jianfeng Gao, Zicheng Liu, Lijuan Wang: GPT-4V in Wonderland: Large Multimodal Models for Zero-Shot Smartphone GUI Navigation. CoRR abs/2311.07562 (2023)
- [i28] Chaoyi Zhang, Kevin Lin, Zhengyuan Yang, Jianfeng Wang, Linjie Li, Chung-Ching Lin, Zicheng Liu, Lijuan Wang: MM-Narrator: Narrating Long-form Videos with Multimodal In-Context Learning. CoRR abs/2311.17435 (2023)
- [i27] Xueyan Zou, Linjie Li, Jianfeng Wang, Jianwei Yang, Mingyu Ding, Zhengyuan Yang, Feng Li, Hao Zhang, Shilong Liu, Arul Aravinthan, Yong Jae Lee, Lijuan Wang: Interfacing Foundation Models' Embeddings. CoRR abs/2312.07532 (2023)
- 2022
- [j4] Zhe Gan, Linjie Li, Chunyuan Li, Lijuan Wang, Zicheng Liu, Jianfeng Gao: Vision-Language Pre-Training: Basics, Recent Advances, and Future Trends. Found. Trends Comput. Graph. Vis. 14(3-4): 163-352 (2022)
- [j3] Ning Zhang, Lingran Zhang, Linjie Li, Junyou Geng, Lei Zhao, Yan Ren, Zhongdong Dong, Feng Chen: Global Profiling of 2-hydroxyisobutyrylome in Common Wheat. Genom. Proteom. Bioinform. 20(4): 688-701 (2022)
- [j2] Jianfeng Wang, Zhengyuan Yang, Xiaowei Hu, Linjie Li, Kevin Lin, Zhe Gan, Zicheng Liu, Ce Liu, Lijuan Wang: GIT: A Generative Image-to-text Transformer for Vision and Language. Trans. Mach. Learn. Res. 2022 (2022)
- [c24] Zhe Gan, Yen-Chun Chen, Linjie Li, Tianlong Chen, Yu Cheng, Shuohang Wang, Jingjing Liu, Lijuan Wang, Zicheng Liu: Playing Lottery Tickets with Vision and Language. AAAI 2022: 652-660
- [c23] Linjie Li, Zhenyu Wu, Jiaming Liu, Yang Ji: TaE: Task-Aware Expandable Representation for Long Tail Class Incremental Learning. ACCV (4) 2022: 335-351
- [c22] Linjie Li, Yi Xiao, Dewei Ma, Kai Zheng: PREVAIL: Pre-trained Variational Adversarial Active Learning for Molecular Property Prediction. CCIS 2022: 143-149
- [c21] Kevin Lin, Linjie Li, Chung-Ching Lin, Faisal Ahmed, Zhe Gan, Zicheng Liu, Yumao Lu, Lijuan Wang: SwinBERT: End-to-End Transformers with Sparse Attention for Video Captioning. CVPR 2022: 17928-17937
- [c20] Chung-Ching Lin, Kevin Lin, Lijuan Wang, Zicheng Liu, Linjie Li: Crossmodal Representation Learning for Zero-shot Action Recognition. CVPR 2022: 19946-19956
- [c19] Xinyu Men, Yubo Li, Yihuang Zeng, Linjie Li: Multiple Z-Complementary Code Sets With Low Inter-Set Cross-Correlation. IWSDA 2022: 1-5
- [c18] Zi-Yi Dou, Aishwarya Kamath, Zhe Gan, Pengchuan Zhang, Jianfeng Wang, Linjie Li, Zicheng Liu, Ce Liu, Yann LeCun, Nanyun Peng, Jianfeng Gao, Lijuan Wang: Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone. NeurIPS 2022
- [i26] Chung-Ching Lin, Kevin Lin, Linjie Li, Lijuan Wang, Zicheng Liu: Cross-modal Representation Learning for Zero-shot Action Recognition. CoRR abs/2205.01657 (2022)
- [i25] Jianfeng Wang, Zhengyuan Yang, Xiaowei Hu, Linjie Li, Kevin Lin, Zhe Gan, Zicheng Liu, Ce Liu, Lijuan Wang: GIT: A Generative Image-to-text Transformer for Vision and Language. CoRR abs/2205.14100 (2022)
- [i24] Linjie Li, Zhe Gan, Kevin Lin, Chung-Ching Lin, Zicheng Liu, Ce Liu, Lijuan Wang: LAVENDER: Unifying Video-Language Understanding as Masked Language Modeling. CoRR abs/2206.07160 (2022)
- [i23] Zi-Yi Dou, Aishwarya Kamath, Zhe Gan, Pengchuan Zhang, Jianfeng Wang, Linjie Li, Zicheng Liu, Ce Liu, Yann LeCun, Nanyun Peng, Jianfeng Gao, Lijuan Wang: Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone. CoRR abs/2206.07643 (2022)
- [i22] Tsu-Jui Fu, Linjie Li, Zhe Gan, Kevin Lin, William Yang Wang, Lijuan Wang, Zicheng Liu: An Empirical Study of End-to-End Video-Language Transformers with Masked Visual Modeling. CoRR abs/2209.01540 (2022)
- [i21] Zhe Gan, Linjie Li, Chunyuan Li, Lijuan Wang, Zicheng Liu, Jianfeng Gao: Vision-Language Pre-training: Basics, Recent Advances, and Future Trends. CoRR abs/2210.09263 (2022)
- [i20] Zhengyuan Yang, Jianfeng Wang, Zhe Gan, Linjie Li, Kevin Lin, Chenfei Wu, Nan Duan, Zicheng Liu, Ce Liu, Michael Zeng, Lijuan Wang: ReCo: Region-Controlled Text-to-Image Generation. CoRR abs/2211.15518 (2022)
- [i19] Xueyan Zou, Zi-Yi Dou, Jianwei Yang, Zhe Gan, Linjie Li, Chunyuan Li, Xiyang Dai, Harkirat Behl, Jianfeng Wang, Lu Yuan, Nanyun Peng, Lijuan Wang, Yong Jae Lee, Jianfeng Gao: Generalized Decoding for Pixel, Image, and Language. CoRR abs/2212.11270 (2022)
- 2021
- [c17] Mingyang Zhou, Luowei Zhou, Shuohang Wang, Yu Cheng, Linjie Li, Zhou Yu, Jingjing Liu: UC2: Universal Cross-Lingual Cross-Modal Vision-and-Language Pre-Training. CVPR 2021: 4155-4165
- [c16] Jie Lei, Linjie Li, Luowei Zhou, Zhe Gan, Tamara L. Berg, Mohit Bansal, Jingjing Liu: Less Is More: ClipBERT for Video-and-Language Learning via Sparse Sampling. CVPR 2021: 7331-7341
- [c15] Linjie Li, Jie Lei, Zhe Gan, Jingjing Liu: Adversarial VQA: A New Benchmark for Evaluating the Robustness of VQA Models. ICCV 2021: 2022-2031
- [c14] Siqi Sun, Yen-Chun Chen, Linjie Li, Shuohang Wang, Yuwei Fang, Jingjing Liu: LightningDOT: Pre-training Visual-Semantic Embeddings for Real-Time Image-Text Retrieval. NAACL-HLT 2021: 982-997
- [c13] Linjie Li, Jie Lei, Zhe Gan, Licheng Yu, Yen-Chun Chen, Rohit Pillai, Yu Cheng, Luowei Zhou, Xin Wang, William Yang Wang, Tamara L. Berg, Mohit Bansal, Jingjing Liu, Lijuan Wang, Zicheng Liu: VALUE: A Multi-Task Benchmark for Video-and-Language Understanding Evaluation. NeurIPS Datasets and Benchmarks 2021
- [c12] Wenhu Chen, Zhe Gan, Linjie Li, Yu Cheng, William Yang Wang, Jingjing Liu: Meta Module Network for Compositional Visual Reasoning. WACV 2021: 655-664
- [i18] Jie Lei, Linjie Li, Luowei Zhou, Zhe Gan, Tamara L. Berg, Mohit Bansal, Jingjing Liu: Less is More: ClipBERT for Video-and-Language Learning via Sparse Sampling. CoRR abs/2102.06183 (2021)
- [i17] Siqi Sun, Yen-Chun Chen, Linjie Li, Shuohang Wang, Yuwei Fang, Jingjing Liu: LightningDOT: Pre-training Visual-Semantic Embeddings for Real-Time Image-Text Retrieval. CoRR abs/2103.08784 (2021)
- [i16] Mingyang Zhou, Luowei Zhou, Shuohang Wang, Yu Cheng, Linjie Li, Zhou Yu, Jingjing Liu: UC2: Universal Cross-lingual Cross-modal Vision-and-Language Pre-training. CoRR abs/2104.00332 (2021)
- [i15] Zhe Gan, Yen-Chun Chen, Linjie Li, Tianlong Chen, Yu Cheng, Shuohang Wang, Jingjing Liu: Playing Lottery Tickets with Vision and Language. CoRR abs/2104.11832 (2021)
- [i14] Linjie Li, Jie Lei, Zhe Gan, Jingjing Liu: Adversarial VQA: A New Benchmark for Evaluating the Robustness of VQA Models. CoRR abs/2106.00245 (2021)
- [i13] Linjie Li, Jie Lei, Zhe Gan, Licheng Yu, Yen-Chun Chen, Rohit Pillai, Yu Cheng, Luowei Zhou, Xin Eric Wang, William Yang Wang, Tamara Lee Berg, Mohit Bansal, Jingjing Liu, Lijuan Wang, Zicheng Liu: VALUE: A Multi-Task Benchmark for Video-and-Language Understanding Evaluation. CoRR abs/2106.04632 (2021)
- [i12] Tsu-Jui Fu, Linjie Li, Zhe Gan, Kevin Lin, William Yang Wang, Lijuan Wang, Zicheng Liu: VIOLET : End-to-End Video-Language Transformers with Masked Visual-token Modeling. CoRR abs/2111.12681 (2021)
- [i11] Kevin Lin, Linjie Li, Chung-Ching Lin, Faisal Ahmed, Zhe Gan, Zicheng Liu, Yumao Lu, Lijuan Wang: SwinBERT: End-to-End Transformers with Sparse Attention for Video Captioning. CoRR abs/2111.13196 (2021)
- [i10] Yixin Nie, Linjie Li, Zhe Gan, Shuohang Wang, Chenguang Zhu, Michael Zeng, Zicheng Liu, Mohit Bansal, Lijuan Wang: MLP Architectures for Vision-and-Language Modeling: An Empirical Study. CoRR abs/2112.04453 (2021)
- 2020
- [j1] Linjie Li, Mian Zhang, Kesheng Wang: A Fault Diagnostic Scheme Based on Capsule Network for Rolling Bearing under Different Rotational Speeds. Sensors 20(7): 1841 (2020)
- [c11] Shicheng Zheng, Wanguo Li, Yongling Fu, Chang Du, Chen Li, Linjie Li: Analysis of Vibration Characteristics of Rolling Linear Guides. AIAM 2020: 318-325
- [c10] Yen-Chun Chen, Linjie Li, Licheng Yu, Ahmed El Kholy, Faisal Ahmed, Zhe Gan, Yu Cheng, Jingjing Liu: UNITER: UNiversal Image-TExt Representation Learning. ECCV (30) 2020: 104-120
- [c9] Linjie Li, Yen-Chun Chen, Yu Cheng, Zhe Gan, Licheng Yu, Jingjing Liu: HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training. EMNLP (1) 2020: 2046-2065
- [c8] Liqun Chen, Zhe Gan, Yu Cheng, Linjie Li, Lawrence Carin, Jingjing Liu: Graph Optimal Transport for Cross-Domain Alignment. ICML 2020: 1542-1553
- [c7] Zhe Gan, Yen-Chun Chen, Linjie Li, Chen Zhu, Yu Cheng, Jingjing Liu: Large-Scale Adversarial Training for Vision-and-Language Representation Learning. NeurIPS 2020
- [i9] Linjie Li, Yen-Chun Chen, Yu Cheng, Zhe Gan, Licheng Yu, Jingjing Liu: HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training. CoRR abs/2005.00200 (2020)
- [i8] Zhe Gan, Yen-Chun Chen, Linjie Li, Chen Zhu, Yu Cheng, Jingjing Liu: Large-Scale Adversarial Training for Vision-and-Language Representation Learning. CoRR abs/2006.06195 (2020)
- [i7] Liqun Chen, Zhe Gan, Yu Cheng, Linjie Li, Lawrence Carin, Jingjing Liu: Graph Optimal Transport for Cross-Domain Alignment. CoRR abs/2006.14744 (2020)
- [i6] Linjie Li, Zhe Gan, Jingjing Liu: A Closer Look at the Robustness of Vision-and-Language Pre-trained Models. CoRR abs/2012.08673 (2020)
2010 – 2019
- 2019
- [c6] Zhe Gan, Yu Cheng, Ahmed El Kholy, Linjie Li, Jingjing Liu, Jianfeng Gao: Multi-step Reasoning via Recurrent Dual Attention for Visual Dialog. ACL (1) 2019: 6463-6474
- [c5] Linjie Li, Zhe Gan, Yu Cheng, Jingjing Liu: Relation-Aware Graph Attention Network for Visual Question Answering. ICCV 2019: 10312-10321
- [c4] Linjie Li, Guangxin Wang, Lili Zhu, Weidong He: Configuration Design and Simulation of Novel Petal Tooth Nutation Joint Drive for Robot. ICIRA (1) 2019: 652-663
- [i5] Zhe Gan, Yu Cheng, Ahmed El Kholy, Linjie Li, Jingjing Liu, Jianfeng Gao: Multi-step Reasoning via Recurrent Dual Attention for Visual Dialog. CoRR abs/1902.00579 (2019)
- [i4] Linjie Li, Zhe Gan, Yu Cheng, Jingjing Liu: Relation-aware Graph Attention Network for Visual Question Answering. CoRR abs/1903.12314 (2019)
- [i3] Yen-Chun Chen, Linjie Li, Licheng Yu, Ahmed El Kholy, Faisal Ahmed, Zhe Gan, Yu Cheng, Jingjing Liu: UNITER: Learning UNiversal Image-TExt Representations. CoRR abs/1909.11740 (2019)
- [i2] Wenhu Chen, Zhe Gan, Linjie Li, Yu Cheng, William Yang Wang, Jingjing Liu: Meta Module Network for Compositional Visual Reasoning. CoRR abs/1910.03230 (2019)
- 2017
- [c3] Amanda Song, Linjie Li, Chad Atalla, Gary Cottrell: Learning to See People like People: Predicting Social Perceptions of Faces. CogSci 2017
- [i1] Amanda Song, Linjie Li, Chad Atalla, Garrison W. Cottrell: Learning to see people like people. CoRR abs/1705.04282 (2017)
- 2016
- [c2] Linjie Li, Vicente L. Malave, Amanda Song, Angela J. Yu: Extracting Human Face Similarity Judgments: Pairs or Triplets? CogSci 2016
- [c1] Amanda Song, Linjie Li, Vicente L. Malave, Gary Cottrell, Angela J. Yu: Understanding human facial attractiveness from multiple views. CogSci 2016
last updated on 2025-01-21 00:22 CET by the dblp team
all metadata released as open data under CC0 1.0 license