Ji Zhang 0011
Person information
- affiliation: Alibaba Group, DAMO Academy, Hangzhou, China
Other persons with the same name
- Ji Zhang — disambiguation page
- Ji Zhang 0001 — University of Southern Queensland, Faculty of Health Engineering and Sciences, Toowoomba, Australia (and 1 more)
- Ji Zhang 0002 — Auburn University, Department of Computer Science and Software Engineering, AL, USA
- Ji Zhang 0003 — Kaarta, Inc., Pittsburgh, PA, USA (and 1 more)
- Ji Zhang 0004 — Henan University of Science and Technology, School of Mathematics and Statistics, Luoyang, China (and 1 more)
- Ji Zhang 0005 — Xi'an Jiaotong University, Institute of Artificial Intelligence and Robotics, China
- Ji Zhang 0006 — Beijing Institute of Technology, School of Automation, China
- Ji Zhang 0007 — Wuhan University of Science and Technology, School of Computer Science and Technology / Hubei Province Key Laboratory of Intelligent Information Processing and Real-time Industrial System, China
- Ji Zhang 0008 — Beihang University, School of Instrumentation and Optoelectronic Engineering, Beijing, China
- Ji Zhang 0009 — Soochow University, School of Electronic and Information Engineering, Suzhou, China
- Ji Zhang 0010 — Huazhong University of Science and Technology, HUST, Wuhan National Laboratory for Optoelectronics, China (and 2 more)
- Ji Zhang 0012 — University of Electronic Science and Technology of China, School of Computer Science and Engineering, Chengdu, China
2020 – today
- 2024
- [j2]Jiabo Ye, Junfeng Tian, Ming Yan, Haiyang Xu, Qinghao Ye, Yaya Shi, Xiaoshan Yang, Xuwu Wang, Ji Zhang, Liang He, Xin Lin:
UniQRNet: Unifying Referring Expression Grounding and Segmentation with QRNet. ACM Trans. Multim. Comput. Commun. Appl. 20(8): 246:1-246:28 (2024) - [c65]Chaoya Jiang, Wei Ye, Haiyang Xu, Qinghao Ye, Ming Yan, Ji Zhang, Shikun Zhang:
TiMix: Text-Aware Image Mixing for Effective Vision-Language Pre-training. AAAI 2024: 2489-2497 - [c64]Hongzhan Chen, Hehong Chen, Ming Yan, Wenshen Xu, Gao Xing, Weizhou Shen, Xiaojun Quan, Chenliang Li, Ji Zhang, Fei Huang:
SocialBench: Sociality Evaluation of Role-Playing Conversational Agents. ACL (Findings) 2024: 2108-2126 - [c63]Yumeng Liu, Zhenghua Li, Haochen Jiang, Bo Zhang, Chen Li, Ji Zhang:
Towards Better Utilization of Multi-Reference Training Data for Chinese Grammatical Error Correction. ACL (Findings) 2024: 3044-3052 - [c62]Yuanhang Zheng, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Yang Liu:
Budget-Constrained Tool Learning with Planning. ACL (Findings) 2024: 9039-9052 - [c61]An Liu, Zonghan Yang, Zhenhe Zhang, Qingyuan Hu, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Yang Liu:
PANDA: Preference Adaptation for Enhancing Domain-Specific Abilities of LLMs. ACL (Findings) 2024: 10960-10977 - [c60]Ziyue Wang, Chi Chen, Yiqi Zhu, Fuwen Luo, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Maosong Sun, Yang Liu:
Browse and Concentrate: Comprehending Multimodal Content via Prior-LLM Context Fusion. ACL (1) 2024: 11229-11245 - [c59]Chi Chen, Yiyang Du, Zheng Fang, Ziyue Wang, Fuwen Luo, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Maosong Sun, Yang Liu:
Model Composition for Multimodal Large Language Models. ACL (1) 2024: 11246-11262 - [c58]Jixiang Hong, Quan Tu, Changyu Chen, Gao Xing, Ji Zhang, Rui Yan:
CycleAlign: Iterative Distillation from Black-box LLM to White-box Models for Better Human Alignment. ACL (Findings) 2024: 14596-14609 - [c57]Vadim Grigorev, Jiayu Li, Weizhi Ma, Zhiyu He, Min Zhang, Yiqun Liu, Ming Yan, Ji Zhang:
SiTunes: A Situational Music Recommendation Dataset with Physiological and Psychological Signals. CHIIR 2024: 417-421 - [c56]Yuhan Liu, Xiuying Chen, Gao Xing, Ji Zhang, Rui Yan:
IAD: In-Context Learning Ability Decoupler of Large Language Models in Meta-Training. LREC/COLING 2024: 8535-8545 - [c55]Haowei Liu, Yaya Shi, Haiyang Xu, Chunfeng Yuan, Qinghao Ye, Chenliang Li, Ming Yan, Ji Zhang, Fei Huang, Bing Li, Weiming Hu:
Semantics-enhanced Cross-modal Masked Image Modeling for Vision-Language Pre-training. LREC/COLING 2024: 14664-14675 - [c54]Haowei Liu, Yaya Shi, Haiyang Xu, Chunfeng Yuan, Qinghao Ye, Chenliang Li, Ming Yan, Ji Zhang, Fei Huang, Bing Li, Weiming Hu:
Unifying Latent and Lexicon Representations for Effective Video-Text Retrieval. LREC/COLING 2024: 17031-17041 - [c53]Qinghao Ye, Haiyang Xu, Jiabo Ye, Ming Yan, Anwen Hu, Haowei Liu, Qi Qian, Ji Zhang, Fei Huang:
mPLUG-Owl2: Revolutionizing Multi-modal Large Language Model with Modality Collaboration. CVPR 2024: 13040-13051 - [c52]Chaoya Jiang, Haiyang Xu, Mengfan Dong, Jiaxing Chen, Wei Ye, Ming Yan, Qinghao Ye, Ji Zhang, Fei Huang, Shikun Zhang:
Hallucination Augmented Contrastive Learning for Multimodal Large Language Model. CVPR 2024: 27026-27036 - [c51]Liang Zhang, Anwen Hu, Haiyang Xu, Ming Yan, Yichen Xu, Qin Jin, Ji Zhang, Fei Huang:
TinyChart: Efficient Chart Understanding with Program-of-Thoughts Learning and Visual Token Merging. EMNLP 2024: 1882-1898 - [c50]Anwen Hu, Haiyang Xu, Jiabo Ye, Ming Yan, Liang Zhang, Bo Zhang, Ji Zhang, Qin Jin, Fei Huang, Jingren Zhou:
mPLUG-DocOwl 1.5: Unified Structure Learning for OCR-free Document Understanding. EMNLP (Findings) 2024: 3096-3120 - [c49]Weizhou Shen, Chenliang Li, Hongzhan Chen, Ming Yan, Xiaojun Quan, Hehong Chen, Ji Zhang, Fei Huang:
Small LLMs Are Weak Tool Learners: A Multi-LLM Agent. EMNLP 2024: 16658-16680 - [c48]Houquan Zhou, Zhenghua Li, Bo Zhang, Chen Li, Shaopeng Lai, Ji Zhang, Fei Huang, Min Zhang:
A Simple yet Effective Training-free Prompt-free Approach to Chinese Spelling Correction Based on Large Language Models. EMNLP 2024: 17446-17467 - [c47]Haowei Liu, Xi Zhang, Haiyang Xu, Yaya Shi, Chaoya Jiang, Ming Yan, Ji Zhang, Fei Huang, Chunfeng Yuan, Bing Li, Weiming Hu:
MIBench: Evaluating Multimodal Large Language Models over Multiple Images. EMNLP 2024: 22417-22428 - [c46]Haoyu Tang, Shuaike Zhang, Ming Yan, Ji Zhang, Mingzhu Xu, Yupeng Hu, Liqiang Nie:
Two-Stage Information Bottleneck For Temporal Language Grounding. ICME 2024: 1-6 - [c45]Jiabo Ye, Junfeng Tian, Xiaoshan Yang, Zhenru Zhang, Anwen Hu, Ming Yan, Ji Zhang, Liang He, Xin Lin:
VG-Annotator: Vision-Language Models as Query Annotators for Unsupervised Visual Grounding. ICME 2024: 1-6 - [c44]Jinqian Chen, Haoyu Tang, Junhao Cheng, Ming Yan, Ji Zhang, Mingzhu Xu, Yupeng Hu, Liqiang Nie:
Breaking Barriers of System Heterogeneity: Straggler-Tolerant Multimodal Federated Learning via Knowledge Distillation. IJCAI 2024: 3789-3797 - [c43]Yuhan Liu, Xiuying Chen, Xiaoqing Zhang, Xing Gao, Ji Zhang, Rui Yan:
From Skepticism to Acceptance: Simulating the Attitude Dynamics Toward Fake News. IJCAI 2024: 7886-7894 - [c42]Chaoya Jiang, Hongrui Jia, Mengfan Dong, Wei Ye, Haiyang Xu, Ming Yan, Ji Zhang, Shikun Zhang:
Hal-Eval: A Universal and Fine-grained Hallucination Evaluation Framework for Large Vision Language Models. ACM Multimedia 2024: 525-534 - [c41]Han Jiang, Haoyu Tang, Ming Yan, Ji Zhang, Mingzhu Xu, Yupeng Hu, Jihua Zhu, Liqiang Nie:
Revisiting Unsupervised Temporal Action Localization: The Primacy of High-Quality Actionness and Pseudolabels. ACM Multimedia 2024: 5643-5652 - [c40]Anwen Hu, Yaya Shi, Haiyang Xu, Jiabo Ye, Qinghao Ye, Ming Yan, Chenliang Li, Qi Qian, Ji Zhang, Fei Huang:
mPLUG-PaperOwl: Scientific Diagram Analysis with the Multimodal Large Language Model. ACM Multimedia 2024: 6929-6938 - [c39]Yuhao Dan, Junfeng Tian, Jie Zhou, Ming Yan, Ji Zhang, Qin Chen, Liang He:
Modeling Comparative Logical Relation with Contrastive Learning for Text Generation. NLPCC (4) 2024: 107-119 - [c38]Yuhan Liu, Zelin Cao, Xing Gao, Ji Zhang, Rui Yan:
Bridging the Space Gap: Unifying Geometry Knowledge Graph Embedding with Optimal Transport. WWW 2024: 2128-2137 - [i68]Hongzhan Chen, Xiaojun Quan, Hehong Chen, Ming Yan, Ji Zhang:
Knowledge Distillation for Closed-Source Language Models. CoRR abs/2401.07013 (2024) - [i67]Weizhou Shen, Chenliang Li, Hongzhan Chen, Ming Yan, Xiaojun Quan, Hehong Chen, Ji Zhang, Fei Huang:
Small LLMs Are Weak Tool Learners: A Multi-LLM Agent. CoRR abs/2401.07324 (2024) - [i66]Junyang Wang, Haiyang Xu, Jiabo Ye, Ming Yan, Weizhou Shen, Ji Zhang, Fei Huang, Jitao Sang:
Mobile-Agent: Autonomous Multi-Modal Mobile Device Agent with Visual Perception. CoRR abs/2401.16158 (2024) - [i65]Zijun Liu, Boqun Kou, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Yang Liu:
Meta Ranking: Less Capable Language Models are Capable for Single Response Judgement. CoRR abs/2402.12146 (2024) - [i64]Ziyue Wang, Chi Chen, Yiqi Zhu, Fuwen Luo, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Maosong Sun, Yang Liu:
Browse and Concentrate: Comprehending Multimodal Content via prior-LLM Context Fusion. CoRR abs/2402.12195 (2024) - [i63]Chi Chen, Yiyang Du, Zheng Fang, Ziyue Wang, Fuwen Luo, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Maosong Sun, Yang Liu:
Model Composition for Multimodal Large Language Models. CoRR abs/2402.12750 (2024) - [i62]An Liu, Zonghan Yang, Zhenhe Zhang, Qingyuan Hu, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Yang Liu:
PANDA: Preference Adaptation for Enhancing Domain-Specific Abilities of LLMs. CoRR abs/2402.12835 (2024) - [i61]Chaoya Jiang, Wei Ye, Mengfan Dong, Hongrui Jia, Haiyang Xu, Ming Yan, Ji Zhang, Shikun Zhang:
Hal-Eval: A Universal and Fine-grained Hallucination Evaluation Framework for Large Vision Language Models. CoRR abs/2402.15721 (2024) - [i60]Yuanhang Zheng, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Yang Liu:
Budget-Constrained Tool Learning with Planning. CoRR abs/2402.15960 (2024) - [i59]Haowei Liu, Yaya Shi, Haiyang Xu, Chunfeng Yuan, Qinghao Ye, Chenliang Li, Ming Yan, Ji Zhang, Fei Huang, Bing Li, Weiming Hu:
Unifying Latent and Lexicon Representations for Effective Video-Text Retrieval. CoRR abs/2402.16769 (2024) - [i58]Haowei Liu, Yaya Shi, Haiyang Xu, Chunfeng Yuan, Qinghao Ye, Chenliang Li, Ming Yan, Ji Zhang, Fei Huang, Bing Li, Weiming Hu:
Semantics-enhanced Cross-modal Masked Image Modeling for Vision-Language Pre-training. CoRR abs/2403.00249 (2024) - [i57]Mieradilijiang Maimaiti, Yuanhang Zheng, Ji Zhang, Fei Huang, Yue Zhang, Wenpei Luo, Kaiyu Huang:
Improving Cross-lingual Representation for Semantic Retrieval with Code-switching. CoRR abs/2403.01364 (2024) - [i56]Yuhan Liu, Xiuying Chen, Xiaoqing Zhang, Xing Gao, Ji Zhang, Rui Yan:
From Skepticism to Acceptance: Simulating the Attitude Dynamics Toward Fake News. CoRR abs/2403.09498 (2024) - [i55]Anwen Hu, Haiyang Xu, Jiabo Ye, Ming Yan, Liang Zhang, Bo Zhang, Chen Li, Ji Zhang, Qin Jin, Fei Huang, Jingren Zhou:
mPLUG-DocOwl 1.5: Unified Structure Learning for OCR-free Document Understanding. CoRR abs/2403.12895 (2024) - [i54]Hongzhan Chen, Hehong Chen, Ming Yan, Wenshen Xu, Xing Gao, Weizhou Shen, Xiaojun Quan, Chenliang Li, Ji Zhang, Fei Huang, Jingren Zhou:
RoleInteract: Evaluating the Social Interaction of Role-Playing Agents. CoRR abs/2403.13679 (2024) - [i53]Zonghan Yang, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Yang Liu:
ReAct Meets ActRe: When Language Agents Enjoy Training Data Autonomy. CoRR abs/2403.14589 (2024) - [i52]Liang Zhang, Anwen Hu, Haiyang Xu, Ming Yan, Yichen Xu, Qin Jin, Ji Zhang, Fei Huang:
TinyChart: Efficient Chart Understanding with Visual Token Merging and Program-of-Thoughts Learning. CoRR abs/2404.16635 (2024) - [i51]Junyang Wang, Haiyang Xu, Haitao Jia, Xi Zhang, Ming Yan, Weizhou Shen, Ji Zhang, Fei Huang, Jitao Sang:
Mobile-Agent-v2: Mobile Device Operation Assistant with Effective Navigation via Multi-Agent Collaboration. CoRR abs/2406.01014 (2024) - [i50]Yuhao Dan, Junfeng Tian, Jie Zhou, Ming Yan, Ji Zhang, Qin Chen, Liang He:
Modeling Comparative Logical Relation with Contrastive Learning for Text Generation. CoRR abs/2406.09095 (2024) - [i49]Haowei Liu, Xi Zhang, Haiyang Xu, Yaya Shi, Chaoya Jiang, Ming Yan, Ji Zhang, Fei Huang, Chunfeng Yuan, Bing Li, Weiming Hu:
MIBench: Evaluating Multimodal Large Language Models over Multiple Images. CoRR abs/2407.15272 (2024) - [i48]Jiabo Ye, Haiyang Xu, Haowei Liu, Anwen Hu, Ming Yan, Qi Qian, Ji Zhang, Fei Huang, Jingren Zhou:
mPLUG-Owl3: Towards Long Image-Sequence Understanding in Multi-Modal Large Language Models. CoRR abs/2408.04840 (2024) - [i47]Tianyuan Shi, Fanqi Wan, Canbin Huang, Xiaojun Quan, Chenliang Li, Ming Yan, Ji Zhang:
ProFuser: Progressive Fusion of Large Language Models. CoRR abs/2408.04998 (2024) - [i46]Chaoya Jiang, Hongrui Jia, Haiyang Xu, Wei Ye, Mengfan Dong, Ming Yan, Ji Zhang, Fei Huang, Shikun Zhang:
MaVEn: An Effective Multi-granularity Hybrid Visual Encoding Framework for Multimodal Large Language Model. CoRR abs/2408.12321 (2024) - [i45]Anwen Hu, Haiyang Xu, Liang Zhang, Jiabo Ye, Ming Yan, Ji Zhang, Qin Jin, Fei Huang, Jingren Zhou:
mPLUG-DocOwl2: High-resolution Compressing for OCR-free Multi-page Document Understanding. CoRR abs/2409.03420 (2024) - [i44]Houquan Zhou, Zhenghua Li, Bo Zhang, Chen Li, Shaopeng Lai, Ji Zhang, Fei Huang, Min Zhang:
A Simple yet Effective Training-free Prompt-free Approach to Chinese Spelling Correction Based on Large Language Models. CoRR abs/2410.04027 (2024)
- 2023
- [j1]Ming Yan, Haiyang Xu, Chenliang Li, Junfeng Tian, Bin Bi, Wei Wang, Xianzhe Xu, Ji Zhang, Songfang Huang, Fei Huang, Luo Si, Rong Jin:
Achieving Human Parity on Visual Question Answering. ACM Trans. Inf. Syst. 41(3): 79:1-79:40 (2023) - [c37]Ang Lv, Jinpeng Li, Yuhan Chen, Gao Xing, Ji Zhang, Rui Yan:
DialoGPS: Dialogue Path Sampling in Continuous Semantic Space for Data Augmentation in Multi-Turn Conversations. ACL (1) 2023: 1267-1280 - [c36]Qianglong Chen, Guohai Xu, Ming Yan, Ji Zhang, Fei Huang, Luo Si, Yin Zhang:
Distinguish Before Answer: Generating Contrastive Explanation as Knowledge for Commonsense Question Answering. ACL (Findings) 2023: 13207-13224 - [c35]Chenliang Li, He Chen, Ming Yan, Weizhou Shen, Haiyang Xu, Zhikai Wu, Zhicheng Zhang, Wenmeng Zhou, Yingda Chen, Chen Cheng, Hongzhu Shi, Ji Zhang, Fei Huang, Jingren Zhou:
ModelScope-Agent: Building Your Customizable Agent System with Open-source Large Language Models. EMNLP (Demos) 2023: 566-578 - [c34]Jiabo Ye, Anwen Hu, Haiyang Xu, Qinghao Ye, Ming Yan, Guohai Xu, Chenliang Li, Junfeng Tian, Qi Qian, Ji Zhang, Qin Jin, Liang He, Xin Lin, Fei Huang:
UReader: Universal OCR-free Visually-situated Language Understanding with Multimodal Large Language Model. EMNLP (Findings) 2023: 2841-2858 - [c33]Hongzhan Chen, Siyue Wu, Xiaojun Quan, Rui Wang, Ming Yan, Ji Zhang:
MCC-KD: Multi-CoT Consistent Knowledge Distillation. EMNLP (Findings) 2023: 6805-6820 - [c32]Houquan Zhou, Yumeng Liu, Zhenghua Li, Min Zhang, Bo Zhang, Chen Li, Ji Zhang, Fei Huang:
Improving Seq2Seq Grammatical Error Correction via Decoding Interventions. EMNLP (Findings) 2023: 7393-7405 - [c31]Qinghao Ye, Guohai Xu, Ming Yan, Haiyang Xu, Qi Qian, Ji Zhang, Fei Huang:
HiTeA: Hierarchical Temporal-Aware Video-Language Pre-training. ICCV 2023: 15359-15370 - [c30]Haiyang Xu, Qinghao Ye, Ming Yan, Yaya Shi, Jiabo Ye, Yuanhong Xu, Chenliang Li, Bin Bi, Qi Qian, Wei Wang, Guohai Xu, Ji Zhang, Songfang Huang, Fei Huang, Jingren Zhou:
mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video. ICML 2023: 38728-38748 - [c29]Yaya Shi, Haowei Liu, Haiyang Xu, Zongyang Ma, Qinghao Ye, Anwen Hu, Ming Yan, Ji Zhang, Fei Huang, Chunfeng Yuan, Bing Li, Weiming Hu, Zheng-Jun Zha:
Learning Semantics-Grounded Vocabulary Representation for Video-Text Retrieval. ACM Multimedia 2023: 4460-4470 - [c28]Chaoya Jiang, Haiyang Xu, Wei Ye, Qinghao Ye, Chenliang Li, Ming Yan, Bin Bi, Shikun Zhang, Fei Huang, Ji Zhang:
COPA: Efficient Vision-Language Pre-training through Collaborative Object- and Patch-Text Alignment. ACM Multimedia 2023: 4480-4491 - [c27]Qinghao Ye, Haiyang Xu, Ming Yan, Chenlin Zhao, Junyang Wang, Xiaoshan Yang, Ji Zhang, Fei Huang, Jitao Sang, Changsheng Xu:
mPLUG-Octopus: The Versatile Assistant Empowered by A Modularized End-to-End Multimodal LLM. ACM Multimedia 2023: 9365-9367 - [i43]Haiyang Xu, Qinghao Ye, Ming Yan, Yaya Shi, Jiabo Ye, Yuanhong Xu, Chenliang Li, Bin Bi, Qi Qian, Wei Wang, Guohai Xu, Ji Zhang, Songfang Huang, Fei Huang, Jingren Zhou:
mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video. CoRR abs/2302.00402 (2023) - [i42]Junfeng Tian, Hehong Chen, Guohai Xu, Ming Yan, Xing Gao, Jianhai Zhang, Chenliang Li, Jiayi Liu, Wenshen Xu, Haiyang Xu, Qi Qian, Wei Wang, Qinghao Ye, Jiejing Zhang, Ji Zhang, Fei Huang, Jingren Zhou:
ChatPLUG: Open-Domain Generative Dialogue System with Internet-Augmented Instruction Tuning for Digital Human. CoRR abs/2304.07849 (2023) - [i41]Qinghao Ye, Haiyang Xu, Guohai Xu, Jiabo Ye, Ming Yan, Yiyang Zhou, Junyang Wang, Anwen Hu, Pengcheng Shi, Yaya Shi, Chenliang Li, Yuanhong Xu, Hehong Chen, Junfeng Tian, Qian Qi, Ji Zhang, Fei Huang:
mPLUG-Owl: Modularization Empowers Large Language Models with Multimodality. CoRR abs/2304.14178 (2023) - [i40]Qianglong Chen, Feng Ji, Feng-Lin Li, Guohai Xu, Ming Yan, Ji Zhang, Yin Zhang:
AMTSS: An Adaptive Multi-Teacher Single-Student Knowledge Distillation Framework For Multilingual Language Inference. CoRR abs/2305.07928 (2023) - [i39]Qianglong Chen, Guohai Xu, Ming Yan, Ji Zhang, Fei Huang, Luo Si, Yin Zhang:
Distinguish Before Answer: Generating Contrastive Explanation as Knowledge for Commonsense Question Answering. CoRR abs/2305.08135 (2023) - [i38]Haiyang Xu, Qinghao Ye, Xuan Wu, Ming Yan, Yuan Miao, Jiabo Ye, Guohai Xu, Anwen Hu, Yaya Shi, Guangwei Xu, Chenliang Li, Qi Qian, Maofei Que, Ji Zhang, Xiao Zeng, Fei Huang:
Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Dataset for Pre-training and Benchmarks. CoRR abs/2306.04362 (2023) - [i37]Ang Lv, Jinpeng Li, Yuhan Chen, Xing Gao, Ji Zhang, Rui Yan:
DialoGPS: Dialogue Path Sampling in Continuous Semantic Space for Data Augmentation in Multi-Turn Conversations. CoRR abs/2306.16770 (2023) - [i36]Jiabo Ye, Anwen Hu, Haiyang Xu, Qinghao Ye, Ming Yan, Yuhao Dan, Chenlin Zhao, Guohai Xu, Chenliang Li, Junfeng Tian, Qian Qi, Ji Zhang, Fei Huang:
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding. CoRR abs/2307.02499 (2023) - [i35]Guohai Xu, Jiayi Liu, Ming Yan, Haotian Xu, Jinghui Si, Zhuoran Zhou, Peng Yi, Xing Gao, Jitao Sang, Rong Zhang, Ji Zhang, Chao Peng, Fei Huang, Jingren Zhou:
CValues: Measuring the Values of Chinese Large Language Models from Safety to Responsibility. CoRR abs/2307.09705 (2023) - [i34]Chaoya Jiang, Haiyang Xu, Wei Ye, Qinghao Ye, Chenliang Li, Ming Yan, Bin Bi, Shikun Zhang, Ji Zhang, Fei Huang:
COPA: Efficient Vision-Language Pre-training Through Collaborative Object- and Patch-Text Alignment. CoRR abs/2308.03475 (2023) - [i33]Junyang Wang, Yiyang Zhou, Guohai Xu, Pengcheng Shi, Chenlin Zhao, Haiyang Xu, Qinghao Ye, Ming Yan, Ji Zhang, Jihua Zhu, Jitao Sang, Haoyu Tang:
Evaluation and Analysis of Hallucination in Large Vision-Language Models. CoRR abs/2308.15126 (2023) - [i32]Chenliang Li, Hehong Chen, Ming Yan, Weizhou Shen, Haiyang Xu, Zhikai Wu, Zhicheng Zhang, Wenmeng Zhou, Yingda Chen, Chen Cheng, Hongzhu Shi, Ji Zhang, Fei Huang, Jingren Zhou:
ModelScope-Agent: Building Your Customizable Agent System with Open-source Large Language Models. CoRR abs/2309.00986 (2023) - [i31]Jiabo Ye, Anwen Hu, Haiyang Xu, Qinghao Ye, Ming Yan, Guohai Xu, Chenliang Li, Junfeng Tian, Qi Qian, Ji Zhang, Qin Jin, Liang He, Xin Alex Lin, Fei Huang:
UReader: Universal OCR-free Visually-situated Language Understanding with Multimodal Large Language Model. CoRR abs/2310.05126 (2023) - [i30]Houquan Zhou, Yumeng Liu, Zhenghua Li, Min Zhang, Bo Zhang, Chen Li, Ji Zhang, Fei Huang:
Improving Seq2Seq Grammatical Error Correction via Decoding Interventions. CoRR abs/2310.14534 (2023) - [i29]Hongzhan Chen, Siyue Wu, Xiaojun Quan, Rui Wang, Ming Yan, Ji Zhang:
MCC-KD: Multi-CoT Consistent Knowledge Distillation. CoRR abs/2310.14747 (2023) - [i28]Jixiang Hong, Quan Tu, Changyu Chen, Xing Gao, Ji Zhang, Rui Yan:
CycleAlign: Iterative Distillation from Black-box LLM to White-box Models for Better Human Alignment. CoRR abs/2310.16271 (2023) - [i27]Qinghao Ye, Haiyang Xu, Jiabo Ye, Ming Yan, Anwen Hu, Haowei Liu, Qi Qian, Ji Zhang, Fei Huang, Jingren Zhou:
mPLUG-Owl2: Revolutionizing Multi-modal Large Language Model with Modality Collaboration. CoRR abs/2311.04257 (2023) - [i26]Junyang Wang, Yuhang Wang, Guohai Xu, Jing Zhang, Yukai Gu, Haitao Jia, Ming Yan, Ji Zhang, Jitao Sang:
An LLM-free Multi-dimensional Benchmark for MLLMs Hallucination Evaluation. CoRR abs/2311.07397 (2023) - [i25]Anwen Hu, Yaya Shi, Haiyang Xu, Jiabo Ye, Qinghao Ye, Ming Yan, Chenliang Li, Qi Qian, Ji Zhang, Fei Huang:
mPLUG-PaperOwl: Scientific Diagram Analysis with the Multimodal Large Language Model. CoRR abs/2311.18248 (2023) - [i24]Chaoya Jiang, Haiyang Xu, Mengfan Dong, Jiaxing Chen, Wei Ye, Ming Yan, Qinghao Ye, Ji Zhang, Fei Huang, Shikun Zhang:
Hallucination Augmented Contrastive Learning for Multimodal Large Language Model. CoRR abs/2312.06968 (2023) - [i23]Chaoya Jiang, Wei Ye, Haiyang Xu, Qinghao Ye, Ming Yan, Ji Zhang, Shikun Zhang:
TiMix: Text-aware Image Mixing for Effective Vision-Language Pre-training. CoRR abs/2312.08846 (2023)
- 2022
- [c26]Bo Chen, Jiayi Liu, Mieradilijiang Maimaiti, Xing Gao, Ji Zhang:
Generating Persuasive Responses to Customer Reviews with Multi-Source Prior Knowledge in E-commerce. CIKM 2022: 2994-3002 - [c25]Guodun Li, Yuchen Zhai, Qianglong Chen, Xing Gao, Ji Zhang, Yin Zhang:
Continual Few-shot Intent Detection. COLING 2022: 333-343 - [c24]Jiayi Liu, Wei Wei, Zhixuan Chu, Xing Gao, Ji Zhang, Tan Yan, Yulin Kang:
Incorporating Casual Analysis into Diversified and Logical Response Generation. COLING 2022: 378-388 - [c23]Jiabo Ye, Junfeng Tian, Ming Yan, Xiaoshan Yang, Xuwu Wang, Ji Zhang, Liang He, Xin Lin:
Shifting More Attention to Visual Backbone: Query-modulated Refinement Networks for End-to-End Visual Grounding. CVPR 2022: 15481-15491 - [c22]Chenliang Li, Haiyang Xu, Junfeng Tian, Wei Wang, Ming Yan, Bin Bi, Jiabo Ye, He Chen, Guohai Xu, Zheng Cao, Ji Zhang, Songfang Huang, Fei Huang, Jingren Zhou, Luo Si:
mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connections. EMNLP 2022: 7241-7259 - [c21]Xuwu Wang, Jiabo Ye, Zhixu Li, Junfeng Tian, Yong Jiang, Ming Yan, Ji Zhang, Yanghua Xiao:
CAT-MNER: Multimodal Named Entity Recognition with Knowledge-Refined Cross-Modal Attention. ICME 2022: 1-6 - [c20]Qianglong Chen, Feng-Lin Li, Guohai Xu, Ming Yan, Ji Zhang, Yin Zhang:
DictBERT: Dictionary Description Knowledge Enhanced Language Model Pre-training via Contrastive Learning. IJCAI 2022: 4086-4092 - [c19]Yiwei Ma, Guohai Xu, Xiaoshuai Sun, Ming Yan, Ji Zhang, Rongrong Ji:
X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text Retrieval. ACM Multimedia 2022: 638-647 - [c18]Yi Huang, Xiaoshan Yang, Ji Zhang, Changsheng Xu:
Relative Alignment Network for Source-Free Multimodal Video Domain Adaptation. ACM Multimedia 2022: 1652-1660 - [c17]Feifei Zhang, Ming Yan, Ji Zhang, Changsheng Xu:
Comprehensive Relationship Reasoning for Composed Query Based Image Retrieval. ACM Multimedia 2022: 4655-4664 - [c16]Jianhai Zhang, Mieradilijiang Maimaiti, Gao Xing, Yuanhang Zheng, Ji Zhang:
MGIMN: Multi-Grained Interactive Matching Network for Few-shot Text Classification. NAACL-HLT 2022: 1937-1946 - [c15]Zhi Li, Xing Gao, Ji Zhang, Yin Zhang:
Multi-label Masked Language Modeling on Zero-shot Code-switched Sentiment Analysis. SIGIR 2022: 2663-2668 - [i22]Jiabo Ye, Junfeng Tian, Ming Yan, Xiaoshan Yang, Xuwu Wang, Ji Zhang, Liang He, Xin Lin:
Shifting More Attention to Visual Backbone: Query-modulated Refinement Networks for End-to-End Visual Grounding. CoRR abs/2203.15442 (2022) - [i21]Wenshen Xu, Mieradilijiang Maimaiti, Yuanhang Zheng, Xin Tang, Ji Zhang:
Auto-MLM: Improved Contrastive Learning for Self-supervised Multi-lingual Knowledge Retrieval. CoRR abs/2203.16187 (2022) - [i20]Jianhai Zhang, Mieradilijiang Maimaiti, Xing Gao, Yuanhang Zheng, Ji Zhang:
MGIMN: Multi-Grained Interactive Matching Network for Few-shot Text Classification. CoRR abs/2204.04952 (2022) - [i19]Chenliang Li, Haiyang Xu, Junfeng Tian, Wei Wang, Ming Yan, Bin Bi, Jiabo Ye, Hehong Chen, Guohai Xu, Zheng Cao, Ji Zhang, Songfang Huang, Fei Huang, Jingren Zhou, Luo Si:
mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connections. CoRR abs/2205.12005 (2022) - [i18]Yiwei Ma, Guohai Xu, Xiaoshuai Sun, Ming Yan, Ji Zhang, Rongrong Ji:
X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text Retrieval. CoRR abs/2207.07285 (2022) - [i17]Qianglong Chen, Feng-Lin Li, Guohai Xu, Ming Yan, Ji Zhang, Yin Zhang:
DictBERT: Dictionary Description Knowledge Enhanced Language Model Pre-training via Contrastive Learning. CoRR abs/2208.00635 (2022) - [i16]Jiayi Liu, Wei Wei, Zhixuan Chu, Xing Gao, Ji Zhang, Tan Yan, Yulin Kang:
Incorporating Casual Analysis into Diversified and Logical Response Generation. CoRR abs/2209.09482 (2022) - [i15]Bo Chen, Jiayi Liu, Mieradilijiang Maimaiti, Xing Gao, Ji Zhang:
Generating Persuasive Responses to Customer Reviews with Multi-Source Prior Knowledge in E-commerce. CoRR abs/2209.09497 (2022) - [i14]Junyang Wang, Yi Zhang, Ming Yan, Ji Zhang, Jitao Sang:
Zero-shot Image Captioning by Anchor-augmented Vision-Language Space Alignment. CoRR abs/2211.07275 (2022) - [i13]Qinghao Ye, Guohai Xu, Ming Yan, Haiyang Xu, Qi Qian, Ji Zhang, Fei Huang:
HiTeA: Hierarchical Temporal-Aware Video-Language Pre-training. CoRR abs/2212.14546 (2022)
- 2021
- [c14]Qianglong Chen, Feng Ji, Xiangji Zeng, Feng-Lin Li, Ji Zhang, Haiqing Chen, Yin Zhang:
KACE: Generating Knowledge Aware Contrastive Explanations for Natural Language Inference. ACL/IJCNLP (1) 2021: 2516-2527 - [c13]Xuming Lin, Shaobo Cui, Zhongzhou Zhao, Wei Zhou, Ji Zhang, Haiqing Chen:
GGP: A Graph-based Grouping Planner for Explicit Control of Long Text Generation. CIKM 2021: 3253-3257 - [c12]Fu Sun, Feng-Lin Li, Ruize Wang, Qianglong Chen, Xingyi Cheng, Ji Zhang:
K-AID: Enhancing Pre-trained Language Models with Domain Knowledge for Question Answering. CIKM 2021: 4125-4134 - [c11]Guohai Xu, Hehong Chen, Feng-Lin Li, Fu Sun, Yunzhou Shi, Zhixiong Zeng, Wei Zhou, Zhongzhou Zhao, Ji Zhang:
AliMe MKG: A Multi-modal Knowledge Graph for Live-streaming E-commerce. CIKM 2021: 4808-4812 - [c10]Mieradilijiang Maimaiti, Yang Liu, Yuanhang Zheng, Gang Chen, Kaiyu Huang, Ji Zhang, Huanbo Luan, Maosong Sun:
Segment, Mask, and Predict: Augmenting Chinese Word Segmentation with Self-Supervision. EMNLP (1) 2021: 2068-2077 - [c9]Yangyang Guo, Liqiang Nie, Zhiyong Cheng, Feng Ji, Ji Zhang, Alberto Del Bimbo:
AdaVQA: Overcoming Language Priors with Adapted Margin Cosine Loss. IJCAI 2021: 708-714 - [c8]Yuhao Cui, Zhou Yu, Chunqi Wang, Zhongzhou Zhao, Ji Zhang, Meng Wang, Jun Yu:
ROSITA: Enhancing Vision-and-Language Semantic Alignments via Cross- and Intra-modal Knowledge Integration. ACM Multimedia 2021: 797-806 - [c7]Feng-Lin Li, Zhongzhou Zhao, Qin Lu, Xuming Lin, Hehong Chen, Bo Chen, Liming Pu, Jiashuo Zhang, Fu Sun, Xikai Liu, Liqun Xie, Qi Huang, Ji Zhang, Haiqing Chen:
AliMe Avatar: Multi-modal Content Production and Presentation for Live-streaming E-commerce. SIGIR 2021: 2635-2636 - [c6]Guohai Xu, Yan Shao, Chenliang Li, Feng-Lin Li, Bin Bi, Ji Zhang, Haiqing Chen:
AliMe DA: A Data Augmentation Framework for Question Answering in Cold-start Scenarios. SIGIR 2021: 2637-2638 - [i12]Shaobo Cui, Xintong Bao, Xinxing Zu, Yangyang Guo, Zhongzhou Zhao, Ji Zhang, Haiqing Chen:
OneStop QAMaker: Extract Question-Answer Pairs from Text in a One-Stop Approach. CoRR abs/2102.12128 (2021) - [i11]Yangyang Guo, Liqiang Nie, Zhiyong Cheng, Feng Ji, Ji Zhang, Alberto Del Bimbo:
AdaVQA: Overcoming Language Priors with Adapted Margin Cosine Loss. CoRR abs/2105.01993 (2021) - [i10]Yuhao Cui, Zhou Yu, Chunqi Wang, Zhongzhou Zhao, Ji Zhang, Meng Wang, Jun Yu:
ROSITA: Enhancing Vision-and-Language Semantic Alignments via Cross- and Intra-modal Knowledge Integration. CoRR abs/2108.07073 (2021) - [i9]Shaobo Cui, Xintong Bao, Xuming Lin, Zhongzhou Zhao, Ji Zhang, Wei Zhou, Haiqing Chen:
SPMoE: Generate Multiple Pattern-Aware Outputs with Sparse Pattern Mixture of Experts. CoRR abs/2108.07535 (2021) - [i8]Xuming Lin, Shaobo Cui, Zhongzhou Zhao, Wei Zhou, Ji Zhang, Haiqing Chen:
GGP: A Graph-based Grouping Planner for Explicit Control of Long Text Generation. CoRR abs/2108.07998 (2021) - [i7]Guohai Xu, Hehong Chen, Feng-Lin Li, Fu Sun, Yunzhou Shi, Zhixiong Zeng, Wei Zhou, Zhongzhou Zhao, Ji Zhang:
AliMe MKG: A Multi-modal Knowledge Graph for Live-streaming E-commerce. CoRR abs/2109.07411 (2021) - [i6]Fu Sun, Feng-Lin Li, Ruize Wang, Qianglong Chen, Xingyi Cheng, Ji Zhang:
K-AID: Enhancing Pre-trained Language Models with Domain Knowledge for Question Answering. CoRR abs/2109.10547 (2021) - [i5]Ming Yan, Haiyang Xu, Chenliang Li, Junfeng Tian, Bin Bi, Wei Wang, Weihua Chen, Xianzhe Xu, Fan Wang, Zheng Cao, Zhicheng Zhang, Qiyu Zhang, Ji Zhang, Songfang Huang, Fei Huang, Luo Si, Rong Jin:
Achieving Human Parity on Visual Question Answering. CoRR abs/2111.08896 (2021)
- 2020
- [c5]Zhenxin Fu, Shaobo Cui, Feng Ji, Ji Zhang, Haiqing Chen, Dongyan Zhao, Rui Yan:
Query-to-Session Matching: Do NOT Forget History and Future during Response Selection for Multi-Turn Dialogue Systems. CIKM 2020: 365-374 - [c4]Feng-Lin Li, Hehong Chen, Guohai Xu, Tian Qiu, Feng Ji, Ji Zhang, Haiqing Chen:
AliMeKG: Domain Knowledge Graph Construction and Application in E-commerce. CIKM 2020: 2581-2588 - [c3]Ruize Wang, Zhongyu Wei, Ying Cheng, Piji Li, Haijun Shan, Ji Zhang, Qi Zhang, Xuanjing Huang:
Keep it Consistent: Topic-Aware Storytelling from an Image Stream via Iterative Multi-agent Communication. COLING 2020: 2250-2260 - [i4]Feng-Lin Li, Hehong Chen, Guohai Xu, Tian Qiu, Feng Ji, Ji Zhang, Haiqing Chen:
AliMe KG: Domain Knowledge Graph Construction and Application in E-commerce. CoRR abs/2009.11684 (2020)
2010 – 2019
- 2019
- [c2]Ming Yan, Jiangnan Xia, Chen Wu, Bin Bi, Zhongzhou Zhao, Ji Zhang, Luo Si, Rui Wang, Wei Wang, Haiqing Chen:
A Deep Cascade Model for Multi-Document Reading Comprehension. AAAI 2019: 7354-7361 - [i3]Ruize Wang, Zhongyu Wei, Piji Li, Haijun Shan, Ji Zhang, Qi Zhang, Xuanjing Huang:
Keep it Consistent: Topic-Aware Storytelling from an Image Stream via Iterative Multi-agent Communication. CoRR abs/1911.04192 (2019)
- 2018
- [c1]Chunqi Wang, Ji Zhang, Haiqing Chen:
Semi-Autoregressive Neural Machine Translation. EMNLP 2018: 479-488 - [i2]Chunqi Wang, Ji Zhang, Haiqing Chen:
Semi-Autoregressive Neural Machine Translation. CoRR abs/1808.08583 (2018) - [i1]Ming Yan, Jiangnan Xia, Chen Wu, Bin Bi, Zhongzhou Zhao, Ji Zhang, Luo Si, Rui Wang, Wei Wang, Haiqing Chen:
A Deep Cascade Model for Multi-Document Reading Comprehension. CoRR abs/1811.11374 (2018)
last updated on 2024-12-17 21:52 CET by the dblp team
all metadata released as open data under CC0 1.0 license