Conghui He
2020 – today
- 2024
- [j9] Dinghao Yang, Bin Wang, Weijia Li, Conghui He:
  Exploring the user guidance for more accurate building segmentation from high-resolution remote sensing images. Int. J. Appl. Earth Obs. Geoinformation 126: 103609 (2024)
- [j8] Weijia Li, Zhenghao Hu, Lingxuan Meng, Jinwang Wang, Juepeng Zheng, Runmin Dong, Conghui He, Gui-Song Xia, Haohuan Fu, Dahua Lin:
  Weakly Supervised 3-D Building Reconstruction From Monocular Remote Sensing Images. IEEE Trans. Geosci. Remote. Sens. 62: 1-15 (2024)
- [j7] Haojie Ding, Bin Wang, Guoliang Kang, Weijia Li, Conghui He, Yao Zhao, Yunchao Wei:
  DropQueries: A Simple Way to Discover Comprehensive Segment Representations. IEEE Trans. Multim. 26: 3481-3490 (2024)
- [c33] Bin Wang, Fan Wu, Xiao Han, Jiahui Peng, Huaping Zhong, Pan Zhang, Xiaoyi Dong, Weijia Li, Wei Li, Jiaqi Wang, Conghui He:
  VIGC: Visual Instruction Generation and Correction. AAAI 2024: 5309-5317
- [c32] Le Zhuo, Zewen Chi, Minghao Xu, Heyan Huang, Jianan Zhao, Heqi Zheng, Conghui He, Xian-Ling Mao, Wentao Zhang:
  ProtLLM: An Interleaved Protein-Language LLM with Protein-as-Word Pre-Training. ACL (1) 2024: 8950-8963
- [c31] Jiaxing Sun, Weiquan Huang, Jiang Wu, Chenya Gu, Wei Li, Songyang Zhang, Hang Yan, Conghui He:
  Benchmarking Chinese Commonsense Reasoning of LLMs: From Chinese-Specifics to Reasoning-Memorization Correlations. ACL (1) 2024: 11205-11228
- [c30] Qidong Huang, Xiaoyi Dong, Pan Zhang, Bin Wang, Conghui He, Jiaqi Wang, Dahua Lin, Weiming Zhang, Nenghai Yu:
  OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allocation. CVPR 2024: 13418-13427
- [c29] Weijia Li, Haote Yang, Zhenghao Hu, Juepeng Zheng, Gui-Song Xia, Conghui He:
  3D Building Reconstruction from Monocular Remote Sensing Images with Multi-level Supervisions. CVPR 2024: 27728-27737
- [c28] Junyan Ye, Qiyan Luo, Jinhua Yu, Huaping Zhong, Zhimeng Zheng, Conghui He, Weijia Li:
  SG-BEV: Satellite-Guided BEV Fusion for Cross-View Semantic Segmentation. CVPR 2024: 27748-27757
- [c27] Junyan Ye, Zhutao Lv, Weijia Li, Jinhua Yu, Haote Yang, Huaping Zhong, Conghui He:
  Cross-View Image Geo-Localization with Panorama-BEV Co-retrieval Network. ECCV (37) 2024: 74-90
- [c26] Yuan Liu, Haodong Duan, Yuanhan Zhang, Bo Li, Songyang Zhang, Wangbo Zhao, Yike Yuan, Jiaqi Wang, Conghui He, Ziwei Liu, Kai Chen, Dahua Lin:
  MMBench: Is Your Multi-modal Model an All-Around Player? ECCV (6) 2024: 216-233
- [c25] Yiqi Lin, Conghui He, Alex Jinpeng Wang, Bin Wang, Weijia Li, Mike Zheng Shou:
  Parrot Captions Teach CLIP to Spot Text. ECCV (42) 2024: 368-385
- [c24] Lin Chen, Jinsong Li, Xiaoyi Dong, Pan Zhang, Conghui He, Jiaqi Wang, Feng Zhao, Dahua Lin:
  ShareGPT4V: Improving Large Multi-modal Models with Better Captions. ECCV (17) 2024: 370-387
- [c23] Yu Sun, Dongzhan Zhou, Chen Lin, Conghui He, Wanli Ouyang, Han-Sen Zhong:
  LOCR: Location-Guided Transformer for Optical Character Recognition. EMNLP (Findings) 2024: 5480-5497
- [c22] Xiaoran Liu, Kai Lv, Qipeng Guo, Hang Yan, Conghui He, Xipeng Qiu, Dahua Lin:
  LongWanjuan: Towards Systematic Measurement for Long Text Quality. EMNLP (Findings) 2024: 5709-5725
- [c21] Tong Zhu, Xiaoye Qu, Daize Dong, Jiacheng Ruan, Jingqi Tong, Conghui He, Yu Cheng:
  LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-Training. EMNLP 2024: 15913-15923
- [c20] Dongyang Liu, Renrui Zhang, Longtian Qiu, Siyuan Huang, Weifeng Lin, Shitian Zhao, Shijie Geng, Ziyi Lin, Peng Jin, Kaipeng Zhang, Wenqi Shao, Chao Xu, Conghui He, Junjun He, Hao Shao, Pan Lu, Yu Qiao, Hongsheng Li, Peng Gao:
  SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models. ICML 2024
- [i64] Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Bin Wang, Linke Ouyang, Xilin Wei, Songyang Zhang, Haodong Duan, Maosong Cao, Wenwei Zhang, Yining Li, Hang Yan, Yang Gao, Xinyue Zhang, Wei Li, Jingwen Li, Kai Chen, Conghui He, Xingcheng Zhang, Yu Qiao, Dahua Lin, Jiaqi Wang:
  InternLM-XComposer2: Mastering Free-form Text-Image Composition and Comprehension in Vision-Language Large Model. CoRR abs/2401.16420 (2024)
- [i63] Peng Gao, Renrui Zhang, Chris Liu, Longtian Qiu, Siyuan Huang, Weifeng Lin, Shitian Zhao, Shijie Geng, Ziyi Lin, Peng Jin, Kaipeng Zhang, Wenqi Shao, Chao Xu, Conghui He, Junjun He, Hao Shao, Pan Lu, Hongsheng Li, Yu Qiao:
  SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models. CoRR abs/2402.05935 (2024)
- [i62] Kai Lv, Xiaoran Liu, Qipeng Guo, Hang Yan, Conghui He, Xipeng Qiu, Dahua Lin:
  LongWanjuan: Towards Systematic Measurement for Long Text Quality. CoRR abs/2402.13583 (2024)
- [i61] Shuangrui Ding, Zihan Liu, Xiaoyi Dong, Pan Zhang, Rui Qian, Conghui He, Dahua Lin, Jiaqi Wang:
  SongComposer: A Large Language Model for Lyric and Melody Composition in Song Generation. CoRR abs/2402.17645 (2024)
- [i60] Jiantao Qiu, Haijun Lv, Zhenjiang Jin, Rui Wang, Wenchang Ning, Jia Yu, ChaoBin Zhang, Zhenxiang Li, Pei Chu, Yuan Qu, Jin Shi, Lindong Lu, Runyu Peng, Zhiyuan Zeng, Huanze Tang, Zhikai Lei, Jiawei Hong, Keyu Chen, Zhaoye Fei, Ruiliang Xu, Wei Li, Zhongying Tu, Dahua Lin, Yu Qiao, Hang Yan, Conghui He:
  WanJuan-CC: A Safe and High-Quality Open-sourced English Webtext Dataset. CoRR abs/2402.19282 (2024)
- [i59] Yu Sun, Dongzhan Zhou, Chen Lin, Conghui He, Wanli Ouyang, Hansen Zhong:
  LOCR: Location-Guided Transformer for Optical Character Recognition. CoRR abs/2403.02127 (2024)
- [i58] Le Zhuo, Zewen Chi, Minghao Xu, Heyan Huang, Heqi Zheng, Conghui He, Xian-Ling Mao, Wentao Zhang:
  ProtLLM: An Interleaved Protein-Language LLM with Protein-as-Word Pre-Training. CoRR abs/2403.07920 (2024)
- [i57] Jiaxing Sun, Weiquan Huang, Jiang Wu, Chenya Gu, Wei Li, Songyang Zhang, Hang Yan, Conghui He:
  Benchmarking Chinese Commonsense Reasoning of LLMs: From Chinese-Specifics to Reasoning-Memorization Correlations. CoRR abs/2403.14112 (2024)
- [i56] Zheng Cai, Maosong Cao, Haojiong Chen, Kai Chen, Keyu Chen, Xin Chen, Xun Chen, Zehui Chen, Zhi Chen, Pei Chu, Xiaoyi Dong, Haodong Duan, Qi Fan, Zhaoye Fei, Yang Gao, Jiaye Ge, Chenya Gu, Yuzhe Gu, Tao Gui, Aijia Guo, Qipeng Guo, Conghui He, Yingfan Hu, Ting Huang, Tao Jiang, Penglong Jiao, Zhenjiang Jin, Zhikai Lei, Jiaxing Li, Jingwen Li, Linyang Li, Shuaibin Li, Wei Li, Yining Li, Hongwei Liu, Jiangning Liu, Jiawei Hong, Kaiwen Liu, Kuikun Liu, Xiaoran Liu, Chengqi Lv, Haijun Lv, Kai Lv, Li Ma, Runyuan Ma, Zerun Ma, Wenchang Ning, Linke Ouyang, Jiantao Qiu, Yuan Qu, Fukai Shang, Yunfan Shao, Demin Song, Zifan Song, Zhihao Sui, Peng Sun, Yu Sun, Huanze Tang, Bin Wang, Guoteng Wang, Jiaqi Wang, Jiayu Wang, Rui Wang, Yudong Wang, Ziyi Wang, Xingjian Wei, Qizhen Weng, Fan Wu, Yingtong Xiong, Xiaomeng Zhao, et al.:
  InternLM2 Technical Report. CoRR abs/2403.17297 (2024)
- [i55] Chao Pang, Jiang Wu, Jiayu Li, Yi Liu, Jiaxing Sun, Weijia Li, Xingxing Weng, Shuai Wang, Litong Feng, Gui-Song Xia, Conghui He:
  H2RSVLM: Towards Helpful and Honest Remote Sensing Large Vision Language Model. CoRR abs/2403.20213 (2024)
- [i54] Junyan Ye, Qiyan Luo, Jinhua Yu, Huaping Zhong, Zhimeng Zheng, Conghui He, Weijia Li:
  SG-BEV: Satellite-Guided BEV Fusion for Cross-View Semantic Segmentation. CoRR abs/2404.02638 (2024)
- [i53] Weijia Li, Haote Yang, Zhenghao Hu, Juepeng Zheng, Gui-Song Xia, Conghui He:
  3D Building Reconstruction from Monocular Remote Sensing Images with Multi-level Supervisions. CoRR abs/2404.04823 (2024)
- [i52] Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Bin Wang, Linke Ouyang, Songyang Zhang, Haodong Duan, Wenwei Zhang, Yining Li, Hang Yan, Yang Gao, Zhe Chen, Xinyue Zhang, Wei Li, Jingwen Li, Wenhai Wang, Kai Chen, Conghui He, Xingcheng Zhang, Jifeng Dai, Yu Qiao, Dahua Lin, Jiaqi Wang:
  InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD. CoRR abs/2404.06512 (2024)
- [i51] Bin Wang, Zhuangcheng Gu, Chao Xu, Bo Zhang, Botian Shi, Conghui He:
  UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition. CoRR abs/2404.15254 (2024)
- [i50] Zhe Chen, Weiyun Wang, Hao Tian, Shenglong Ye, Zhangwei Gao, Erfei Cui, Wenwen Tong, Kongzhi Hu, Jiapeng Luo, Zheng Ma, Ji Ma, Jiaqi Wang, Xiaoyi Dong, Hang Yan, Hewei Guo, Conghui He, Botian Shi, Zhenjiang Jin, Chao Xu, Bin Wang, Xingjian Wei, Wei Li, Wenjian Zhang, Bo Zhang, Pinlong Cai, Licheng Wen, Xiangchao Yan, Min Dou, Lewei Lu, Xizhou Zhu, Tong Lu, Dahua Lin, Yu Qiao, Jifeng Dai, Wenhai Wang:
  How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites. CoRR abs/2404.16821 (2024)
- [i49] Wei Li, Ren Ma, Jiang Wu, Chenya Gu, Jiahui Peng, Jinyang Len, Songyang Zhang, Hang Yan, Dahua Lin, Conghui He:
  FoundaBench: Evaluating Chinese Fundamental Knowledge Capabilities of Large Language Models. CoRR abs/2404.18359 (2024)
- [i48] Tianyi Bai, Hao Liang, Binwang Wan, Ling Yang, Bozhou Li, Yifan Wang, Bin Cui, Conghui He, Binhang Yuan, Wentao Zhang:
  A Survey of Multimodal Large Language Model from A Data-centric Perspective. CoRR abs/2405.16640 (2024)
- [i47] Bin Wang, Linke Ouyang, Fan Wu, Wenchang Ning, Xiao Han, Zhiyuan Zhao, Jiahui Peng, Yiying Jiang, Dahua Lin, Conghui He:
  DSDL: Data Set Description Language for Bridging Modalities and Tasks in AI Data. CoRR abs/2405.18315 (2024)
- [i46] Qingyun Li, Zhe Chen, Weiyun Wang, Wenhai Wang, Shenglong Ye, Zhenjiang Jin, Guanzhou Chen, Yinan He, Zhangwei Gao, Erfei Cui, Jiashuo Yu, Hao Tian, Jiasheng Zhou, Chao Xu, Bin Wang, Xingjian Wei, Wei Li, Wenjian Zhang, Bo Zhang, Pinlong Cai, Licheng Wen, Xiangchao Yan, Zhenxiang Li, Pei Chu, Yi Wang, Min Dou, Changyao Tian, Xizhou Zhu, Lewei Lu, Yushi Chen, Junjun He, Zhongying Tu, Tong Lu, Yali Wang, Limin Wang, Dahua Lin, Yu Qiao, Botian Shi, Conghui He, Jifeng Dai:
  OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text. CoRR abs/2406.08418 (2024)
- [i45] Renqiu Xia, Song Mao, Xiangchao Yan, Hongbin Zhou, Bo Zhang, Haoyang Peng, Jiahao Pi, Daocheng Fu, Wenjie Wu, Hancheng Ye, Shiyang Feng, Bin Wang, Chao Xu, Conghui He, Pinlong Cai, Min Dou, Botian Shi, Sheng Zhou, Yongwei Wang, Bin Wang, Junchi Yan, Fei Wu, Yu Qiao:
  DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Language Models. CoRR abs/2406.11633 (2024)
- [i44] Tong Zhu, Xiaoye Qu, Daize Dong, Jiacheng Ruan, Jingqi Tong, Conghui He, Yu Cheng:
  LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training. CoRR abs/2406.16554 (2024)
- [i43] Hao Liang, Jiapeng Li, Tianyi Bai, Xijie Huang, Linzhuang Sun, Zhengren Wang, Conghui He, Bin Cui, Chong Chen, Wentao Zhang:
  KeyVideoLLM: Towards Large-scale Video Keyframe Selection. CoRR abs/2407.03104 (2024)
- [i42] Pan Zhang, Xiaoyi Dong, Yuhang Zang, Yuhang Cao, Rui Qian, Lin Chen, Qipeng Guo, Haodong Duan, Bin Wang, Linke Ouyang, Songyang Zhang, Wenwei Zhang, Yining Li, Yang Gao, Peng Sun, Xinyue Zhang, Wei Li, Jingwen Li, Wenhai Wang, Hang Yan, Conghui He, Xingcheng Zhang, Kai Chen, Jifeng Dai, Yu Qiao, Dahua Lin, Jiaqi Wang:
  InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output. CoRR abs/2407.03320 (2024)
- [i41] Yi Yu, Jingru Yu, Xuhong Wang, Juanjuan Li, Yilun Lin, Conghui He, Yanqing Yang, Yu Qiao, Li Li, Fei-Yue Wang:
  Navigating the Data Trading Crossroads: An Interdisciplinary Survey. CoRR abs/2407.11466 (2024)
- [i40] Conghui He, Wei Li, Zhenjiang Jin, Chao Xu, Bin Wang, Dahua Lin:
  OpenDataLab: Empowering General Artificial Intelligence with Open Datasets. CoRR abs/2407.13773 (2024)
- [i39] Zheng Liu, Hao Liang, Xijie Huang, Wentao Xiong, Qinhan Yu, Linzhuang Sun, Chong Chen, Conghui He, Bin Cui, Wentao Zhang:
  SynthVLM: High-Efficiency and High-Quality Synthetic Data for Vision Language Models. CoRR abs/2407.20756 (2024)
- [i38] Hao Liang, Linzhuang Sun, Jingxuan Wei, Xijie Huang, Linkun Sun, Bihui Yu, Conghui He, Wentao Zhang:
  Synth-Empathy: Towards High-Quality Synthetic Empathy Data. CoRR abs/2407.21669 (2024)
- [i37] Junyan Ye, Jun He, Weijia Li, Zhutao Lv, Jinhua Yu, Haote Yang, Conghui He:
  SkyDiffusion: Street-to-Satellite Image Synthesis with Diffusion Models and BEV Paradigm. CoRR abs/2408.01812 (2024)
- [i36] Junyan Ye, Zhutao Lv, Weijia Li, Jinhua Yu, Haote Yang, Huaping Zhong, Conghui He:
  Cross-view image geo-localization with Panorama-BEV Co-Retrieval Network. CoRR abs/2408.05475 (2024)
- [i35] Weijia Li, Jinhua Yu, Dairong Chen, Yi Lin, Runmin Dong, Xiang Zhang, Conghui He, Haohuan Fu:
  Fine-Grained Building Function Recognition from Street-View Images via Geometry-Aware Semi-Supervised Learning. CoRR abs/2408.09460 (2024)
- [i34] Weijia Li, Jun He, Junyan Ye, Huaping Zhong, Zhimeng Zheng, Zilong Huang, Dahua Lin, Conghui He:
  CrossViewDiff: A Cross-View Diffusion Model for Satellite-to-Street View Synthesis. CoRR abs/2408.14765 (2024)
- [i33] Baichuan Zhou, Haote Yang, Dairong Chen, Junyan Ye, Tianyi Bai, Jinhua Yu, Songyang Zhang, Dahua Lin, Conghui He, Weijia Li:
  UrBench: A Comprehensive Benchmark for Evaluating Large Multimodal Models in Multi-View Urban Scenarios. CoRR abs/2408.17267 (2024)
- [i32] Bin Wang, Fan Wu, Linke Ouyang, Zhuangcheng Gu, Rui Zhang, Renqiu Xia, Bo Zhang, Conghui He:
  CDM: A Reliable Metric for Fair and Accurate Formula Recognition Evaluation. CoRR abs/2409.03643 (2024)
- [i31] Chi Zhang, Huaping Zhong, Kuan Zhang, Chengliang Chai, Rui Wang, Xinlin Zhuang, Tianyi Bai, Jiantao Qiu, Lei Cao, Ye Yuan, Guoren Wang, Conghui He:
  Harnessing Diversity for Important Data Selection in Pretraining Large Language Models. CoRR abs/2409.16986 (2024)
- [i30] Bin Wang, Chao Xu, Xiaomeng Zhao, Linke Ouyang, Fan Wu, Zhiyuan Zhao, Rui Xu, Kaiwen Liu, Yuan Qu, Fukai Shang, Bo Zhang, Liqun Wei, Zhihao Sui, Wei Li, Botian Shi, Yu Qiao, Dahua Lin, Conghui He:
  MinerU: An Open-Source Solution for Precise Document Content Extraction. CoRR abs/2409.18839 (2024)
- [i29] Bozhou Li, Hao Liang, Yang Li, Fangcheng Fu, Hongzhi Yin, Conghui He, Wentao Zhang:
  Gradual Learning: Optimizing Fine-Tuning with Partially Mastered Knowledge in Large Language Models. CoRR abs/2410.05802 (2024)
- [i28] Runchuan Zhu, Zhipeng Ma, Jiang Wu, Junyuan Gao, Jiaqi Wang, Dahua Lin, Conghui He:
  Utilize the Flow before Stepping into the Same River Twice: Certainty Represented Knowledge Flow for Refusal-Aware Instruction Tuning. CoRR abs/2410.06913 (2024)
- [i27] Tianyi Bai, Ling Yang, Zhen Hao Wong, Jiahui Peng, Xinlin Zhuang, Chi Zhang, Lijun Wu, Jiantao Qiu, Wentao Zhang, Binhang Yuan, Conghui He:
  Multi-Agent Collaborative Data Selection for Efficient LLM Pretraining. CoRR abs/2410.08102 (2024)
- [i26] Junyan Ye, Baichuan Zhou, Zilong Huang, Junan Zhang, Tianyi Bai, Hengrui Kang, Jun He, Honglin Lin, Zihao Wang, Tong Wu, Zhizheng Wu, Yiping Chen, Dahua Lin, Conghui He, Weijia Li:
  LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models. CoRR abs/2410.09732 (2024)
- [i25] Zhiyuan Zhao, Hengrui Kang, Bin Wang, Conghui He:
  DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception. CoRR abs/2410.12628 (2024)
- [i24] Long Xing, Qidong Huang, Xiaoyi Dong, Jiajie Lu, Pan Zhang, Yuhang Zang, Yuhang Cao, Conghui He, Jiaqi Wang, Feng Wu, Dahua Lin:
  PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction. CoRR abs/2410.17247 (2024)
- [i23] Ziyu Liu, Yuhang Zang, Xiaoyi Dong, Pan Zhang, Yuhang Cao, Haodong Duan, Conghui He, Yuanjun Xiong, Dahua Lin, Jiaqi Wang:
  MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large Vision-Language Models. CoRR abs/2410.17637 (2024)
- [i22] Qintong Zhang, Victor Shea-Jay Huang, Bin Wang, Junyuan Zhang, Zhengren Wang, Hao Liang, Shawn Wang, Matthieu Lin, Conghui He, Wentao Zhang:
  Document Parsing Unveiled: Techniques, Challenges, and Prospects for Structured Information Extraction. CoRR abs/2410.21169 (2024)
- 2023
- [c19] Yiqi Lin, Huabin Zheng, Huaping Zhong, Jinjing Zhu, Weijia Li, Conghui He, Lin Wang:
  SEPT: Towards Scalable and Efficient Visual Pre-training. AAAI 2023: 1622-1630
- [c18] Weijia Li, Yawen Lai, Linning Xu, Yuanbo Xiangli, Jinhua Yu, Conghui He, Gui-Song Xia, Dahua Lin:
  OmniCity: Omnipotent City Understanding with Multi-Level and Multi-View Images. CVPR 2023: 17397-17407
- [c17] Xiaosong Jia, Penghao Wu, Li Chen, Jiangwei Xie, Conghui He, Junchi Yan, Hongyang Li:
  Think Twice before Driving: Towards Scalable Decoders for End-to-End Autonomous Driving. CVPR 2023: 21983-21994
- [c16] Jiaqi Wang, Pan Zhang, Tao Chu, Yuhang Cao, Yujie Zhou, Tong Wu, Bin Wang, Conghui He, Dahua Lin:
  V3Det: Vast Vocabulary Visual Detection Dataset. ICCV 2023: 19787-19797
- [i21] Jiaqi Wang, Pan Zhang, Tao Chu, Yuhang Cao, Yujie Zhou, Tong Wu, Bin Wang, Conghui He, Dahua Lin:
  V3Det: Vast Vocabulary Visual Detection Dataset. CoRR abs/2304.03752 (2023)
- [i20] Peng Gao, Jiaming Han, Renrui Zhang, Ziyi Lin, Shijie Geng, Aojun Zhou, Wei Zhang, Pan Lu, Conghui He, Xiangyu Yue, Hongsheng Li, Yu Qiao:
  LLaMA-Adapter V2: Parameter-Efficient Visual Instruction Model. CoRR abs/2304.15010 (2023)
- [i19] Xiaosong Jia, Penghao Wu, Li Chen, Jiangwei Xie, Conghui He, Junchi Yan, Hongyang Li:
  Think Twice before Driving: Towards Scalable Decoders for End-to-End Autonomous Driving. CoRR abs/2305.06242 (2023)
- [i18] Yuan Liu, Haodong Duan, Yuanhan Zhang, Bo Li, Songyang Zhang, Wangbo Zhao, Yike Yuan, Jiaqi Wang, Conghui He, Ziwei Liu, Kai Chen, Dahua Lin:
  MMBench: Is Your Multi-modal Model an All-around Player? CoRR abs/2307.06281 (2023)
- [i17] Conghui He, Zhenjiang Jin, Chao Xu, Jiantao Qiu, Bin Wang, Wei Li, Hang Yan, Jiaqi Wang, Dahua Lin:
  WanJuan: A Comprehensive Multimodal Dataset for Advancing English and Chinese Large Models. CoRR abs/2308.10755 (2023)
- [i16] Bin Wang, Fan Wu, Xiao Han, Jiahui Peng, Huaping Zhong, Pan Zhang, Xiaoyi Dong, Weijia Li, Wei Li, Jiaqi Wang, Conghui He:
  VIGC: Visual Instruction Generation and Correction. CoRR abs/2308.12714 (2023)
- [i15] Zhiyuan Zhao, Linke Ouyang, Bin Wang, Siyuan Huang, Pan Zhang, Xiaoyi Dong, Jiaqi Wang, Conghui He:
  MLLM-DataEngine: An Iterative Refinement Approach for MLLM. CoRR abs/2308.13566 (2023)
- [i14] Yidong Liu, Fukai Shang, Fang Wang, Rui Xu, Jun Wang, Wei Li, Yao Li, Conghui He:
  MiChao-HuaFen 1.0: A Specialized Pre-trained Corpus Dataset for Domain-specific Large Models. CoRR abs/2309.13079 (2023)
- [i13] Pan Zhang, Xiaoyi Dong, Bin Wang, Yuhang Cao, Chao Xu, Linke Ouyang, Zhiyuan Zhao, Shuangrui Ding, Songyang Zhang, Haodong Duan, Wenwei Zhang, Hang Yan, Xinyue Zhang, Wei Li, Jingwen Li, Kai Chen, Conghui He, Xingcheng Zhang, Yu Qiao, Dahua Lin, Jiaqi Wang:
  InternLM-XComposer: A Vision-Language Large Model for Advanced Text-image Comprehension and Composition. CoRR abs/2309.15112 (2023)
- [i12] Lin Chen, Jinsong Li, Xiaoyi Dong, Pan Zhang, Conghui He, Jiaqi Wang, Feng Zhao, Dahua Lin:
  ShareGPT4V: Improving Large Multi-Modal Models with Better Captions. CoRR abs/2311.12793 (2023)
- [i11] Zhiyuan Zhao, Bin Wang, Linke Ouyang, Xiaoyi Dong, Jiaqi Wang, Conghui He:
  Beyond Hallucinations: Enhancing LVLMs through Hallucination-Aware Direct Preference Optimization. CoRR abs/2311.16839 (2023)
- [i10] Qidong Huang, Xiaoyi Dong, Pan Zhang, Bin Wang, Conghui He, Jiaqi Wang, Dahua Lin, Weiming Zhang, Nenghai Yu:
  OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allocation. CoRR abs/2311.17911 (2023)
- [i9] Yiqi Lin, Conghui He, Alex Jinpeng Wang, Bin Wang, Weijia Li, Mike Zheng Shou:
  Parrot Captions Teach CLIP to Spot Text. CoRR abs/2312.14232 (2023)
- 2022
- [c15] Li Chen, Chonghao Sima, Yang Li, Zehan Zheng, Jiajie Xu, Xiangwei Geng, Hongyang Li, Conghui He, Jianping Shi, Yu Qiao, Junchi Yan:
  PersFormer: 3D Lane Detection via Perspective Transformer and the OpenLane Benchmark. ECCV (38) 2022: 550-567
- [i8] Li Chen, Chonghao Sima, Yang Li, Zehan Zheng, Jiajie Xu, Xiangwei Geng, Hongyang Li, Conghui He, Jianping Shi, Yu Qiao, Junchi Yan:
  PersFormer: 3D Lane Detection via Perspective Transformer and the OpenLane Benchmark. CoRR abs/2203.11089 (2022)
- [i7] Stephen D. H. Yang, Bin Wang, Weijia Li, YiQi Lin, Conghui He:
  Unified Interactive Image Matting. CoRR abs/2205.08324 (2022)
- [i6] Weijia Li, Yawen Lai, Linning Xu, Yuanbo Xiangli, Jinhua Yu, Conghui He, Gui-Song Xia, Dahua Lin:
  OmniCity: Omnipotent City Understanding with Multi-level and Multi-view Images. CoRR abs/2208.00928 (2022)
- [i5] Yiqi Lin, Huabin Zheng, Huaping Zhong, Jinjing Zhu, Weijia Li, Conghui He, Lin Wang:
  SEPT: Towards Scalable and Efficient Visual Pre-Training. CoRR abs/2212.05473 (2022)
- 2021
- [c14] Weijia Li, Wenqian Zhao, Huaping Zhong, Conghui He, Dahua Lin:
  Joint Semantic-geometric Learning for Polygonal Building Segmentation. AAAI 2021: 1958-1965
- [c13] Zhuoming Liu, Hao Ding, Huaping Zhong, Weijia Li, Jifeng Dai, Conghui He:
  Influence Selection for Active Learning. ICCV 2021: 9254-9263
- [c12] Weijia Li, Lingxuan Meng, Jinwang Wang, Conghui He, Gui-Song Xia, Dahua Lin:
  3D Building Reconstruction from Monocular Remote Sensing Images. ICCV 2021: 12528-12537
- [i4] Zhuoming Liu, Hao Ding, Huaping Zhong, Weijia Li, Jifeng Dai, Conghui He:
  Influence Selection for Active Learning. CoRR abs/2108.09331 (2021)
- [i3] Jing Shao, Siyu Chen, Yangguang Li, Kun Wang, Zhenfei Yin, Yinan He, Jianing Teng, Qinghong Sun, Mengya Gao, Jihao Liu, Gengshi Huang, Guanglu Song, Yichao Wu, Yuming Huang, Fenggang Liu, Huan Peng, Shuo Qin, Chengyu Wang, Yujie Wang, Conghui He, Ding Liang, Yu Liu, Fengwei Yu, Junjie Yan, Dahua Lin, Xiaogang Wang, Yu Qiao:
  INTERN: A New Learning Paradigm Towards General Vision. CoRR abs/2111.08687 (2021)
- 2020
- [c11] Tai Wang, Conghui He, Zhe Wang, Jianping Shi, Dahua Lin:
  FLAVA: Find, Localize, Adjust and Verify to Annotate LiDAR-based Point Clouds. UIST (Adjunct Volume) 2020: 31-33
- [i2] Tai Wang, Conghui He, Zhe Wang, Jianping Shi, Dahua Lin:
  FLAVA: Find, Localize, Adjust and Verify to Annotate LiDAR-Based Point Clouds. CoRR abs/2011.10174 (2020)
2010 – 2019
- 2019
- [j6] Weijia Li, Conghui He, Jiarui Fang, Juepeng Zheng, Haohuan Fu, Le Yu:
  Semantic Segmentation-Based Building Footprint Extraction Using Very High-Resolution Satellite Images and Multi-Source GIS Data. Remote. Sens. 11(4): 403 (2019)
- [j5] Weijia Li, Conghui He, Haohuan Fu, Juepeng Zheng, Runmin Dong, Maocai Xia, Le Yu, Wayne Luk:
  A Real-Time Tree Crown Detection Approach for Large-Scale Remote Sensing Images on FPGAs. Remote. Sens. 11(9): 1025 (2019)
- [j4] Jingheng Xu, Guangwen Yang, Haohuan Fu, Wayne Luk, Lin Gan, Wen Shi, Wei Xue, Chao Yang, Yong Jiang, Conghui He:
  Optimizing Finite Volume Method Solvers on Nvidia GPUs. IEEE Trans. Parallel Distributed Syst. 30(12): 2790-2805 (2019)
- [c10] Conghui He, Shijie Sun, Benli Li, Xiaogang Tu, Donghai Yu:
  Finding Mutual X at WeChat-Scale Social Network in Ten Minitues. IEEE BigData 2019: 288-297
- [i1] Jiarui Fang, Liandeng Li, Haohuan Fu, Jinlei Jiang, Wenlai Zhao, Conghui He, Xin You, Guangwen Yang:
  swCaffe: a Parallel Framework for Accelerating Deep Learning Applications on Sunway TaihuLight. CoRR abs/1903.06934 (2019)
- 2018
- [c9] Liandeng Li, Jiarui Fang, Haohuan Fu, Jinlei Jiang, Wenlai Zhao, Conghui He, Xin You, Guangwen Yang:
  swCaffe: A Parallel Framework for Accelerating Deep Learning Applications on Sunway TaihuLight. CLUSTER 2018: 413-422
- [c8] Weijia Li, Conghui He, Jiarui Fang, Haohuan Fu:
  Semantic Segmentation Based Building Extraction Method Using Multi-Source GIS Map Datasets and Satellite Imagery. CVPR Workshops 2018: 238-241
- [c7] Bingwei Chen, Haohuan Fu, Yanwen Wei, Conghui He, Wenqiang Zhang, Yuxuan Li, Wubin Wan, Wei Zhang, Lin Gan, Wei Zhang, Zhenguo Zhang, Guangwen Yang, Xiaofei Chen:
  Simulating the Wenchuan earthquake with accurate surface topography on Sunway TaihuLight. SC 2018: 40:1-40:12
- 2017
- [j3] Conghui He, Haohuan Fu, Ce Guo, Wayne Luk, Guangwen Yang:
  A Fully-Pipelined Hardware Design for Gaussian Mixture Models. IEEE Trans. Computers 66(11): 1837-1850 (2017)
- [c6] Haohuan Fu, Conghui He, Wayne Luk, Weijia Li, Guangwen Yang:
  A Nanosecond-Level Hybrid Table Design for Financial Market Data Generators. FCCM 2017: 227-234
- [c5] Haohuan Fu, Conghui He, Huabin Ruan, Itay Greenspon, Wayne Luk, Yongkang Zheng, Junfeng Liao, Qing Zhang, Guangwen Yang:
  Accelerating Financial Market Server through Hybrid List Design (Abstract Only). FPGA 2017: 289-290
- [c4] Conghui He, Haohuan Fu, Wayne Luk, Weijia Li, Guangen Yang:
  Exploring the potential of reconfigurable platforms for order book update. FPL 2017: 1-8
- [c3] Weijia Li, Conghui He, Haohuan Fu, Wayne Luk:
  An FPGA-based tree crown detection approach for remote sensing images. FPT 2017: 231-234
- [c2] Haohuan Fu, Conghui He, Bingwei Chen, Zekun Yin, Zhenguo Zhang, Wenqiang Zhang, Tingjian Zhang, Wei Xue, Weiguo Liu, Wanwang Yin, Guangwen Yang, Xiaofei Chen:
  18.9-Pflops nonlinear earthquake simulation on Sunway TaihuLight: enabling depiction of 18-Hz and 8-meter scenarios. SC 2017: 2
- 2016
- [j2] Yushu Chen, Guangwen Yang, Xiao Ma, Conghui He, Guojie Song:
  A time-space domain stereo finite difference method for 3D scalar wave propagation. Comput. Geosci. 96: 218-235 (2016)
- [c1] Haohuan Fu, Junfeng Liao, Wei Xue, Lanning Wang, Dexun Chen, Long Gu, Jinxiu Xu, Nan Ding, Xinliang Wang, Conghui He, Shizhen Xu, Yishuang Liang, Jiarui Fang, Yuanchao Xu, Weijie Zheng, Jingheng Xu, Zhen Zheng, Wanjing Wei, Xu Ji, He Zhang, Bingwei Chen, Kaiwei Li, Xiaomeng Huang, Wenguang Chen, Guangwen Yang:
  Refactoring and optimizing the community atmosphere model (CAM) on the sunway taihulight supercomputer. SC 2016: 969-980
- 2014
- [j1] Nicholas Clinton, Le Yu, Haohuan Fu, Conghui He, Peng Gong:
  Global-Scale Associations of Vegetation Phenology with Rainfall and Temperature at a High Spatio-Temporal Resolution. Remote. Sens. 6(8): 7320-7338 (2014)
last updated on 2024-12-11 21:42 CET by the dblp team
all metadata released as open data under CC0 1.0 license