Hang Xu 0004
Person information
- affiliation: Huawei Noah's Ark Lab, Shanghai, China
- affiliation (PhD 2018): Hong Kong University
Other persons with the same name
- Hang Xu — disambiguation page
- Hang Xu 0001 — Nanyang Technological University, Singapore
- Hang Xu 0002 — Taiyuan University of Technology, Taiyuan, China
- Hang Xu 0003 — Putian University, School of Information Engineering, China (and 2 more)
2020 – today
- 2025
- [j7]Zhen Xing, Qijun Feng, Haoran Chen, Qi Dai, Han Hu, Hang Xu, Zuxuan Wu, Yu-Gang Jiang:
A Survey on Video Diffusion Models. ACM Comput. Surv. 57(2): 41:1-41:42 (2025)
- 2024
- [j6]Jian Ding, Enze Xie, Hang Xu, Chenhan Jiang, Zhenguo Li, Ping Luo, Gui-Song Xia:
Deeply Unsupervised Patch Re-Identification for Pre-Training Object Detectors. IEEE Trans. Pattern Anal. Mach. Intell. 46(3): 1348-1361 (2024) - [j5]Bingqian Lin, Yunshuang Nie, Ziming Wei, Yi Zhu, Hang Xu, Shikui Ma, Jianzhuang Liu, Xiaodan Liang:
Correctable Landmark Discovery via Large Models for Vision-Language Navigation. IEEE Trans. Pattern Anal. Mach. Intell. 46(12): 8534-8548 (2024) - [j4]Yanxin Long, Jianhua Han, Runhui Huang, Hang Xu, Yi Zhu, Chunjing Xu, Xiaodan Liang:
Fine-Grained Visual-Text Prompt-Driven Self-Training for Open-Vocabulary Object Detection. IEEE Trans. Neural Networks Learn. Syst. 35(11): 16277-16287 (2024) - [c102]Renyuan Peng, Xinyue Cai, Hang Xu, Jiachen Lu, Feng Wen, Wei Zhang, Li Zhang:
LaneGraph2Seq: Lane Topology Extraction with Language Model via Vertex-Edge Encoding and Connectivity Enhancement. AAAI 2024: 4497-4505 - [c101]Qingping Zheng, Yuanfan Guo, Jiankang Deng, Jianhua Han, Ying Li, Songcen Xu, Hang Xu:
Any-Size-Diffusion: Toward Efficient Text-Driven Synthesis for Any-Size HD Images. AAAI 2024: 7571-7578 - [c100]Xiwen Liang, Liang Ma, Shanshan Guo, Jianhua Han, Hang Xu, Shikui Ma, Xiaodan Liang:
CorNav: Autonomous Agent with Self-Corrected Planning for Zero-Shot Vision-and-Language Navigation. ACL (Findings) 2024: 12538-12559 - [c99]Tianyu Huang, Yihan Zeng, Zhilu Zhang, Wan Xu, Hang Xu, Songcen Xu, Rynson W. H. Lau, Wangmeng Zuo:
DreamControl: Control-Based Text-to-3D Generation with 3D Self-Prior. CVPR 2024: 5364-5373 - [c98]Lewei Yao, Renjie Pi, Jianhua Han, Xiaodan Liang, Hang Xu, Wei Zhang, Zhenguo Li, Dan Xu:
DetCLIPv3: Towards Versatile Generative Open-Vocabulary Object Detection. CVPR 2024: 5610-5619 - [c97]Xinpeng Ding, Jianhua Han, Hang Xu, Xiaodan Liang, Wei Zhang, Xiaomeng Li:
Holistic Autonomous Driving Understanding by Bird's-Eye-View Injected Multi-Modal Large Models. CVPR 2024: 13668-13677 - [c96]Qingping Zheng, Ling Zheng, Yuanfan Guo, Ying Li, Songcen Xu, Jiankang Deng, Hang Xu:
Self-Adaptive Reality-Guided Diffusion for Artifact-Free Super-Resolution. CVPR 2024: 25806-25816 - [c95]Runhui Huang, Kaixin Cai, Jianhua Han, Xiaodan Liang, Renjing Pei, Guansong Lu, Songcen Xu, Wei Zhang, Hang Xu:
LayerDiff: Exploring Text-Guided Multi-layered Composable Image Synthesis via Layer-Collaborative Diffusion Model. ECCV (76) 2024: 144-160 - [c94]Guansong Lu, Yuanfan Guo, Jianhua Han, Minzhe Niu, Yihan Zeng, Songcen Xu, Zeyi Huang, Zhao Zhong, Wei Zhang, Hang Xu:
PanGu-Draw: Advancing Resource-Efficient Text-to-Image Synthesis with Time-Decoupled Training and Reusable Coop-Diffusion. ECCV (45) 2024: 159-176 - [c93]Guian Fang, Wenbiao Yan, Yuanfan Guo, Jianhua Han, Zutao Jiang, Hang Xu, Shengcai Liao, Xiaodan Liang:
HumanRefiner: Benchmarking Abnormal Human Generation and Refining with Coarse-to-Fine Pose-Reversible Guidance. ECCV (32) 2024: 201-217 - [c92]Haoyu Zhao, Tianyi Lu, Jiaxi Gu, Xing Zhang, Qingping Zheng, Zuxuan Wu, Hang Xu, Yu-Gang Jiang:
MagDiff: Multi-alignment Diffusion for High-Fidelity Video Generation and Editing. ECCV (18) 2024: 205-221 - [c91]Ming Nie, Renyuan Peng, Chunwei Wang, Xinyue Cai, Jianhua Han, Hang Xu, Li Zhang:
Reason2Drive: Towards Interpretable and Chain-Based Reasoning for Autonomous Driving. ECCV (26) 2024: 292-308 - [c90]Yunhao Gou, Kai Chen, Zhili Liu, Lanqing Hong, Hang Xu, Zhenguo Li, Dit-Yan Yeung, James T. Kwok, Yu Zhang:
Eyes Closed, Safety on: Protecting Multimodal LLMs via Image-to-Text Transformation. ECCV (17) 2024: 388-404 - [c89]Chenhan Jiang, Yihan Zeng, Tianyang Hu, Songcun Xu, Wei Zhang, Hang Xu, Dit-Yan Yeung:
JointDreamer: Ensuring Geometry Consistency and Text Congruence in Text-to-3D Generation via Joint Score Distillation. ECCV (26) 2024: 439-456 - [c88]Zhili Liu, Kai Chen, Yifan Zhang, Jianhua Han, Lanqing Hong, Hang Xu, Zhenguo Li, Dit-Yan Yeung, James T. Kwok:
Implicit Concept Removal of Diffusion Models. ECCV (21) 2024: 457-473 - [c87]Kai Chen, Chunwei Wang, Kuo Yang, Jianhua Han, Lanqing Hong, Fei Mi, Hang Xu, Zhengying Liu, Wenyong Huang, Zhenguo Li, Dit-Yan Yeung, Lifeng Shang:
Gaining Wisdom from Setbacks: Aligning Large Language Models via Mistake Analysis. ICLR 2024 - [c86]Tianyu Huang, Yihan Zeng, Bowen Dong, Hang Xu, Songcen Xu, Rynson W. H. Lau, Wangmeng Zuo:
TextField3D: Towards Enhancing Open-Vocabulary 3D Generation with Noisy Text Fields. ICLR 2024 - [c85]Renjie Pi, Lewei Yao, Jianhua Han, Xiaodan Liang, Wei Zhang, Hang Xu:
Ins-DetCLIP: Aligning Detection Model to Follow Human-Language Instruction. ICLR 2024 - [c84]Tianyi Lu, Xing Zhang, Jiaxi Gu, Renjing Pei, Songcen Xu, Xingjun Ma, Hang Xu, Zuxuan Wu:
Fuse Your Latents: Video Editing with Multi-source Latent Diffusion Models. ACM Multimedia 2024: 6745-6754 - [i118]Xinpeng Ding, Jianhua Han, Hang Xu, Xiaodan Liang, Wei Zhang, Xiaomeng Li:
Holistic Autonomous Driving Understanding by Bird's-Eye-View Injected Multi-Modal Large Models. CoRR abs/2401.00988 (2024) - [i117]Renyuan Peng, Xinyue Cai, Hang Xu, Jiachen Lu, Feng Wen, Wei Zhang, Li Zhang:
LaneGraph2Seq: Lane Topology Extraction with Language Model via Vertex-Edge Encoding and Connectivity Enhancement. CoRR abs/2401.17609 (2024) - [i116]Zhili Liu, Kai Chen, Jianhua Han, Lanqing Hong, Hang Xu, Zhenguo Li, James T. Kwok:
Task-customized Masked AutoEncoder via Mixture of Cluster-conditional Experts. CoRR abs/2402.05382 (2024) - [i115]Jiachen Lu, Renyuan Peng, Xinyue Cai, Hang Xu, Hongyang Li, Feng Wen, Wei Zhang, Li Zhang:
Translating Images to Road Network: A Non-Autoregressive Sequence-to-Sequence Approach. CoRR abs/2402.08207 (2024) - [i114]Yulong Liu, Yunlong Yuan, Chunwei Wang, Jianhua Han, Yongqiang Ma, Li Zhang, Nanning Zheng, Hang Xu:
From Summary to Action: Enhancing Large Language Models for Complex Tasks with Open World APIs. CoRR abs/2402.18157 (2024) - [i113]Bingqian Lin, Yunshuang Nie, Ziming Wei, Jiaqi Chen, Shikui Ma, Jianhua Han, Hang Xu, Xiaojun Chang, Xiaodan Liang:
NavCoT: Boosting LLM-Based Vision-and-Language Navigation via Learning Disentangled Reasoning. CoRR abs/2403.07376 (2024) - [i112]Yunhao Gou, Kai Chen, Zhili Liu, Lanqing Hong, Hang Xu, Zhenguo Li, Dit-Yan Yeung, James T. Kwok, Yu Zhang:
Eyes Closed, Safety On: Protecting Multimodal LLMs via Image-to-Text Transformation. CoRR abs/2403.09572 (2024) - [i111]Haochen Jiang, Yueming Xu, Yihan Zeng, Hang Xu, Wei Zhang, Jianfeng Feng, Li Zhang:
OpenOcc: Open Vocabulary 3D Scene Reconstruction via Occupancy Representation. CoRR abs/2403.11796 (2024) - [i110]Runhui Huang, Kaixin Cai, Jianhua Han, Xiaodan Liang, Renjing Pei, Guansong Lu, Songcen Xu, Wei Zhang, Hang Xu:
LayerDiff: Exploring Text-guided Multi-layered Composable Image Synthesis via Layer-Collaborative Diffusion Model. CoRR abs/2403.11929 (2024) - [i109]Qingping Zheng, Ling Zheng, Yuanfan Guo, Ying Li, Songcen Xu, Jiankang Deng, Hang Xu:
Self-Adaptive Reality-Guided Diffusion for Artifact-Free Super-Resolution. CoRR abs/2403.16643 (2024) - [i108]Lewei Yao, Renjie Pi, Jianhua Han, Xiaodan Liang, Hang Xu, Wei Zhang, Zhenguo Li, Dan Xu:
DetCLIPv3: Towards Versatile Generative Open-vocabulary Object Detection. CoRR abs/2404.09216 (2024) - [i107]Ming Nie, Xinyue Cai, Hang Xu, Li Zhang:
LaneCorrect: Self-supervised Lane Detection. CoRR abs/2404.14671 (2024) - [i106]Bingqian Lin, Yunshuang Nie, Ziming Wei, Yi Zhu, Hang Xu, Shikui Ma, Jianzhuang Liu, Xiaodan Liang:
Correctable Landmark Discovery via Large Models for Vision-Language Navigation. CoRR abs/2405.18721 (2024) - [i105]Yang Cao, Yihan Zeng, Hang Xu, Dan Xu:
Collaborative Novel Object Discovery and Box-Guided Cross-Modal Alignment for Open-Vocabulary 3D Object Detection. CoRR abs/2406.00830 (2024) - [i104]Xing Zhang, Jiaxi Gu, Haoyu Zhao, Shicong Wang, Hang Xu, Renjing Pei, Songcen Xu, Zuxuan Wu, Yu-Gang Jiang:
AutoTVG: A New Vision-language Pre-training Paradigm for Temporal Video Grounding. CoRR abs/2406.07091 (2024) - [i103]Guian Fang, Wenbiao Yan, Yuanfan Guo, Jianhua Han, Zutao Jiang, Hang Xu, Shengcai Liao, Xiaodan Liang:
HumanRefiner: Benchmarking Abnormal Human Generation and Refining with Coarse-to-fine Pose-Reversible Guidance. CoRR abs/2407.06937 (2024) - [i102]Runhui Huang, Xinpeng Ding, Chunwei Wang, Jianhua Han, Yulong Liu, Hengshuang Zhao, Hang Xu, Lu Hou, Wei Zhang, Xiaodan Liang:
HiRes-LLaVA: Restoring Fragmentation Input in High-Resolution Large Vision-Language Models. CoRR abs/2407.08706 (2024) - [i101]Chenhan Jiang, Yihan Zeng, Tianyang Hu, Songcun Xu, Wei Zhang, Hang Xu, Dit-Yan Yeung:
JointDreamer: Ensuring Geometry Consistency and Text Congruence in Text-to-3D Generation via Joint Score Distillation. CoRR abs/2407.12291 (2024) - [i100]Cong Wang, Jiaxi Gu, Panwen Hu, Haoyu Zhao, Yuanfan Guo, Jianhua Han, Hang Xu, Xiaodan Liang:
EasyControl: Transfer ControlNet to Video Diffusion for Controllable Generation and Interpolation. CoRR abs/2408.13005 (2024) - [i99]Yi Zhu, Yanpeng Zhou, Chunwei Wang, Yang Cao, Jianhua Han, Lu Hou, Hang Xu:
UNIT: Unifying Image and Text Recognition in One Vision Encoder. CoRR abs/2409.04095 (2024) - [i98]Kaidong Zhang, Pengzhen Ren, Bingqian Lin, Junfan Lin, Shikui Ma, Hang Xu, Xiaodan Liang:
PIVOT-R: Primitive-Driven Waypoint-Aware World Model for Robotic Manipulation. CoRR abs/2410.10394 (2024)
- 2023
- [j3]Dapeng Feng, Songfang Han, Hang Xu, Xiaodan Liang, Xiaojun Tan:
Point-Guided Contrastive Learning for Monocular 3-D Object Detection. IEEE Trans. Cybern. 53(2): 954-966 (2023) - [c83]Runhui Huang, Yanxin Long, Jianhua Han, Hang Xu, Xiwen Liang, Chunjing Xu, Xiaodan Liang:
NLIP: Noise-Robust Language-Image Pre-training. AAAI 2023: 926-934 - [c82]Zutao Jiang, Guansong Lu, Xiaodan Liang, Jihua Zhu, Wei Zhang, Xiaojun Chang, Hang Xu:
3D-TOGO: Towards Text-Guided Cross-Category 3D Object Generation. AAAI 2023: 1051-1059 - [c81]Benjin Zhu, Zhe Wang, Shaoshuai Shi, Hang Xu, Lanqing Hong, Hongsheng Li:
ConQueR: Query Contrast Voxel-DETR for 3D Object Detection. CVPR 2023: 9296-9305 - [c80]Xiwen Liang, Minzhe Niu, Jianhua Han, Hang Xu, Chunjing Xu, Xiaodan Liang:
Visual Exemplar Driven Task-Prompting for Unified Perception in Autonomous Driving. CVPR 2023: 9611-9621 - [c79]Yanxin Long, Youpeng Wen, Jianhua Han, Hang Xu, Pengzhen Ren, Wei Zhang, Shen Zhao, Xiaodan Liang:
CapDet: Unifying Dense Captioning and Open-World Detection Pretraining. CVPR 2023: 15233-15243 - [c78]Yihan Zeng, Chenhan Jiang, Jiageng Mao, Jianhua Han, Chaoqiang Ye, Qingqiu Huang, Dit-Yan Yeung, Zhen Yang, Xiaodan Liang, Hang Xu:
CLIP2: Contrastive Language-Image-Point Pretraining from Real-World Point Cloud Data. CVPR 2023: 15244-15253 - [c77]Kai Chen, Zhili Liu, Lanqing Hong, Hang Xu, Zhenguo Li, Dit-Yan Yeung:
Mixed Autoencoder for Self-Supervised Visual Representation Learning. CVPR 2023: 22742-22751 - [c76]Lewei Yao, Jianhua Han, Xiaodan Liang, Dan Xu, Wei Zhang, Zhenguo Li, Hang Xu:
DetCLIPv2: Scalable Open-Vocabulary Object Detection Pre-training via Word-Region Alignment. CVPR 2023: 23497-23506 - [c75]Renjie Pi, Jiahui Gao, Shizhe Diao, Rui Pan, Hanze Dong, Jipeng Zhang, Lewei Yao, Jianhua Han, Hang Xu, Lingpeng Kong, Tong Zhang:
DetGPT: Detect What You Need via Reasoning. EMNLP 2023: 14172-14189 - [c74]Jiachen Lu, Hongyang Li, Renyuan Peng, Feng Wen, Xinyue Cai, Wei Zhang, Hang Xu, Li Zhang:
Translating Images to Road Network: A Non-Autoregressive Sequence-to-Sequence Approach. ICCV 2023: 23-33 - [c73]Kaixin Cai, Pengzhen Ren, Yi Zhu, Hang Xu, Jianzhuang Liu, Changlin Li, Guangrun Wang, Xiaodan Liang:
MixReorg: Cross-Modal Mixed Patch Reorganization is a Good Mask Learner for Open-World Semantic Segmentation. ICCV 2023: 1196-1205 - [c72]Zhijian Huang, Sihao Lin, Guiyu Liu, Mukun Luo, Chaoqiang Ye, Hang Xu, Xiaojun Chang, Xiaodan Liang:
FULLER: Unified Multi-modality Multi-task 3D Perception via Multi-level Gradient Calibration. ICCV 2023: 3479-3488 - [c71]Ming Nie, Yujing Xue, Chunwei Wang, Chaoqiang Ye, Hang Xu, Xinge Zhu, Qingqiu Huang, Michael Bi Mi, Xinchao Wang, Li Zhang:
PARTNER: Level up the Polar Representation for LiDAR 3D Object Detection. ICCV 2023: 3778-3790 - [c70]Peiyan Guan, Renjing Pei, Bin Shao, Jianzhuang Liu, Weimian Li, Jiaxi Gu, Hang Xu, Songcen Xu, Youliang Yan, Edmund Y. Lam:
PIDRo: Parallel Isomeric Attention with Dynamic Routing for Text-Video Retrieval. ICCV 2023: 11130-11139 - [c69]Cuican Yu, Guansong Lu, Yihan Zeng, Jian Sun, Xiaodan Liang, Huibin Li, Zongben Xu, Songcen Xu, Wei Zhang, Hang Xu:
Towards High-Fidelity Text-Guided 3D Face Generation and Manipulation Using only Images. ICCV 2023: 15280-15291 - [c68]Runhui Huang, Jianhua Han, Guansong Lu, Xiaodan Liang, Yihan Zeng, Wei Zhang, Hang Xu:
DiffDis: Empowering Generative Diffusion Model with Cross-Modal Discrimination Capability. ICCV 2023: 15667-15677 - [c67]Xinchi Deng, Han Shi, Runhui Huang, Changlin Li, Hang Xu, Jianhua Han, James T. Kwok, Shen Zhao, Wei Zhang, Xiaodan Liang:
GrowCLIP: Data-aware Automatic Model Growing for Large-scale Contrastive Language-Image Pre-training. ICCV 2023: 22121-22132 - [c66]Xujie Zhang, Binbin Yang, Michael C. Kampffmeyer, Wenqing Zhang, Shiyue Zhang, Guansong Lu, Liang Lin, Hang Xu, Xiaodan Liang:
DiffCloth: Diffusion Based Garment Synthesis and Manipulation via Structural Cross-modal Semantic Alignment. ICCV 2023: 23097-23106 - [c65]Runjian Chen, Yao Mu, Runsen Xu, Wenqi Shao, Chenhan Jiang, Hang Xu, Yu Qiao, Zhenguo Li, Ping Luo:
CO3: Cooperative Unsupervised 3D Representation Learning for Autonomous Driving. ICLR 2023 - [c64]Jiahui Gao, Renjie Pi, Yong Lin, Hang Xu, Jiacheng Ye, Zhiyong Wu, Weizhong Zhang, Xiaodan Liang, Zhenguo Li, Lingpeng Kong:
Self-Guided Noise-Free Data Generation for Efficient Zero-Shot Learning. ICLR 2023 - [c63]Zhili Liu, Kai Chen, Jianhua Han, Lanqing Hong, Hang Xu, Zhenguo Li, James T. Kwok:
Task-customized Masked Autoencoder via Mixture of Cluster-conditional Experts. ICLR 2023 - [c62]Pengzhen Ren, Changlin Li, Hang Xu, Yi Zhu, Guangrun Wang, Jianzhuang Liu, Xiaojun Chang, Xiaodan Liang:
ViewCo: Discovering Text-Supervised Segmentation Masks via Multi-View Semantic Consistency. ICLR 2023 - [c61]Zheyuan Zhou, Jiachen Lu, Yihan Zeng, Hang Xu, Li Zhang:
SUIT: Learning Significance-Guided Information for 3D Temporal Detection. IROS 2023: 9399-9406 - [c60]Yang Cao, Yihan Zeng, Hang Xu, Dan Xu:
CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for Open-vocabulary 3D Object Detection. NeurIPS 2023 - [c59]Huijie Wang, Tianyu Li, Yang Li, Li Chen, Chonghao Sima, Zhenbo Liu, Bangjun Wang, Peijin Jia, Yuting Wang, Shengyin Jiang, Feng Wen, Hang Xu, Ping Luo, Junchi Yan, Wei Zhang, Hongyang Li:
OpenLane-V2: A Topology Reasoning Benchmark for Unified 3D HD Mapping. NeurIPS 2023 - [i97]Pengzhen Ren, Changlin Li, Hang Xu, Yi Zhu, Guangrun Wang, Jianzhuang Liu, Xiaojun Chang, Xiaodan Liang:
ViewCo: Discovering Text-Supervised Segmentation Masks via Multi-View Semantic Consistency. CoRR abs/2302.10307 (2023) - [i96]Yikai Wang, Jianan Wang, Guansong Lu, Hang Xu, Zhenguo Li, Wei Zhang, Yanwei Fu:
Entity-Level Text-Guided Image Manipulation. CoRR abs/2302.11383 (2023) - [i95]Xiwen Liang, Minzhe Niu, Jianhua Han, Hang Xu, Chunjing Xu, Xiaodan Liang:
Visual Exemplar Driven Task-Prompting for Unified Perception in Autonomous Driving. CoRR abs/2303.01788 (2023) - [i94]Yanxin Long, Youpeng Wen, Jianhua Han, Hang Xu, Pengzhen Ren, Wei Zhang, Shen Zhao, Xiaodan Liang:
CapDet: Unifying Dense Captioning and Open-World Detection Pretraining. CoRR abs/2303.02489 (2023) - [i93]Bowen Dong, Jiaxi Gu, Jianhua Han, Hang Xu, Wangmeng Zuo:
Towards Universal Vision-language Omni-supervised Segmentation. CoRR abs/2303.06547 (2023) - [i92]Yihan Zeng, Chenhan Jiang, Jiageng Mao, Jianhua Han, Chaoqiang Ye, Qingqiu Huang, Dit-Yan Yeung, Zhen Yang, Xiaodan Liang, Hang Xu:
CLIP2: Contrastive Language-Image-Point Pretraining from Real-World Point Cloud Data. CoRR abs/2303.12417 (2023) - [i91]Kai Chen, Zhili Liu, Lanqing Hong, Hang Xu, Zhenguo Li, Dit-Yan Yeung:
Mixed Autoencoder for Self-supervised Visual Representation Learning. CoRR abs/2303.17152 (2023) - [i90]Lewei Yao, Jianhua Han, Xiaodan Liang, Dan Xu, Wei Zhang, Zhenguo Li, Hang Xu:
DetCLIPv2: Scalable Open-Vocabulary Object Detection Pre-training via Word-Region Alignment. CoRR abs/2304.04514 (2023) - [i89]Tianyu Li, Li Chen, Xiangwei Geng, Huijie Wang, Yang Li, Zhenbo Liu, Shengyin Jiang, Yuting Wang, Hang Xu, Chunjing Xu, Feng Wen, Ping Luo, Junchi Yan, Wei Zhang, Xiaogang Wang, Yu Qiao, Hongyang Li:
Topology Reasoning for Driving Scenes. CoRR abs/2304.05277 (2023) - [i88]Huijie Wang, Zhenbo Liu, Yang Li, Tianyu Li, Li Chen, Chonghao Sima, Yuting Wang, Shengyin Jiang, Feng Wen, Hang Xu, Ping Luo, Junchi Yan, Wei Zhang, Jun Yao, Yu Qiao, Hongyang Li:
Road Genome: A Topology Reasoning Benchmark for Scene Understanding in Autonomous Driving. CoRR abs/2304.10440 (2023) - [i87]Bingqian Lin, Zicong Chen, Mingjie Li, Haokun Lin, Hang Xu, Yi Zhu, Jianzhuang Liu, Wenjia Cai, Lei Yang, Shen Zhao, Chenfei Wu, Ling Chen, Xiaojun Chang, Yi Yang, Lei Xing, Xiaodan Liang:
Towards Medical Artificial General Intelligence via Knowledge-Enhanced Multimodal Pretraining. CoRR abs/2304.14204 (2023) - [i86]Haonan Wang, Minbin Huang, Runhui Huang, Lanqing Hong, Hang Xu, Tianyang Hu, Xiaodan Liang, Zhenguo Li:
Boosting Visual-Language Models by Exploiting Hard Samples. CoRR abs/2305.05208 (2023) - [i85]Renjie Pi, Jiahui Gao, Shizhe Diao, Rui Pan, Hanze Dong, Jipeng Zhang, Lewei Yao, Jianhua Han, Hang Xu, Lingpeng Kong, Tong Zhang:
DetGPT: Detect What You Need via Reasoning. CoRR abs/2305.14167 (2023) - [i84]Guian Fang, Zutao Jiang, Jianhua Han, Guansong Lu, Hang Xu, Xiaodan Liang:
Boosting Text-to-Image Diffusion Models with Fine-Grained Semantic Rewards. CoRR abs/2305.19599 (2023) - [i83]Xiwen Liang, Liang Ma, Shanshan Guo, Jianhua Han, Hang Xu, Shikui Ma, Xiaodan Liang:
MO-VLN: A Multi-Task Benchmark for Open-set Zero-Shot Vision-and-Language Navigation. CoRR abs/2306.10322 (2023) - [i82]Zheyuan Zhou, Jiachen Lu, Yihan Zeng, Hang Xu, Li Zhang:
SUIT: Learning Significance-guided Information for 3D Temporal Detection. CoRR abs/2307.01807 (2023) - [i81]Zhijian Huang, Sihao Lin, Guiyu Liu, Mukun Luo, Chaoqiang Ye, Hang Xu, Xiaojun Chang, Xiaodan Liang:
FULLER: Unified Multi-modality Multi-task 3D Perception via Multi-level Gradient Calibration. CoRR abs/2307.16617 (2023) - [i80]Ming Nie, Yujing Xue, Chunwei Wang, Chaoqiang Ye, Hang Xu, Xinge Zhu, Qingqiu Huang, Michael Bi Mi, Xinchao Wang, Li Zhang:
PARTNER: Level up the Polar Representation for LiDAR 3D Object Detection. CoRR abs/2308.03982 (2023) - [i79]Kaixin Cai, Pengzhen Ren, Yi Zhu, Hang Xu, Jianzhuang Liu, Changlin Li, Guangrun Wang, Xiaodan Liang:
MixReorg: Cross-Modal Mixed Patch Reorganization is a Good Mask Learner for Open-World Semantic Segmentation. CoRR abs/2308.04829 (2023) - [i78]Runhui Huang, Jianhua Han, Guansong Lu, Xiaodan Liang, Yihan Zeng, Wei Zhang, Hang Xu:
DiffDis: Empowering Generative Diffusion Model with Cross-Modal Discrimination Capability. CoRR abs/2308.09306 (2023) - [i77]Xujie Zhang, Binbin Yang, Michael C. Kampffmeyer, Wenqing Zhang, Shiyue Zhang, Guansong Lu, Liang Lin, Hang Xu, Xiaodan Liang:
DiffCloth: Diffusion Based Garment Synthesis and Manipulation via Structural Cross-modal Semantic Alignment. CoRR abs/2308.11206 (2023) - [i76]Xinchi Deng, Han Shi, Runhui Huang, Changlin Li, Hang Xu, Jianhua Han, James T. Kwok, Shen Zhao, Wei Zhang, Xiaodan Liang:
GrowCLIP: Data-aware Automatic Model Growing for Large-scale Contrastive Language-Image Pre-training. CoRR abs/2308.11331 (2023) - [i75]Qingping Zheng, Yuanfan Guo, Jiankang Deng, Jianhua Han, Ying Li, Songcen Xu, Hang Xu:
Any-Size-Diffusion: Toward Efficient Text-Driven Synthesis for Any-Size HD Images. CoRR abs/2308.16582 (2023) - [i74]Cuican Yu, Guansong Lu, Yihan Zeng, Jian Sun, Xiaodan Liang, Huibin Li, Zongben Xu, Songcen Xu, Wei Zhang, Hang Xu:
Towards High-Fidelity Text-Guided 3D Face Generation and Manipulation Using only Images. CoRR abs/2308.16758 (2023) - [i73]Jiaxi Gu, Shicong Wang, Haoyu Zhao, Tianyi Lu, Xing Zhang, Zuxuan Wu, Songcen Xu, Wei Zhang, Yu-Gang Jiang, Hang Xu:
Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation. CoRR abs/2309.03549 (2023) - [i72]Xinpeng Ding, Jianhua Han, Hang Xu, Wei Zhang, Xiaomeng Li:
HiLM-D: Towards High-Resolution Understanding in Multimodal Large Language Models for Autonomous Driving. CoRR abs/2309.05186 (2023) - [i71]Tianyu Huang, Yihan Zeng, Bowen Dong, Hang Xu, Songcen Xu, Rynson W. H. Lau, Wangmeng Zuo:
TextField3D: Towards Enhancing Open-Vocabulary 3D Generation with Noisy Text Fields. CoRR abs/2309.17175 (2023) - [i70]Yang Cao, Yihan Zeng, Hang Xu, Dan Xu:
CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for Open-vocabulary 3D Object Detection. CoRR abs/2310.02960 (2023) - [i69]Zhili Liu, Kai Chen, Yifan Zhang, Jianhua Han, Lanqing Hong, Hang Xu, Zhenguo Li, Dit-Yan Yeung, James T. Kwok:
Geom-Erasing: Geometry-Driven Removal of Implicit Concept in Diffusion Models. CoRR abs/2310.05873 (2023) - [i68]Kai Chen, Chunwei Wang, Kuo Yang, Jianhua Han, Lanqing Hong, Fei Mi, Hang Xu, Zhengying Liu, Wenyong Huang, Zhenguo Li, Dit-Yan Yeung, Lifeng Shang, Xin Jiang, Qun Liu:
Gaining Wisdom from Setbacks: Aligning Large Language Models via Mistake Analysis. CoRR abs/2310.10477 (2023) - [i67]Zhen Xing, Qijun Feng, Haoran Chen, Qi Dai, Han Hu, Hang Xu, Zuxuan Wu, Yu-Gang Jiang:
A Survey on Video Diffusion Models. CoRR abs/2310.10647 (2023) - [i66]Tianyi Lu, Xing Zhang, Jiaxi Gu, Hang Xu, Renjing Pei, Songcen Xu, Zuxuan Wu:
Fuse Your Latents: Video Editing with Multi-source Latent Diffusion Models. CoRR abs/2310.16400 (2023) - [i65]Haoyu Zhao, Tianyi Lu, Jiaxi Gu, Xing Zhang, Zuxuan Wu, Hang Xu, Yu-Gang Jiang:
VideoAssembler: Identity-Consistent Video Generation with Reference Entities using Diffusion Model. CoRR abs/2311.17338 (2023) - [i64]Cong Wang, Jiaxi Gu, Panwen Hu, Songcen Xu, Hang Xu, Xiaodan Liang:
DreamVideo: High-Fidelity Image-to-Video Generation with Image Retention and Text Guidance. CoRR abs/2312.03018 (2023) - [i63]Ming Nie, Renyuan Peng, Chunwei Wang, Xinyue Cai, Jianhua Han, Hang Xu, Li Zhang:
Reason2Drive: Towards Interpretable and Chain-based Reasoning for Autonomous Driving. CoRR abs/2312.03661 (2023) - [i62]Tianyu Huang, Yihan Zeng, Zhilu Zhang, Wan Xu, Hang Xu, Songcen Xu, Rynson W. H. Lau, Wangmeng Zuo:
DreamControl: Control-Based Text-to-3D Generation with 3D Self-Prior. CoRR abs/2312.06439 (2023) - [i61]Jiahui Gao, Renjie Pi, Jipeng Zhang, Jiacheng Ye, Wanjun Zhong, Yufei Wang, Lanqing Hong, Jianhua Han, Hang Xu, Zhenguo Li, Lingpeng Kong:
G-LLaVA: Solving Geometric Problem with Multi-Modal Large Language Model. CoRR abs/2312.11370 (2023) - [i60]Yunhao Gou, Zhili Liu, Kai Chen, Lanqing Hong, Hang Xu, Aoxue Li, Dit-Yan Yeung, James T. Kwok, Yu Zhang:
Mixture of Cluster-conditional LoRA Experts for Vision-language Instruction Tuning. CoRR abs/2312.12379 (2023) - [i59]Guansong Lu, Yuanfan Guo, Jianhua Han, Minzhe Niu, Yihan Zeng, Songcen Xu, Zeyi Huang, Zhao Zhong, Wei Zhang, Hang Xu:
PanGu-Draw: Advancing Resource-Efficient Text-to-Image Synthesis with Time-Decoupled Training and Reusable Coop-Diffusion. CoRR abs/2312.16486 (2023)
- 2022
- [c58]Jianhua Han, Xiajun Deng, Xinyue Cai, Zhen Yang, Hang Xu, Chunjing Xu, Xiaodan Liang:
Laneformer: Object-Aware Row-Column Transformers for Lane Detection. AAAI 2022: 799-807 - [c57]Zhili Liu, Jianhua Han, Lanqing Hong, Hang Xu, Kai Chen, Chunjing Xu, Zhenguo Li:
Task-Customized Self-Supervised Pre-training with Scalable Dynamic Routing. AAAI 2022: 1854-1862 - [c56]Jiahui Gao, Hang Xu, Han Shi, Xiaozhe Ren, Philip L. H. Yu, Xiaodan Liang, Xin Jiang, Zhenguo Li:
AutoBERT-Zero: Evolving BERT Backbone from Scratch. AAAI 2022: 10663-10671 - [c55]Xiwen Liang, Fengda Zhu, Lingling Li, Hang Xu, Xiaodan Liang:
Visual-Language Navigation Pretraining via Prompt-based Environmental Self-exploration. ACL (1) 2022: 4837-4851 - [c54]Yujing Xue, Jiageng Mao, Minzhe Niu, Hang Xu, Michael Bi Mi, Wei Zhang, Xiaogang Wang, Xinchao Wang:
Point2Seq: Detecting 3D Objects as Sequences. CVPR 2022: 8511-8520 - [c53]Binbin Yang, Xinchi Deng, Han Shi, Changlin Li, Gengwei Zhang, Hang Xu, Shen Zhao, Liang Lin, Xiaodan Liang:
Continual Object Detection via Prototypical Task Correlation Guided Gating Mechanism. CVPR 2022: 9245-9254 - [c52]Jianan Wang, Guansong Lu, Hang Xu, Zhenguo Li, Chunjing Xu, Yanwei Fu:
ManiTrans: Entity-Level Text-Guided Image Manipulation via Token-wise Semantic Alignment and Generation. CVPR 2022: 10697-10707 - [c51]Minbin Huang, Zhijian Huang, Changlin Li, Xin Chen, Hang Xu, Zhenguo Li, Xiaodan Liang:
Arch-Graph: Acyclic Architecture Relation Predictor for Task-Transferable Neural Architecture Search. CVPR 2022: 11871-11881 - [c50]Fan Yan, Ming Nie, Xinyue Cai, Jianhua Han, Hang Xu, Zhen Yang, Chaoqiang Ye, Yanwei Fu, Michael Bi Mi, Li Zhang:
ONCE-3DLanes: Building Monocular 3D Lane Detection. CVPR 2022: 17122-17131 - [c49]Shipeng Yan, Lanqing Hong, Hang Xu, Jianhua Han, Tinne Tuytelaars, Zhenguo Li, Xuming He:
Generative Negative Text Replay for Continual Vision-Language Pretraining. ECCV (36) 2022: 22-38 - [c48]Kaichen Zhou, Lanqing Hong, Changhao Chen, Hang Xu, Chaoqiang Ye, Qingyong Hu, Zhenguo Li:
DevNet: Self-supervised Monocular Depth Learning via Density Volume Construction. ECCV (39) 2022: 125-142 - [c47]Jiachen Lu, Zheyuan Zhou, Xiatian Zhu, Hang Xu, Li Zhang:
Learning Ego 3D Representation as Ray Tracing. ECCV (26) 2022: 129-144 - [c46]Quande Liu, Youpeng Wen, Jianhua Han, Chunjing Xu, Hang Xu, Xiaodan Liang:
Open-World Semantic Segmentation via Contrasting and Clustering Vision-Language Embedding. ECCV (20) 2022: 275-292 - [c45]Kaican Li, Kai Chen, Haoyu Wang, Lanqing Hong, Chaoqiang Ye, Jianhua Han, Yukuai Chen, Wei Zhang, Chunjing Xu, Dit-Yan Yeung, Xiaodan Liang, Zhenguo Li, Hang Xu:
CODA: A Real-World Road Corner Case Dataset for Object Detection in Autonomous Driving. ECCV (38) 2022: 406-423 - [c44]Shenghua Xu, Xinyue Cai, Bin Zhao, Li Zhang, Hang Xu, Yanwei Fu, Xiangyang Xue:
RCLane: Relay Chain Prediction for Lane Detection. ECCV (38) 2022: 461-477 - [c43]Han Shi, Jiahui Gao, Hang Xu, Xiaodan Liang, Zhenguo Li, Lingpeng Kong, Stephen M. S. Lee, James T. Kwok:
Revisiting Over-smoothing in BERT from the Perspective of Graph. ICLR 2022 - [c42]Lewei Yao, Runhui Huang, Lu Hou, Guansong Lu, Minzhe Niu, Hang Xu, Xiaodan Liang, Zhenguo Li, Xin Jiang, Chunjing Xu:
FILIP: Fine-grained Interactive Language-Image Pre-Training. ICLR 2022 - [c41]Jiaxi Gu, Xiaojun Meng, Guansong Lu, Lu Hou, Niu Minzhe, Xiaodan Liang, Lewei Yao, Runhui Huang, Wei Zhang, Xin Jiang, Chunjing Xu, Hang Xu:
Wukong: A 100 Million Large-scale Chinese Cross-modal Pre-training Benchmark. NeurIPS 2022 - [c40]Xiwen Liang, Yangxin Wu, Jianhua Han, Hang Xu, Chunjing Xu, Xiaodan Liang:
Effective Adaptation in Multi-Task Co-Training for Unified Autonomous Driving. NeurIPS 2022 - [c39]Lewei Yao, Jianhua Han, Youpeng Wen, Xiaodan Liang, Dan Xu, Wei Zhang, Zhenguo Li, Chunjing Xu, Hang Xu:
DetCLIP: Dictionary-Enriched Visual-Concept Paralleled Pre-training for Open-world Detection. NeurIPS 2022 - [i58]Jiaxi Gu, Xiaojun Meng, Guansong Lu, Lu Hou, Minzhe Niu, Hang Xu, Xiaodan Liang, Wei Zhang, Xin Jiang, Chunjing Xu:
Wukong: 100 Million Large-scale Chinese Cross-modal Pre-training Dataset and A Foundation Framework. CoRR abs/2202.06767 (2022) - [i57]Han Shi, Jiahui Gao, Hang Xu, Xiaodan Liang, Zhenguo Li, Lingpeng Kong, Stephen M. S. Lee, James T. Kwok:
Revisiting Over-smoothing in BERT from the Perspective of Graph. CoRR abs/2202.08625 (2022) - [i56]Xiwen Liang, Fengda Zhu, Lingling Li, Hang Xu, Xiaodan Liang:
Visual-Language Navigation Pretraining via Prompt-based Environmental Self-exploration. CoRR abs/2203.04006 (2022) - [i55]Kaican Li, Kai Chen, Haoyu Wang, Lanqing Hong, Chaoqiang Ye, Jianhua Han, Yukuai Chen, Wei Zhang, Chunjing Xu, Dit-Yan Yeung, Xiaodan Liang, Zhenguo Li, Hang Xu:
CODA: A Real-World Road Corner Case Dataset for Object Detection in Autonomous Driving. CoRR abs/2203.07724 (2022) - [i54]Jianhua Han, Xiajun Deng, Xinyue Cai, Zhen Yang, Hang Xu, Chunjing Xu, Xiaodan Liang:
Laneformer: Object-aware Row-Column Transformers for Lane Detection. CoRR abs/2203.09830 (2022) - [i53]Yujing Xue, Jiageng Mao, Minzhe Niu, Hang Xu, Michael Bi Mi, Wei Zhang, Xiaogang Wang, Xinchao Wang:
Point2Seq: Detecting 3D Objects as Sequences. CoRR abs/2203.13394 (2022) - [i52]Jianan Wang, Guansong Lu, Hang Xu, Zhenguo Li, Chunjing Xu, Yanwei Fu:
ManiTrans: Entity-Level Text-Guided Image Manipulation via Token-wise Semantic Alignment and Generation. CoRR abs/2204.04428 (2022) - [i51]Minbin Huang, Zhijian Huang, Changlin Li, Xin Chen, Hang Xu, Zhenguo Li, Xiaodan Liang:
Arch-Graph: Acyclic Architecture Relation Predictor for Task-Transferable Neural Architecture Search. CoRR abs/2204.05941 (2022) - [i50]Fan Yan, Ming Nie, Xinyue Cai, Jianhua Han, Hang Xu, Zhen Yang, Chaoqiang Ye, Yanwei Fu, Michael Bi Mi, Li Zhang:
ONCE-3DLanes: Building Monocular 3D Lane Detection. CoRR abs/2205.00301 (2022) - [i49]Binbin Yang, Xinchi Deng, Han Shi, Changlin Li, Gengwei Zhang, Hang Xu, Shen Zhao, Liang Lin, Xiaodan Liang:
Continual Object Detection via Prototypical Task Correlation Guided Gating Mechanism. CoRR abs/2205.03055 (2022) - [i48]Jiahui Gao, Renjie Pi, Yong Lin, Hang Xu, Jiacheng Ye, Zhiyong Wu, Xiaodan Liang, Zhenguo Li, Lingpeng Kong:
ZeroGen+: Self-Guided High-Quality Data Generation in Efficient Zero-Shot Learning. CoRR abs/2205.12679 (2022) - [i47]Zhili Liu, Jianhua Han, Lanqing Hong, Hang Xu, Kai Chen, Chunjing Xu, Zhenguo Li:
Task-Customized Self-Supervised Pre-training with Scalable Dynamic Routing. CoRR abs/2205.13267 (2022) - [i46]Runjian Chen, Yao Mu, Runsen Xu, Wenqi Shao, Chenhan Jiang, Hang Xu, Zhenguo Li, Ping Luo:
CO^3: Cooperative Unsupervised 3D Representation Learning for Autonomous Driving. CoRR abs/2206.04028 (2022) - [i45]Jiachen Lu, Zheyuan Zhou, Xiatian Zhu, Hang Xu, Li Zhang:
Learning Ego 3D Representation as Ray Tracing. CoRR abs/2206.04042 (2022) - [i44]Jiachen Lu, Li Zhang, Junge Zhang, Xiatian Zhu, Hang Xu, Jianfeng Feng:
Softmax-free Linear Transformers. CoRR abs/2207.03341 (2022) - [i43]Quande Liu, Youpeng Wen, Jianhua Han, Chunjing Xu, Hang Xu, Xiaodan Liang:
Open-world Semantic Segmentation via Contrasting and Clustering Vision-Language Embedding. CoRR abs/2207.08455 (2022) - [i42]Shenghua Xu, Xinyue Cai, Bin Zhao, Li Zhang, Hang Xu, Yanwei Fu, Xiangyang Xue:
RCLane: Relay Chain Prediction for Lane Detection. CoRR abs/2207.09399 (2022) - [i41]Kaichen Zhou, Lanqing Hong, Changhao Chen, Hang Xu, Chaoqiang Ye, Qingyong Hu, Zhenguo Li:
DevNet: Self-supervised Monocular Depth Learning via Density Volume Construction. CoRR abs/2209.06351 (2022) - [i40]Xiwen Liang, Yangxin Wu, Jianhua Han, Hang Xu, Chunjing Xu, Xiaodan Liang:
Effective Adaptation in Multi-Task Co-Training for Unified Autonomous Driving. CoRR abs/2209.08953 (2022) - [i39]Lewei Yao, Jianhua Han, Youpeng Wen, Xiaodan Liang, Dan Xu, Wei Zhang, Zhenguo Li, Chunjing Xu, Hang Xu:
DetCLIP: Dictionary-Enriched Visual-Concept Paralleled Pre-training for Open-world Detection. CoRR abs/2209.09407 (2022) - [i38]Shipeng Yan, Lanqing Hong, Hang Xu, Jianhua Han, Tinne Tuytelaars, Zhenguo Li, Xuming He:
Generative Negative Text Replay for Continual Vision-Language Pretraining. CoRR abs/2210.17322 (2022) - [i37]Zutao Jiang, Guansong Lu, Xiaodan Liang, Jihua Zhu, Wei Zhang, Xiaojun Chang, Hang Xu:
3D-TOGO: Towards Text-Guided Cross-Category 3D Object Generation. CoRR abs/2212.01103 (2022) - [i36]Runhui Huang, Yanxin Long, Jianhua Han, Hang Xu, Xiwen Liang, Chunjing Xu, Xiaodan Liang:
NLIP: Noise-robust Language-Image Pre-training. CoRR abs/2212.07086 (2022) - [i35]Benjin Zhu, Zhe Wang, Shaoshuai Shi, Hang Xu, Lanqing Hong, Hongsheng Li:
ConQueR: Query Contrast Voxel-DETR for 3D Object Detection. CoRR abs/2212.07289 (2022) - 2021
- [c38]Xuefeng Du, Chenhan Jiang, Hang Xu, Gengwei Zhang, Zhenguo Li:
How to Save your Annotation Cost for Panoptic Segmentation? AAAI 2021: 1282-1290 - [c37]Gengwei Zhang, Yiming Gao, Hang Xu, Hao Zhang, Zhenguo Li, Xiaodan Liang:
Ada-Segment: Automated Multi-loss Adaptation for Panoptic Segmentation. AAAI 2021: 3333-3341 - [c36]Fuhui Tang, Chenhan Jiang, Dafeng Wei, Hang Xu, Andi Zhang, Wei Zhang, Hongtao Lu, Chunjing Xu:
Towards Dynamic and Scalable Active Learning with Neural Architecture Adaption for Object Detection. BMVC 2021: 280 - [c35]Xiao Zhou, Weizhong Zhang, Hang Xu, Tong Zhang:
Effective Sparsification of Neural Networks With Global Sparsity Constraint. CVPR 2021: 3599-3608 - [c34]Yawen Duan, Xin Chen, Hang Xu, Zewei Chen, Xiaodan Liang, Tong Zhang, Zhenguo Li:
TransNAS-Bench-101: Improving Transferability and Generalizability of Cross-Task Neural Architecture Search. CVPR 2021: 5251-5260 - [c33]Lewei Yao, Renjie Pi, Hang Xu, Wei Zhang, Zhenguo Li, Tong Zhang:
Joint-DetNAS: Upgrade Your Detector With NAS, Pruning and Dynamic Distillation. CVPR 2021: 10175-10184 - [c32]Chenhe Dong, Guangrun Wang, Hang Xu, Jiefeng Peng, Xiaozhe Ren, Xiaodan Liang:
EfficientBERT: Progressively Searching Multilayer Perceptron via Warm-up Knowledge Distillation. EMNLP (Findings) 2021: 1424-1437 - [c31]Jiageng Mao, Minzhe Niu, Haoyue Bai, Xiaodan Liang, Hang Xu, Chunjing Xu:
Pyramid R-CNN: Towards Better Performance and Adaptability for 3D Object Detection. ICCV 2021: 2703-2712 - [c30]Jiageng Mao, Yujing Xue, Minzhe Niu, Haoyue Bai, Jiashi Feng, Xiaodan Liang, Hang Xu, Chunjing Xu:
Voxel Transformer for 3D Object Detection. ICCV 2021: 3144-3153 - [c29]Hanxue Liang, Chenhan Jiang, Dapeng Feng, Xin Chen, Hang Xu, Xiaodan Liang, Wei Zhang, Zhenguo Li, Luc Van Gool:
Exploring Geometry-aware Contrast and Clustering Harmonization for Self-supervised 3D Object Detection. ICCV 2021: 3273-3282 - [c28]Lewei Yao, Renjie Pi, Hang Xu, Wei Zhang, Zhenguo Li, Tong Zhang:
G-DetKD: Towards General Distillation Framework for Object Detectors via Contrastive and Semantic-guided Feature Imitation. ICCV 2021: 3571-3580 - [c27]Hang Xu, Ning Kang, Gengwei Zhang, Chuanlong Xie, Xiaodan Liang, Zhenguo Li:
NASOA: Towards Faster Task-oriented Online Fine-tuning with a Zoo of Models. ICCV 2021: 5077-5086 - [c26]Yanning Zhou, Hang Xu, Wei Zhang, Bin Gao, Pheng-Ann Heng:
C3-SemiSeg: Contrastive Semi-supervised Segmentation via Cross-set Learning and Dynamic Class-balancing. ICCV 2021: 7016-7025 - [c25]Kai Chen, Lanqing Hong, Hang Xu, Zhenguo Li, Dit-Yan Yeung:
MultiSiam: Self-supervised Multi-instance Siamese Representation Learning for Autonomous Driving. ICCV 2021: 7526-7534 - [c24]Enze Xie, Jian Ding, Wenhai Wang, Xiaohang Zhan, Hang Xu, Peize Sun, Zhenguo Li, Ping Luo:
DetCo: Unsupervised Contrastive Learning for Object Detection. ICCV 2021: 8372-8381 - [c23]Muhammad Awais, Fengwei Zhou, Hang Xu, Lanqing Hong, Ping Luo, Sung-Ho Bae, Zhenguo Li:
Adversarial Robustness for Unsupervised Domain Adaptation. ICCV 2021: 8548-8557 - [c22]Xunlin Zhan, Yangxin Wu, Xiao Dong, Yunchao Wei, Minlong Lu, Yichi Zhang, Hang Xu, Xiaodan Liang:
Product1M: Towards Weakly Supervised Instance-Level Product Retrieval via Cross-Modal Pretraining. ICCV 2021: 11762-11771 - [c21]Peidong Liu, Gengwei Zhang, Bochao Wang, Hang Xu, Xiaodan Liang, Yong Jiang, Zhenguo Li:
Loss Function Discovery for Object Detection via Convergence-Simulation Driven Search. ICLR 2021 - [c20]Han Shi, Jiahui Gao, Xiaozhe Ren, Hang Xu, Xiaodan Liang, Zhenguo Li, James Tin-Yau Kwok:
SparseBERT: Rethinking the Importance Analysis in Self-attention. ICML 2021: 9547-9557 - [c19]Enze Xie, Wenjia Wang, Wenhai Wang, Peize Sun, Hang Xu, Ding Liang, Ping Luo:
Segmenting Transparent Objects in the Wild with Transformer. IJCAI 2021: 1194-1200 - [c18]Jianhua Han, Xiwen Liang, Hang Xu, Kai Chen, Lanqing Hong, Jiageng Mao, Chaoqiang Ye, Wei Zhang, Zhenguo Li, Xiaodan Liang, Chunjing Xu:
SODA10M: A Large-Scale 2D Self/Semi-Supervised Object Detection Dataset for Autonomous Driving. NeurIPS Datasets and Benchmarks 2021 - [c17]Jiachen Lu, Jinghan Yao, Junge Zhang, Xiatian Zhu, Hang Xu, Weiguo Gao, Chunjing Xu, Tao Xiang, Li Zhang:
SOFT: Softmax-free Transformer with Linear Complexity. NeurIPS 2021: 21297-21309 - [c16]Jiageng Mao, Minzhe Niu, Chenhan Jiang, Hanxue Liang, Jingheng Chen, Xiaodan Liang, Yamin Li, Chaoqiang Ye, Wei Zhang, Zhenguo Li, Jie Yu, Chunjing Xu, Hang Xu:
One Million Scenes for Autonomous Driving: ONCE Dataset. NeurIPS Datasets and Benchmarks 2021 - [c15]Yihan Zeng, Chunwei Wang, Yunbo Wang, Hang Xu, Chaoqiang Ye, Zhen Yang, Chao Ma:
Learning Transferable Features for Point Cloud Detection via 3D Contrastive Co-training. NeurIPS 2021: 21493-21504 - [i34]Enze Xie, Wenjia Wang, Wenhai Wang, Peize Sun, Hang Xu, Ding Liang, Ping Luo:
Trans2Seg: Transparent Object Segmentation with Transformer. CoRR abs/2101.08461 (2021) - [i33]Peidong Liu, Gengwei Zhang, Bochao Wang, Hang Xu, Xiaodan Liang, Yong Jiang, Zhenguo Li:
Loss Function Discovery for Object Detection via Convergence-Simulation Driven Search. CoRR abs/2102.04700 (2021) - [i32]Enze Xie, Jian Ding, Wenhai Wang, Xiaohang Zhan, Hang Xu, Zhenguo Li, Ping Luo:
DetCo: Unsupervised Contrastive Learning for Object Detection. CoRR abs/2102.04803 (2021) - [i31]Han Shi, Jiahui Gao, Xiaozhe Ren, Hang Xu, Xiaodan Liang, Zhenguo Li, James T. Kwok:
SparseBERT: Rethinking the Importance Analysis in Self-attention. CoRR abs/2102.12871 (2021) - [i30]Jian Ding, Enze Xie, Hang Xu, Chenhan Jiang, Zhenguo Li, Ping Luo, Gui-Song Xia:
Unsupervised Pretraining for Object Detection by Patch Reidentification. CoRR abs/2103.04814 (2021) - [i29]Xiao Zhou, Weizhong Zhang, Hang Xu, Tong Zhang:
Effective Sparsification of Neural Networks with Global Sparsity Constraint. CoRR abs/2105.01571 (2021) - [i28]Wenqi Shao, Hang Yu, Zhaoyang Zhang, Hang Xu, Zhenguo Li, Ping Luo:
BWCP: Probabilistic Learning-to-Prune Channels for ConvNets via Batch Whitening. CoRR abs/2105.06423 (2021) - [i27]Yawen Duan, Xin Chen, Hang Xu, Zewei Chen, Xiaodan Liang, Tong Zhang, Zhenguo Li:
TransNAS-Bench-101: Improving Transferability and Generalizability of Cross-Task Neural Architecture Search. CoRR abs/2105.11871 (2021) - [i26]Lewei Yao, Renjie Pi, Hang Xu, Wei Zhang, Zhenguo Li, Tong Zhang:
Joint-DetNAS: Upgrade Your Detector with NAS, Pruning and Dynamic Distillation. CoRR abs/2105.12971 (2021) - [i25]Jiageng Mao, Minzhe Niu, Chenhan Jiang, Hanxue Liang, Xiaodan Liang, Yamin Li, Chaoqiang Ye, Wei Zhang, Zhenguo Li, Jie Yu, Hang Xu, Chunjing Xu:
One Million Scenes for Autonomous Driving: ONCE Dataset. CoRR abs/2106.11037 (2021) - [i24]Jianhua Han, Xiwen Liang, Hang Xu, Kai Chen, Lanqing Hong, Chaoqiang Ye, Wei Zhang, Zhenguo Li, Xiaodan Liang, Chunjing Xu:
SODA10M: Towards Large-Scale Object Detection Benchmark for Autonomous Driving. CoRR abs/2106.11118 (2021) - [i23]Jiahui Gao, Hang Xu, Han Shi, Xiaozhe Ren, Philip L. H. Yu, Xiaodan Liang, Xin Jiang, Zhenguo Li:
AutoBERT-Zero: Evolving BERT Backbone from Scratch. CoRR abs/2107.07445 (2021) - [i22]Xunlin Zhan, Yangxin Wu, Xiao Dong, Yunchao Wei, Minlong Lu, Yichi Zhang, Hang Xu, Xiaodan Liang:
Product1M: Towards Weakly Supervised Instance-Level Product Retrieval via Cross-modal Pretraining. CoRR abs/2107.14572 (2021) - [i21]Hang Xu, Ning Kang, Gengwei Zhang, Chuanlong Xie, Xiaodan Liang, Zhenguo Li:
NASOA: Towards Faster Task-oriented Online Fine-tuning with a Zoo of Models. CoRR abs/2108.03434 (2021) - [i20]Lewei Yao, Renjie Pi, Hang Xu, Wei Zhang, Zhenguo Li, Tong Zhang:
G-DetKD: Towards General Distillation Framework for Object Detectors via Contrastive and Semantic-guided Feature Imitation. CoRR abs/2108.07482 (2021) - [i19]Kai Chen, Lanqing Hong, Hang Xu, Zhenguo Li, Dit-Yan Yeung:
MultiSiam: Self-supervised Multi-instance Siamese Representation Learning for Autonomous Driving. CoRR abs/2108.12178 (2021) - [i18]Muhammad Awais, Fengwei Zhou, Hang Xu, Lanqing Hong, Ping Luo, Sung-Ho Bae, Zhenguo Li:
Adversarial Robustness for Unsupervised Domain Adaptation. CoRR abs/2109.00946 (2021) - [i17]Jiageng Mao, Yujing Xue, Minzhe Niu, Haoyue Bai, Jiashi Feng, Xiaodan Liang, Hang Xu, Chunjing Xu:
Voxel Transformer for 3D Object Detection. CoRR abs/2109.02497 (2021) - [i16]Jiageng Mao, Minzhe Niu, Haoyue Bai, Xiaodan Liang, Hang Xu, Chunjing Xu:
Pyramid R-CNN: Towards Better Performance and Adaptability for 3D Object Detection. CoRR abs/2109.02499 (2021) - [i15]Chenhe Dong, Guangrun Wang, Hang Xu, Jiefeng Peng, Xiaozhe Ren, Xiaodan Liang:
EfficientBERT: Progressively Searching Multilayer Perceptron via Warm-up Knowledge Distillation. CoRR abs/2109.07222 (2021) - [i14]Jiachen Lu, Jinghan Yao, Junge Zhang, Xiatian Zhu, Hang Xu, Weiguo Gao, Chunjing Xu, Tao Xiang, Li Zhang:
SOFT: Softmax-free Transformer with Linear Complexity. CoRR abs/2110.11945 (2021) - [i13]Lewei Yao, Runhui Huang, Lu Hou, Guansong Lu, Minzhe Niu, Hang Xu, Xiaodan Liang, Zhenguo Li, Xin Jiang, Chunjing Xu:
FILIP: Fine-grained Interactive Language-Image Pre-Training. CoRR abs/2111.07783 (2021) - 2020
- [c14]Linpu Fang, Hang Xu, Zhili Liu, Sarah Parisot, Zhenguo Li:
EHSOD: CAM-Guided End-to-End Hybrid-Supervised Object Detection with Cascade Refinement. AAAI 2020: 10778-10785 - [c13]Chenhan Jiang, Shaoju Wang, Xiaodan Liang, Hang Xu, Nong Xiao:
ElixirNet: Relation-Aware Network Architecture Adaptation for Medical Lesion Detection. AAAI 2020: 11093-11100 - [c12]Hang Xu, Linpu Fang, Xiaodan Liang, Wenxiong Kang, Zhenguo Li:
Universal-RCNN: Universal Object Detector via Transferable Graph R-CNN. AAAI 2020: 12492-12499 - [c11]Lewei Yao, Hang Xu, Wei Zhang, Xiaodan Liang, Zhenguo Li:
SM-NAS: Structural-to-Modular Neural Architecture Search for Object Detection. AAAI 2020: 12661-12668 - [c10]Chenhan Jiang, Hang Xu, Wei Zhang, Xiaodan Liang, Zhenguo Li:
SP-NAS: Serial-to-Parallel Backbone Search for Object Detection. CVPR 2020: 11860-11869 - [c9]Xin Chen, Yawen Duan, Zewei Chen, Hang Xu, Zihao Chen, Xiaodan Liang, Tong Zhang, Zhenguo Li:
CATCH: Context-Based Meta Reinforcement Learning for Transferrable Architecture Search. ECCV (19) 2020: 185-202 - [c8]Wenshuo Ma, Tingzhong Tian, Hang Xu, Yimin Huang, Zhenguo Li:
AABO: Adaptive Anchor Box Optimization for Object Detection via Bayesian Sub-sampling. ECCV (5) 2020: 560-575 - [c7]Hang Xu, Shaoju Wang, Xinyue Cai, Wei Zhang, Xiaodan Liang, Zhenguo Li:
CurveLane-NAS: Unifying Lane-Sensitive Architecture Search and Adaptive Point Blending. ECCV (15) 2020: 689-704 - [c6]Han Shi, Renjie Pi, Hang Xu, Zhenguo Li, James T. Kwok, Tong Zhang:
Bridging the Gap between Sample-based and One-shot Neural Architecture Search with BONAS. NeurIPS 2020 - [c5]Yangxin Wu, Gengwei Zhang, Hang Xu, Xiaodan Liang, Liang Lin:
Auto-Panoptic: Cooperative Multi-Component Architecture Search for Panoptic Segmentation. NeurIPS 2020 - [i12]Hang Xu, Linpu Fang, Xiaodan Liang, Wenxiong Kang, Zhenguo Li:
Universal-RCNN: Universal Object Detector via Transferable Graph R-CNN. CoRR abs/2002.07417 (2020) - [i11]Linpu Fang, Hang Xu, Zhili Liu, Sarah Parisot, Zhenguo Li:
EHSOD: CAM-Guided End-to-end Hybrid-Supervised Object Detection with Cascade Refinement. CoRR abs/2002.07421 (2020) - [i10]Chenhan Jiang, Shaoju Wang, Hang Xu, Xiaodan Liang, Nong Xiao:
ElixirNet: Relation-aware Network Architecture Adaptation for Medical Lesion Detection. CoRR abs/2003.08770 (2020) - [i9]Wenshuo Ma, Tingzhong Tian, Hang Xu, Yimin Huang, Zhenguo Li:
AABO: Adaptive Anchor Box Optimization for Object Detection via Bayesian Sub-sampling. CoRR abs/2007.09336 (2020) - [i8]Xin Chen, Yawen Duan, Zewei Chen, Hang Xu, Zihao Chen, Xiaodan Liang, Tong Zhang, Zhenguo Li:
CATCH: Context-based Meta Reinforcement Learning for Transferrable Architecture Search. CoRR abs/2007.09380 (2020) - [i7]Hang Xu, Shaoju Wang, Xinyue Cai, Wei Zhang, Xiaodan Liang, Zhenguo Li:
CurveLane-NAS: Unifying Lane-Sensitive Architecture Search and Adaptive Point Blending. CoRR abs/2007.12147 (2020) - [i6]Yangxin Wu, Gengwei Zhang, Hang Xu, Xiaodan Liang, Liang Lin:
Auto-Panoptic: Cooperative Multi-Component Architecture Search for Panoptic Segmentation. CoRR abs/2010.16119 (2020) - [i5]Bochao Wang, Hang Xu, Jiajin Zhang, Chen Chen, Yixing Xu, Xiaozhi Fang, Ning Kang, Lanqing Hong, Chenhan Jiang, Xinyue Cai, Jiawei Li, Fengwei Zhou, Yong Li, Zhicheng Liu, Xinghao Chen, Kai Han, Han Shu, Dehua Song, Yunhe Wang, Wei Zhang, Chunjing Xu, Zhenguo Li, Wenzhi Liu, Tong Zhang:
VEGA: Towards an End-to-End Configurable AutoML Pipeline. CoRR abs/2011.01507 (2020) - [i4]Gengwei Zhang, Yiming Gao, Hang Xu, Hao Zhang, Zhenguo Li, Xiaodan Liang:
Ada-Segment: Automated Multi-loss Adaptation for Panoptic Segmentation. CoRR abs/2012.03603 (2020)
2010 – 2019
- 2019
- [j2]Philip L. H. Yu, Hang Xu:
Rank aggregation using latent-scale distance-based models. Stat. Comput. 29(2): 335-349 (2019) - [c4]Hang Xu, Chenhan Jiang, Xiaodan Liang, Liang Lin, Zhenguo Li:
Reasoning-RCNN: Unifying Adaptive Global Reasoning Into Large-Scale Object Detection. CVPR 2019: 6419-6428 - [c3]Hang Xu, Chenhan Jiang, Xiaodan Liang, Zhenguo Li:
Spatial-Aware Graph Relation Network for Large-Scale Object Detection. CVPR 2019: 9298-9307 - [c2]Hang Xu, Lewei Yao, Zhenguo Li, Xiaodan Liang, Wei Zhang:
Auto-FPN: Automatic Network Architecture Adaptation for Object Detection Beyond Classification. ICCV 2019: 6648-6657 - [i3]Han Shi, Renjie Pi, Hang Xu, Zhenguo Li, James T. Kwok, Tong Zhang:
Multi-objective Neural Architecture Search via Predictive Network Performance Optimization. CoRR abs/1911.09336 (2019) - [i2]Lewei Yao, Hang Xu, Wei Zhang, Xiaodan Liang, Zhenguo Li:
SM-NAS: Structural-to-Modular Neural Architecture Search for Object Detection. CoRR abs/1911.09929 (2019) - 2018
- [j1]Hang Xu, Mayer Alvo, Philip L. H. Yu:
Angle-based models for ranking data. Comput. Stat. Data Anal. 121: 113-136 (2018) - [c1]Chenhan Jiang, Hang Xu, Xiaodan Liang, Liang Lin:
Hybrid Knowledge Routed Modules for Large-scale Object Detection. NeurIPS 2018: 1559-1570 - [i1]Chenhan Jiang, Hang Xu, Xiaodan Liang, Liang Lin:
Hybrid Knowledge Routed Modules for Large-scale Object Detection. CoRR abs/1810.12681 (2018)
last updated on 2024-12-15 02:21 CET by the dblp team
all metadata released as open data under CC0 1.0 license