default search action
Yu Qiao 0001
Person information
- affiliation: Shanghai AI Laboratory, OpenGVLab, China
- affiliation: Chinese Academy of Sciences, Shenzhen Institutes of Advanced Technology, China
- affiliation (former): University of Tokyo, Graduate School of Information Science and Technology, Japan
- affiliation (PhD 2006): University of Electro-Communications, Tokyo, Japan
Other persons with the same name
- Yu Qiao — disambiguation page
- Yu Qiao 0002 — Biomedical Imaging Lab, Singapore
- Yu Qiao 0003 — Shanghai Jiao Tong University, Department of Automation, Institute of Image Processing and Pattern Recognition, China (and 1 more)
- Yu Qiao 0004 — Kyung Hee University, School of Computing, Department of Artificial Intelligence, Yongin, South Korea (and 1 more)
- Yu Qiao 0005 — RWTH Aachen University, Germany
- Yu Qiao 0006 — Nanjing University, National Key Laboratory for Novel Software Technology, Department of Computer Science and Technology, China
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j107]Yihao Liu, Hengyuan Zhao, Kelvin C. K. Chan, Xintao Wang, Chen Change Loy, Yu Qiao, Chao Dong:
Temporally consistent video colorization with deep feature propagation and self-regularization learning. Comput. Vis. Media 10(2): 375-395 (2024) - [j106]Peng Gao, Shijie Geng, Renrui Zhang, Teli Ma, Rongyao Fang, Yongfeng Zhang, Hongsheng Li, Yu Qiao:
CLIP-Adapter: Better Vision-Language Models with Feature Adapters. Int. J. Comput. Vis. 132(2): 581-595 (2024) - [j105]Kaiyang Zhou, Yongxin Yang, Yu Qiao, Tao Xiang:
MixStyle Neural Networks for Domain Generalization and Adaptation. Int. J. Comput. Vis. 132(3): 822-836 (2024) - [j104]Peng Gao, Ziyi Lin, Renrui Zhang, Rongyao Fang, Hongyang Li, Hongsheng Li, Yu Qiao:
Mimic before Reconstruct: Enhancing Masked Autoencoders with Feature Mimicking. Int. J. Comput. Vis. 132(5): 1546-1556 (2024) - [j103]Haibin He, Xinyuan Chen, Chaoyue Wang, Juhua Liu, Bo Du, Dacheng Tao, Yu Qiao:
Diff-Font: Diffusion Model for Robust One-Shot Font Generation. Int. J. Comput. Vis. 132(11): 5372-5386 (2024) - [j102]Hao Zhang, Lumin Xu, Shenqi Lai, Wenqi Shao, Nanning Zheng, Ping Luo, Yu Qiao, Kaipeng Zhang:
Open-Vocabulary Animal Keypoint Detection with Semantic-Feature Matching. Int. J. Comput. Vis. 132(12): 5741-5758 (2024) - [j101]Yuhui Wang, Yahan Xie, Yu Qiao, Zhaohui Xia, Yanying Chen:
Chinese CSUQ: Cross-Cultural Adaptation and Evaluation of Measurement Properties. Int. J. Hum. Comput. Interact. 40(22): 7623-7641 (2024) - [j100]Yi Liu, Yu Qiao, Yali Wang:
F2S-Net: learning frame-to-segment prediction for online action detection. J. Real Time Image Process. 21(3): 73 (2024) - [j99]Hongyang Li, Chonghao Sima, Jifeng Dai, Wenhai Wang, Lewei Lu, Huijie Wang, Jia Zeng, Zhiqi Li, Jiazhi Yang, Hanming Deng, Hao Tian, Enze Xie, Jiangwei Xie, Li Chen, Tianyu Li, Yang Li, Yulu Gao, Xiaosong Jia, Si Liu, Jianping Shi, Dahua Lin, Yu Qiao:
Delving Into the Devils of Bird's-Eye-View Perception: A Review, Evaluation and Recipe. IEEE Trans. Pattern Anal. Mach. Intell. 46(4): 2151-2170 (2024) - [j98]Yuexin Ma, Tai Wang, Xuyang Bai, Huitong Yang, Yuenan Hou, Yaming Wang, Yu Qiao, Ruigang Yang, Xinge Zhu:
Vision-Centric BEV Perception: A Survey. IEEE Trans. Pattern Anal. Mach. Intell. 46(12): 10978-10997 (2024) - [j97]Yingqi Liu, Jingwen He, Yihao Liu, Xinqi Lin, Fanghua Yu, Jinfan Hu, Yu Qiao, Chao Dong:
AdaptBIR: Adaptive Blind Image Restoration with latent diffusion prior for higher fidelity. Pattern Recognit. 155: 110659 (2024) - [j96]Mingfei Han, Yali Wang, Mingjie Li, Xiaojun Chang, Yi Yang, Yu Qiao:
Progressive Frame-Proposal Mining for Weakly Supervised Video Object Detection. IEEE Trans. Image Process. 33: 1560-1573 (2024) - [j95]Siran Chen, Qinglin Xu, Yue Ma, Yu Qiao, Yali Wang:
Attentive Snippet Prompting for Video Retrieval. IEEE Trans. Multim. 26: 4348-4359 (2024) - [j94]Yuer Ma, Yi Liu, Limin Wang, Wenxiong Kang, Yu Qiao, Yali Wang:
Dual Masked Modeling for Weakly-Supervised Temporal Boundary Discovery. IEEE Trans. Multim. 26: 5694-5704 (2024) - [j93]Mingye Xu, Zhipeng Zhou, Hongbin Xu, Yu Qiao, Yali Wang:
CP-Net: Contour-Perturbed Reconstruction Network for Self-Supervised Point Cloud Learning. IEEE Trans. Multim. 26: 8799-8810 (2024) - [c357]Siran Chen, Yue Ma, Yu Qiao, Yali Wang:
M-BEV: Masked BEV Perception for Robust Autonomous Driving. AAAI 2024: 1183-1191 - [c356]Ziteng Cui, Lin Gu, Xiao Sun, Xianzheng Ma, Yu Qiao, Tatsuya Harada:
Aleth-NeRF: Illumination Adaptive NeRF with Concealing Field Assumption. AAAI 2024: 1435-1444 - [c355]Bo Peng, Xinyuan Chen, Yaohui Wang, Chaochao Lu, Yu Qiao:
ConditionVideo: Training-Free Condition-Guided Video Generation. AAAI 2024: 4459-4467 - [c354]Wenshuo Peng, Kaipeng Zhang, Yue Yang, Hao Zhang, Yu Qiao:
Data Adaptive Traceback for Vision-Language Foundation Models in Image Classification. AAAI 2024: 4506-4514 - [c353]Shilin Yan, Renrui Zhang, Ziyu Guo, Wenchao Chen, Wei Zhang, Hongyang Li, Yu Qiao, Hao Dong, Zhongjiang He, Peng Gao:
Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation. AAAI 2024: 6449-6457 - [c352]Lingjun Zhang, Xinyuan Chen, Yaohui Wang, Yue Lu, Yu Qiao:
Brush Your Text: Synthesize Any Scene Text on Images via Diffusion Model. AAAI 2024: 7215-7223 - [c351]Yuanfu Wang, Chao Yang, Ying Wen, Yu Liu, Yu Qiao:
Critic-Guided Decision Transformer for Offline Reinforcement Learning. AAAI 2024: 15706-15714 - [c350]Yan Ma, Yu Qiao, Pengfei Liu:
MoPS: Modular Story Premise Synthesis for Open-Ended Automatic Story Generation. ACL (1) 2024: 2135-2169 - [c349]Lijun Li, Bowen Dong, Ruohui Wang, Xuhao Hu, Wangmeng Zuo, Dahua Lin, Yu Qiao, Jing Shao:
SALAD-Bench: A Hierarchical and Comprehensive Safety Benchmark for Large Language Models. ACL (Findings) 2024: 3923-3954 - [c348]Chen Qian, Jie Zhang, Wei Yao, Dongrui Liu, Zhenfei Yin, Yu Qiao, Yong Liu, Jing Shao:
Towards Tracing Trustworthiness Dynamics: Revisiting Pre-training Period of Large Language Models. ACL (Findings) 2024: 4864-4888 - [c347]Guoxin Chen, Kexin Tang, Chao Yang, Fuying Ye, Yu Qiao, Yiming Qian:
SEER: Facilitating Structured Reasoning and Explanation via Reinforcement Learning. ACL (1) 2024: 5901-5921 - [c346]Fanqing Meng, Wenqi Shao, Quanfeng Lu, Peng Gao, Kaipeng Zhang, Yu Qiao, Ping Luo:
ChartAssistant: A Universal Chart Multimodal Language Model via Chart-to-Table Pre-training and Multitask Instruction Tuning. ACL (Findings) 2024: 7775-7803 - [c345]Zhanhui Zhou, Jie Liu, Jing Shao, Xiangyu Yue, Chao Yang, Wanli Ouyang, Yu Qiao:
Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization. ACL (Findings) 2024: 10586-10613 - [c344]Zaibin Zhang, Yongting Zhang, Lijun Li, Jing Shao, Hongzhi Gao, Yu Qiao, Lijun Wang, Huchuan Lu, Feng Zhao:
PsySafe: A Comprehensive Framework for Psychological-based Attack, Defense, and Evaluation of Multi-agent System Safety. ACL (1) 2024: 15202-15231 - [c343]Zhanhui Zhou, Jie Liu, Zhichen Dong, Jiaheng Liu, Chao Yang, Wanli Ouyang, Yu Qiao:
Emulated Disalignment: Safety Alignment for Large Language Models May Backfire! ACL (1) 2024: 15810-15830 - [c342]Yuan Xu, Xiaoxuan Ma, Jiajun Su, Wentao Zhu, Yu Qiao, Yizhou Wang:
ScoreHypo: Probabilistic Human Mesh Estimation with Hypothesis Scoring. CVPR 2024: 979-989 - [c341]Xiaoliang Ju, Zhaoyang Huang, Yijiin Li, Guofeng Zhang, Yu Qiao, Hongsheng Li:
DiffInDScene: Diffusion-Based High-Quality 3D Indoor Scene Generation. CVPR 2024: 4526-4535 - [c340]Xiaoyang Wu, Li Jiang, Peng-Shuai Wang, Zhijian Liu, Xihui Liu, Yu Qiao, Wanli Ouyang, Tong He, Hengshuang Zhao:
Point Transformer V3: Simpler, Faster, Stronger. CVPR 2024: 4840-4851 - [c339]Yuwen Xiong, Zhiqi Li, Yuntao Chen, Feng Wang, Xizhou Zhu, Jiapeng Luo, Wenhai Wang, Tong Lu, Hongsheng Li, Yu Qiao, Lewei Lu, Jie Zhou, Jifeng Dai:
Efficient Deformable ConvNets: Rethinking Dynamic and Sparse Operator for Vision Applications. CVPR 2024: 5652-5661 - [c338]Ziyan Chen, Jingwen He, Xinqi Lin, Yu Qiao, Chao Dong:
Towards Real-world Video Face Restoration: A New Benchmark. CVPR Workshops 2024: 5929-5939 - [c337]Lirui Zhao, Yue Yang, Kaipeng Zhang, Wenqi Shao, Yuxin Zhang, Yu Qiao, Ping Luo, Rongrong Ji:
DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model. CVPR 2024: 6390-6399 - [c336]Yuming Jiang, Tianxing Wu, Shuai Yang, Chenyang Si, Dahua Lin, Yu Qiao, Chen Change Loy, Ziwei Liu:
VideoBooth: Diffusion-based Video Generation with Image Prompts. CVPR 2024: 6689-6700 - [c335]Shaobin Zhuang, Kunchang Li, Xinyuan Chen, Yaohui Wang, Ziwei Liu, Yu Qiao, Yali Wang:
Vlogger: Make Your Dream A Vlog. CVPR 2024: 8806-8817 - [c334]Zehuan Huang, Hao Wen, Junting Dong, Yaohui Wang, Yangguang Li, Xinyuan Chen, Yan-Pei Cao, Ding Liang, Yu Qiao, Bo Dai, Lu Sheng:
EpiDiff: Enhancing Multi-View Synthesis via Localized Epipolar-Constrained Diffusion. CVPR 2024: 9784-9794 - [c333]Bo Zou, Chao Yang, Yu Qiao, Chengbin Quan, Youjian Zhao:
LLaMA-Excitor: General Instruction Tuning via Indirect Feature Interaction. CVPR 2024: 14089-14099 - [c332]Jiazhi Yang, Shenyuan Gao, Yihang Qiu, Li Chen, Tianyu Li, Bo Dai, Kashyap Chitta, Penghao Wu, Jia Zeng, Ping Luo, Jun Zhang, Andreas Geiger, Yu Qiao, Hongyang Li:
Generalized Predictive Model for Autonomous Driving. CVPR 2024: 14662-14672 - [c331]Yiran Qin, Enshen Zhou, Qichang Liu, Zhenfei Yin, Lu Sheng, Ruimao Zhang, Yu Qiao, Jing Shao:
MP5: A Multi-modal Open-ended Embodied System in Minecraft via Active Perception. CVPR 2024: 16307-16316 - [c330]Hao Li, Xue Yang, Zhaokai Wang, Xizhou Zhu, Jie Zhou, Yu Qiao, Xiaogang Wang, Hongsheng Li, Lewei Lu, Jifeng Dai:
Auto MC-Reward: Automated Dense Reward Design with Large Language Models for Minecraft. CVPR 2024: 16426-16435 - [c329]Yi Yu, Xue Yang, Qingyun Li, Feipeng Da, Jifeng Dai, Yu Qiao, Junchi Yan:
Point2RBox: Combine Knowledge from Synthetic Visual Patterns for End-to-End Oriented Object Detection with Single Point Supervision. CVPR 2024: 16783-16793 - [c328]Zhiyu Zhao, Bingkun Huang, Sen Xing, Gangshan Wu, Yu Qiao, Limin Wang:
Asymmetric Masked Distillation for Pre-Training Small Foundation Models. CVPR 2024: 18516-18526 - [c327]Hao Wu, Huabin Liu, Yu Qiao, Xiao Sun:
DIBS: Enhancing Dense Video Captioning with Unlabeled Videos via Pseudo Boundary Enrichment and Online Refinement. CVPR 2024: 18699-18708 - [c326]Ziqi Huang, Yinan He, Jiashuo Yu, Fan Zhang, Chenyang Si, Yuming Jiang, Yuanhan Zhang, Tianxing Wu, Qingyang Jin, Nattapol Chanpaisit, Yaohui Wang, Xinyuan Chen, Limin Wang, Dahua Lin, Yu Qiao, Ziwei Liu:
VBench: Comprehensive Benchmark Suite for Video Generative Models. CVPR 2024: 21807-21818 - [c325]Yifei Huang, Guo Chen, Jilan Xu, Mingfang Zhang, Lijin Yang, Baoqi Pei, Hongjie Zhang, Lu Dong, Yali Wang, Limin Wang, Yu Qiao:
EgoExoLearn: A Dataset for Bridging Asynchronous Ego- and Exo-centric View of Procedural Activities in Real World. CVPR 2024: 22072-22086 - [c324]Yutao Hu, Tianbin Li, Quanfeng Lu, Wenqi Shao, Junjun He, Yu Qiao, Ping Luo:
OmniMedVQA: A New Large-Scale Comprehensive Evaluation Benchmark for Medical LVLM. CVPR 2024: 22170-22183 - [c323]Kunchang Li, Yali Wang, Yinan He, Yizhuo Li, Yi Wang, Yi Liu, Zun Wang, Jilan Xu, Guo Chen, Ping Lou, Limin Wang, Yu Qiao:
MVBench: A Comprehensive Multi-modal Video Understanding Benchmark. CVPR 2024: 22195-22206 - [c322]Zhe Chen, Jiannan Wu, Wenhai Wang, Weijie Su, Guo Chen, Sen Xing, Muyan Zhong, Qinglong Zhang, Xizhou Zhu, Lewei Lu, Bin Li, Ping Luo, Tong Lu, Yu Qiao, Jifeng Dai:
Intern VL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks. CVPR 2024: 24185-24198 - [c321]Fanghua Yu, Jinjin Gu, Zheyuan Li, Jinfan Hu, Xiangtao Kong, Xintao Wang, Jingwen He, Yu Qiao, Chao Dong:
Scaling Up to Excellence: Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild. CVPR 2024: 25669-25680 - [c320]Yufei Wang, Wenhan Yang, Xinyuan Chen, Yaohui Wang, Lanqing Guo, Lap-Pui Chau, Ziwei Liu, Yu Qiao, Alex C. Kot, Bihan Wen:
SinSR: Diffusion-Based Image Super-Resolution in a Single Step. CVPR 2024: 25796-25805 - [c319]Jiaming Han, Kaixiong Gong, Yiyuan Zhang, Jiaqi Wang, Kaipeng Zhang, Dahua Lin, Yu Qiao, Peng Gao, Xiangyu Yue:
OneLLM: One Framework to Align All Modalities with Language. CVPR 2024: 26574-26585 - [c318]Bo Zou, Chao Yang, Yu Qiao, Chengbin Quan, Youjian Zhao:
Language-aware Visual Semantic Distillation for Video Question Answering. CVPR 2024: 27103-27113 - [c317]Ziyi Lin, Dongyang Liu, Renrui Zhang, Peng Gao, Longtian Qiu, Han Xiao, Han Qiu, Wenqi Shao, Keqin Chen, Jiaming Han, Siyuan Huang, Yichi Zhang, Xuming He, Yu Qiao, Hongsheng Li:
SPHINX: A Mixer of Weights, Visual Embeddings and Image Scales for Multi-modal Large Language Models. ECCV (62) 2024: 36-55 - [c316]Yuchen Yang, Yu Qiao, Xiao Sun:
Mask as Supervision: Leveraging Unified Mask Information for Unsupervised 3D Pose Estimation. ECCV (44) 2024: 38-55 - [c315]Shuo Cao, Yihao Liu, Wenlong Zhang, Yu Qiao, Chao Dong:
GRIDS: Grouped Multiple-Degradation Restoration with Image Degradation Similarity. ECCV (70) 2024: 70-87 - [c314]Xiangyu Chen, Zheyuan Li, Yuandong Pu, Yihao Liu, Jiantao Zhou, Yu Qiao, Chao Dong:
A Comparative Study of Image Restoration Networks for General Backbone Network Design. ECCV (71) 2024: 74-91 - [c313]Zhaoyang Liu, Zeqiang Lai, Zhangwei Gao, Erfei Cui, Ziheng Li, Xizhou Zhu, Lewei Lu, Qifeng Chen, Yu Qiao, Jifeng Dai, Wenhai Wang:
ControlLLM: Augment Language Models with Tools by Searching on Graphs. ECCV (12) 2024: 89-105 - [c312]Yunsong Zhou, Linyan Huang, Qingwen Bu, Jia Zeng, Tianyu Li, Hang Qiu, Hongzi Zhu, Minyi Guo, Yu Qiao, Hongyang Li:
Embodied Understanding of Driving Scenarios. ECCV (62) 2024: 129-148 - [c311]Gang Li, Wenhai Wang, Xiang Li, Ziheng Li, Jian Yang, Jifeng Dai, Yu Qiao, Shanshan Zhang:
Distilling Knowledge from Large-Scale Image Models for Object Detection. ECCV (84) 2024: 142-160 - [c310]Renrui Zhang, Dongzhi Jiang, Yichi Zhang, Haokun Lin, Ziyu Guo, Pengshuo Qiu, Aojun Zhou, Pan Lu, Kai-Wei Chang, Yu Qiao, Peng Gao, Hongsheng Li:
MATHVERSE: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems? ECCV (8) 2024: 169-186 - [c309]Jiakang Yuan, Bo Zhang, Kaixiong Gong, Xiangyu Yue, Botian Shi, Yu Qiao, Tao Chen:
Reg-TTA3D: Better Regression Makes Better Test-Time Adaptive 3D Object Detection. ECCV (43) 2024: 197-213 - [c308]Kunchang Li, Xinhao Li, Yi Wang, Yinan He, Yali Wang, Limin Wang, Yu Qiao:
VideoMamba: State Space Model for Efficient Video Understanding. ECCV (26) 2024: 237-255 - [c307]Zhihang Zhong, Gurunandan Krishnan, Xiao Sun, Yu Qiao, Sizhuo Ma, Jian Wang:
Clearer Frames, Anytime: Resolving Velocity Ambiguity in Video Frame Interpolation. ECCV (33) 2024: 346-363 - [c306]Xin Liu, Yichen Zhu, Jindong Gu, Yunshi Lan, Chao Yang, Yu Qiao:
MM-SafetyBench: A Benchmark for Safety Evaluation of Multimodal Large Language Models. ECCV (56) 2024: 386-403 - [c305]Yi Wang, Kunchang Li, Xinhao Li, Jiashuo Yu, Yinan He, Guo Chen, Baoqi Pei, Rongkun Zheng, Zun Wang, Yansong Shi, Tianxiang Jiang, Songze Li, Jilan Xu, Hongjie Zhang, Yifei Huang, Yu Qiao, Yali Wang, Limin Wang:
InternVideo2: Scaling Foundation Models for Multimodal Video Understanding. ECCV (85) 2024: 396-416 - [c304]Xinqi Lin, Jingwen He, Ziyan Chen, Zhaoyang Lyu, Bo Dai, Fanghua Yu, Yu Qiao, Wanli Ouyang, Chao Dong:
DiffBIR: Toward Blind Image Restoration with Generative Diffusion Prior. ECCV (59) 2024: 430-448 - [c303]Weiyun Wang, Yiming Ren, Haowen Luo, Tiantong Li, Chenxiang Yan, Zhe Chen, Wenhai Wang, Qingyun Li, Lewei Lu, Xizhou Zhu, Yu Qiao, Jifeng Dai:
The All-Seeing Project V2: Towards General Relation Comprehension of the Open World. ECCV (33) 2024: 471-490 - [c302]Yutong Chen, Yifan Zhan, Zhihang Zhong, Wei Wang, Xiao Sun, Yu Qiao, Yinqiang Zheng:
Within the Dynamic Context: Inertia-Aware 3D Human Modeling with Pose Sequence. ECCV (49) 2024: 491-508 - [c301]Zhaoxun Ju, Chao Yang, Fuchun Sun, Hongbo Wang, Yu Qiao:
Rethinking Mutual Information for Language Conditioned Skill Discovery on Imitation Learning. ICAPS 2024: 301-309 - [c300]Yue Yang, Kaipeng Zhang, Yuying Ge, Wenqi Shao, Zeyue Xue, Yu Qiao, Ping Luo:
Align, Adapt and Inject: Audio-Guided Image Generation, Editing and Stylization. ICASSP 2024: 3475-3479 - [c299]Yuwei Guo, Ceyuan Yang, Anyi Rao, Zhengyang Liang, Yaohui Wang, Yu Qiao, Maneesh Agrawala, Dahua Lin, Bo Dai:
AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning. ICLR 2024 - [c298]Bo Zhang, Xinyu Cai, Jiakang Yuan, Donglin Yang, Jianfei Guo, Xiangchao Yan, Renqiu Xia, Botian Shi, Min Dou, Tao Chen, Si Liu, Junchi Yan, Yu Qiao:
ReSimAD: Zero-Shot 3D Domain Transfer for Autonomous Driving with Source Reconstruction and Target Simulation. ICLR 2024 - [c297]Xinyuan Chen, Yaohui Wang, Lingjun Zhang, Shaobin Zhuang, Xin Ma, Jiashuo Yu, Yali Wang, Dahua Lin, Yu Qiao, Ziwei Liu:
SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction. ICLR 2024 - [c296]Mengkang Hu, Yao Mu, Xinmiao Yu, Mingyu Ding, Shiguang Wu, Wenqi Shao, Qiguang Chen, Bin Wang, Yu Qiao, Ping Luo:
Tree-Planner: Efficient Close-loop Task Planning with Large Language Models. ICLR 2024 - [c295]Wenqi Shao, Mengzhao Chen, Zhaoyang Zhang, Peng Xu, Lirui Zhao, Zhiqian Li, Kaipeng Zhang, Peng Gao, Yu Qiao, Ping Luo:
OmniQuant: Omnidirectionally Calibrated Quantization for Large Language Models. ICLR 2024 - [c294]Weigao Sun, Zhen Qin, Weixuan Sun, Shidi Li, Dong Li, Xuyang Shen, Yu Qiao, Yiran Zhong:
CO2: Efficient Distributed Training with Full Communication-Computation Overlap. ICLR 2024 - [c293]Weiyun Wang, Min Shi, Qingyun Li, Wenhai Wang, Zhenhang Huang, Linjie Xing, Zhe Chen, Hao Li, Xizhou Zhu, Zhiguo Cao, Yushi Chen, Tong Lu, Jifeng Dai, Yu Qiao:
The All-Seeing Project: Towards Panoptic Visual Recognition and Understanding of the Open World. ICLR 2024 - [c292]Yi Wang, Yinan He, Yizhuo Li, Kunchang Li, Jiashuo Yu, Xin Ma, Xinhao Li, Guo Chen, Xinyuan Chen, Yaohui Wang, Ping Luo, Ziwei Liu, Yali Wang, Limin Wang, Yu Qiao:
InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation. ICLR 2024 - [c291]Licheng Wen, Daocheng Fu, Xin Li, Xinyu Cai, Tao Ma, Pinlong Cai, Min Dou, Botian Shi, Liang He, Yu Qiao:
DiLu: A Knowledge-Driven Approach to Autonomous Driving with Large Language Models. ICLR 2024 - [c290]Peng Xu, Wenqi Shao, Mengzhao Chen, Shitao Tang, Kaipeng Zhang, Peng Gao, Fengwei An, Yu Qiao, Ping Luo:
BESA: Pruning Large Language Models with Blockwise Parameter-Efficient Sparsity Allocation. ICLR 2024 - [c289]Renrui Zhang, Zhengkai Jiang, Ziyu Guo, Shilin Yan, Junting Pan, Hao Dong, Yu Qiao, Peng Gao, Hongsheng Li:
Personalize Segment Anything Model with One Shot. ICLR 2024 - [c288]Renrui Zhang, Jiaming Han, Chris Liu, Aojun Zhou, Pan Lu, Yu Qiao, Hongsheng Li, Peng Gao:
LLaMA-Adapter: Efficient Fine-tuning of Large Language Models with Zero-initialized Attention. ICLR 2024 - [c287]Wenlong Zhang, Xiaohui Li, Xiangyu Chen, Xiaoyun Zhang, Yu Qiao, Xiao-Ming Wu, Chao Dong:
SEAL: A Framework for Systematic Evaluation of Real-World Super-Resolution. ICLR 2024 - [c286]Mingzhou Liu, Xinwei Sun, Yu Qiao, Yizhou Wang:
Causal Discovery via Conditional Independence Testing with Proxy Variables. ICML 2024 - [c285]Yihao Liu, Xiangyu Chen, Xianzheng Ma, Xintao Wang, Jiantao Zhou, Yu Qiao, Chao Dong:
Unifying Image Processing as Visual Prompting Question Answering. ICML 2024 - [c284]Yao Mu, Junting Chen, Qinglong Zhang, Shoufa Chen, Qiaojun Yu, Chongjian Ge, Runjian Chen, Zhixuan Liang, Mengkang Hu, Chaofan Tao, Peize Sun, Haibao Yu, Chao Yang, Wenqi Shao, Wenhai Wang, Jifeng Dai, Yu Qiao, Mingyu Ding, Ping Luo:
RoboCodeX: Multimodal Code Generation for Robotic Behavior Synthesis. ICML 2024 - [c283]Dongyang Liu, Renrui Zhang, Longtian Qiu, Siyuan Huang, Weifeng Lin, Shitian Zhao, Shijie Geng, Ziyi Lin, Peng Jin, Kaipeng Zhang, Wenqi Shao, Chao Xu, Conghui He, Junjun He, Hao Shao, Pan Lu, Yu Qiao, Hongsheng Li, Peng Gao:
SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models. ICML 2024 - [c282]Yue Yang, Yuqi Lin, Hong Liu, Wenqi Shao, Runjian Chen, Hailong Shang, Yu Wang, Yu Qiao, Kaipeng Zhang, Ping Luo:
Position: Towards Implicit Prompt For Text-To-Image Models. ICML 2024 - [c281]Kaining Ying, Fanqing Meng, Jin Wang, Zhiqian Li, Han Lin, Yue Yang, Hao Zhang, Wenbo Zhang, Yuqi Lin, Shuo Liu, Jiayi Lei, Quanfeng Lu, Runjian Chen, Peng Xu, Renrui Zhang, Haozhe Zhang, Peng Gao, Yali Wang, Yu Qiao, Ping Luo, Kaipeng Zhang, Wenqi Shao:
MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI. ICML 2024 - [c280]Xin Liu, Yichen Zhu, Yunshi Lan, Chao Yang, Yu Qiao:
Safety of Multimodal Large Language Models on Images and Text. IJCAI 2024: 8151-8159 - [c279]Daocheng Fu, Wenjie Lei, Licheng Wen, Pinlong Cai, Song Mao, Min Dou, Botian Shi, Yu Qiao:
LimSim++: A Closed-Loop Platform for Deploying Multimodal LLMs in Autonomous Driving. IV 2024: 1084-1090 - [c278]Xiangyu Chen, Yihao Liu, Yuandong Pu, Wenlong Zhang, Jiantao Zhou, Yu Qiao, Chao Dong:
Learning A Low-Level Vision Generalist via Visual Task Prompt. ACM Multimedia 2024: 2671-2680 - [c277]Daocheng Fu, Xin Li, Licheng Wen, Min Dou, Pinlong Cai, Botian Shi, Yu Qiao:
Drive Like a Human: Rethinking Autonomous Driving with Large Language Models. WACV (Workshops) 2024: 910-919 - [c276]Zeyu Lu, Chengyue Wu, Xinyuan Chen, Yaohui Wang, Lei Bai, Yu Qiao, Xihui Liu:
Hierarchical Diffusion Autoencoders and Disentangled Image Manipulation. WACV 2024: 5362-5371 - [i408]Fanqing Meng, Wenqi Shao, Quanfeng Lu, Peng Gao, Kaipeng Zhang, Yu Qiao, Ping Luo:
ChartAssisstant: A Universal Chart Multimodal Language Model via Chart-to-Table Pre-training and Multitask Instruction Tuning. CoRR abs/2401.02384 (2024) - [i407]Xin Ma, Yaohui Wang, Gengyun Jia, Xinyuan Chen, Ziwei Liu, Yuan-Fang Li, Cunjian Chen, Yu Qiao:
Latte: Latent Diffusion Transformer for Video Generation. CoRR abs/2401.03048 (2024) - [i406]Yuwen Xiong, Zhiqi Li, Yuntao Chen, Feng Wang, Xizhou Zhu, Jiapeng Luo, Wenhai Wang, Tong Lu, Hongsheng Li, Yu Qiao, Lewei Lu, Jie Zhou, Jifeng Dai:
Efficient Deformable ConvNets: Rethinking Dynamic and Sparse Operator for Vision Applications. CoRR abs/2401.06197 (2024) - [i405]Shaobin Zhuang, Kunchang Li, Xinyuan Chen, Yaohui Wang, Ziwei Liu, Yu Qiao, Yali Wang:
Vlogger: Make Your Dream A Vlog. CoRR abs/2401.09414 (2024) - [i404]Changyao Tian, Xizhou Zhu, Yuwen Xiong, Weiyun Wang, Zhe Chen, Wenhai Wang, Yuntao Chen, Lewei Lu, Tong Lu, Jie Zhou, Hongsheng Li, Yu Qiao, Jifeng Dai:
MM-Interleaved: Interleaved Image-Text Generative Modeling via Multi-modal Feature Synchronizer. CoRR abs/2401.10208 (2024) - [i403]Zaibin Zhang, Yongting Zhang, Lijun Li, Hongzhi Gao, Lijun Wang, Huchuan Lu, Feng Zhao, Yu Qiao, Jing Shao:
PsySafe: A Comprehensive Framework for Psychological-based Attack, Defense, and Evaluation of Multi-agent System Safety. CoRR abs/2401.11880 (2024) - [i402]Guoxin Chen, Kexin Tang, Chao Yang, Fuying Ye, Yu Qiao, Yiming Qian:
SEER: Facilitating Structured Reasoning and Explanation via Reinforcement Learning. CoRR abs/2401.13246 (2024) - [i401]Fanghua Yu, Jinjin Gu, Zheyuan Li, Jinfan Hu, Xiangtao Kong, Xintao Wang, Jingwen He, Yu Qiao, Chao Dong:
Scaling Up to Excellence: Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild. CoRR abs/2401.13627 (2024) - [i400]Chaochao Lu, Chen Qian, Guodong Zheng, Hongxing Fan, Hongzhi Gao, Jie Zhang, Jing Shao, Jingyi Deng, Jinlan Fu, Kexin Huang, Kunchang Li, Lijun Li, Limin Wang, Lu Sheng, Meiqi Chen, Ming Zhang, Qibing Ren, Sirui Chen, Tao Gui, Wanli Ouyang, Yali Wang, Yan Teng, Yaru Wang, Yi Wang, Yinan He, Yingchun Wang, Yixu Wang, Yongting Zhang, Yu Qiao, Yujiong Shen, Yurong Mou, Yuxi Chen, Zaibin Zhang, Zhelun Shi, Zhenfei Yin, Zhipin Wang:
From GPT-4 to Gemini and Beyond: Assessing the Landscape of MLLMs on Generalizability, Trustworthiness and Causality through Four Modalities. CoRR abs/2401.15071 (2024) - [i399]Weigao Sun, Zhen Qin, Weixuan Sun, Shidi Li, Dong Li, Xuyang Shen, Yu Qiao, Yiran Zhong:
CO2: Efficient Distributed Training with Full Communication-Computation Overlap. CoRR abs/2401.16265 (2024) - [i398]Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Bin Wang, Linke Ouyang, Xilin Wei, Songyang Zhang, Haodong Duan, Maosong Cao, Wenwei Zhang, Yining Li, Hang Yan, Yang Gao, Xinyue Zhang, Wei Li, Jingwen Li, Kai Chen, Conghui He, Xingcheng Zhang, Yu Qiao, Dahua Lin, Jiaqi Wang:
InternLM-XComposer2: Mastering Free-form Text-Image Composition and Comprehension in Vision-Language Large Model. CoRR abs/2401.16420 (2024) - [i397]Xin Liu, Yichen Zhu, Yunshi Lan, Chao Yang, Yu Qiao:
Safety of Multimodal Large Language Models on Images and Text. CoRR abs/2402.00357 (2024) - [i396]Daocheng Fu, Wenjie Lei, Licheng Wen, Pinlong Cai, Song Mao, Min Dou, Botian Shi, Yu Qiao:
LimSim++: A Closed-Loop Platform for Deploying Multimodal LLMs in Autonomous Driving. CoRR abs/2402.01246 (2024) - [i395]Lijun Li, Bowen Dong, Ruohui Wang, Xuhao Hu, Wangmeng Zuo, Dahua Lin, Yu Qiao, Jing Shao:
SALAD-Bench: A Hierarchical and Comprehensive Safety Benchmark for Large Language Models. CoRR abs/2402.05044 (2024) - [i394]Shikun Ban, Juling Fan, Wentao Zhu, Xiaoxuan Ma, Yu Qiao, Yizhou Wang:
Real-time Holistic Robot Pose Estimation with Unknown States. CoRR abs/2402.05655 (2024) - [i393]Peng Gao, Renrui Zhang, Chris Liu, Longtian Qiu, Siyuan Huang, Weifeng Lin, Shitian Zhao, Shijie Geng, Ziyi Lin, Peng Jin, Kaipeng Zhang, Wenqi Shao, Chao Xu, Conghui He, Junjun He, Hao Shao, Pan Lu, Hongsheng Li, Yu Qiao:
SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models. CoRR abs/2402.05935 (2024) - [i392]Yutao Hu, Tianbin Li, Quanfeng Lu, Wenqi Shao, Junjun He, Yu Qiao, Ping Luo:
OmniMedVQA: A New Large-Scale Comprehensive Evaluation Benchmark for Medical LVLM. CoRR abs/2402.09181 (2024) - [i391]Zhichen Dong, Zhanhui Zhou, Chao Yang, Jing Shao, Yu Qiao:
Attacks, Defenses and Evaluations for LLM Conversation Safety: A Survey. CoRR abs/2402.09283 (2024) - [i390]Renqiu Xia, Bo Zhang, Hancheng Ye, Xiangchao Yan, Qi Liu, Hongbin Zhou, Zijun Chen, Min Dou, Botian Shi, Junchi Yan, Yu Qiao:
ChartX & ChartVLM: A Versatile Benchmark and Foundation Model for Complicated Chart Reasoning. CoRR abs/2402.12185 (2024) - [i389]Zhanhui Zhou, Jie Liu, Zhichen Dong, Jiaheng Liu, Chao Yang, Wanli Ouyang, Yu Qiao:
Emulated Disalignment: Safety Alignment for Large Language Models May Backfire! CoRR abs/2402.12343 (2024) - [i388]Junting Chen, Yao Mu, Qiaojun Yu, Tianming Wei, Silang Wu, Zhecheng Yuan, Zhixuan Liang, Chao Yang, Kaipeng Zhang, Wenqi Shao, Yu Qiao, Huazhe Xu, Mingyu Ding, Ping Luo:
RoboScript: Code Generation for Free-Form Manipulation Tasks across Real and Simulation. CoRR abs/2402.14623 (2024) - [i387]Yao Mu, Junting Chen, Qinglong Zhang, Shoufa Chen, Qiaojun Yu, Chongjian Ge, Runjian Chen, Zhixuan Liang, Mengkang Hu, Chaofan Tao, Peize Sun, Haibao Yu, Chao Yang, Wenqi Shao, Wenhai Wang, Jifeng Dai, Yu Qiao, Mingyu Ding, Ping Luo:
RoboCodeX: Multimodal Code Generation for Robotic Behavior Synthesis. CoRR abs/2402.16117 (2024) - [i386]Peng Xu, Wenqi Shao, Mengzhao Chen, Shitao Tang, Kaipeng Zhang, Peng Gao, Fengwei An, Yu Qiao, Ping Luo:
BESA: Pruning Large Language Models with Blockwise Parameter-Efficient Sparsity Allocation. CoRR abs/2402.16880 (2024) - [i385]Zhaoxun Ju, Chao Yang, Hongbo Wang, Yu Qiao, Fuchun Sun:
Rethinking Mutual Information for Language Conditioned Skill Discovery on Imitation Learning. CoRR abs/2402.17511 (2024) - [i384]Boyu Chen, Siran Chen, Kunchang Li, Qinglin Xu, Yu Qiao, Yali Wang:
Percept, Chat, and then Adapt: Multimodal Knowledge Transfer of Foundation Models for Open-World Video Recognition. CoRR abs/2402.18951 (2024) - [i383]Jiantao Qiu, Haijun Lv, Zhenjiang Jin, Rui Wang, Wenchang Ning, Jia Yu, ChaoBin Zhang, Zhenxiang Li, Pei Chu, Yuan Qu, Jin Shi, Lindong Lu, Runyu Peng, Zhiyuan Zeng, Huanze Tang, Zhikai Lei, Jiawei Hong, Keyu Chen, Zhaoye Fei, Ruiliang Xu, Wei Li, Zhongying Tu, Dahua Lin, Yu Qiao, Hang Yan, Conghui He:
WanJuan-CC: A Safe and High-Quality Open-sourced English Webtext Dataset. CoRR abs/2402.19282 (2024) - [i382]Chen Qian, Jie Zhang, Wei Yao, Dongrui Liu, Zhenfei Yin, Yu Qiao, Yong Liu, Jing Shao:
Towards Tracing Trustworthiness Dynamics: Revisiting Pre-training Period of Large Language Models. CoRR abs/2402.19465 (2024) - [i381]Weiyun Wang, Yiming Ren, Haowen Luo, Tiantong Li, Chenxiang Yan, Zhe Chen, Wenhai Wang, Qingyun Li, Lewei Lu, Xizhou Zhu, Yu Qiao, Jifeng Dai:
The All-Seeing Project V2: Towards General Relation Comprehension of the Open World. CoRR abs/2402.19474 (2024) - [i380]Zishi Li, Xiaoxuan Ma, Qiuyan Shang, Wentao Zhu, Hai Ci, Yu Qiao, Yizhou Wang:
Efficient Action Counting with Dynamic Queries. CoRR abs/2403.01543 (2024) - [i379]Yue Yang, Yuqi lin, Hong Liu, Wenqi Shao, Runjian Chen, Hailong Shang, Yu Wang, Yu Qiao, Kaipeng Zhang, Ping Luo:
Towards Implicit Prompt For Text-To-Image Models. CoRR abs/2403.02118 (2024) - [i378]Yuchen Duan, Weiyun Wang, Zhe Chen, Xizhou Zhu, Lewei Lu, Tong Lu, Yu Qiao, Hongsheng Li, Jifeng Dai, Wenhai Wang:
Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures. CoRR abs/2403.02308 (2024) - [i377]Yunsong Zhou, Linyan Huang, Qingwen Bu, Jia Zeng, Tianyu Li, Hang Qiu, Hongzi Zhu, Minyi Guo, Yu Qiao, Hongyang Li:
Embodied Understanding of Driving Scenarios. CoRR abs/2403.04593 (2024) - [i376]Kunchang Li, Xinhao Li, Yi Wang, Yinan He, Yali Wang, Limin Wang, Yu Qiao:
VideoMamba: State Space Model for Efficient Video Understanding. CoRR abs/2403.06977 (2024) - [i375]Qibing Ren, Chang Gao, Jing Shao, Junchi Yan, Xin Tan, Yu Qiao, Wai Lam, Lizhuang Ma:
Exploring Safety Generalization Challenges of Large Language Models via Code. CoRR abs/2403.07865 (2024) - [i374]Hao Zhang, Wenqi Shao, Hong Liu, Yongqiang Ma, Ping Luo, Yu Qiao, Kaipeng Zhang:
AVIBench: Towards Evaluating the Robustness of Large Vision-Language Model on Adversarial Visual-Instructions. CoRR abs/2403.09346 (2024) - [i373]Jiazhi Yang, Shenyuan Gao, Yihang Qiu, Li Chen, Tianyu Li, Bo Dai, Kashyap Chitta, Penghao Wu, Jia Zeng, Ping Luo, Jun Zhang, Andreas Geiger, Yu Qiao, Hongyang Li:
Generalized Predictive Model for Autonomous Driving. CoRR abs/2403.09630 (2024) - [i372]Enshen Zhou, Yiran Qin, Zhenfei Yin, Yuzhou Huang, Ruimao Zhang, Lu Sheng, Yu Qiao, Jing Shao:
MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulated-World Control. CoRR abs/2403.12037 (2024) - [i371]Yi Wang, Kunchang Li, Xinhao Li, Jiashuo Yu, Yinan He, Guo Chen, Baoqi Pei, Rongkun Zheng, Jilan Xu, Zun Wang, Yansong Shi, Tianxiang Jiang, Songze Li, Hongjie Zhang, Yifei Huang, Yu Qiao, Yali Wang, Limin Wang:
InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding. CoRR abs/2403.15377 (2024) - [i370]Yifei Huang, Guo Chen, Jilan Xu, Mingfang Zhang, Lijin Yang, Baoqi Pei, Hongjie Zhang, Lu Dong, Yali Wang, Limin Wang, Yu Qiao:
EgoExoLearn: A Dataset for Bridging Asynchronous Ego- and Exo-centric View of Procedural Activities in Real World. CoRR abs/2403.16182 (2024) - [i369]Zhelun Shi, Zhipin Wang, Hongxing Fan, Zaibin Zhang, Lijun Li, Yongting Zhang, Zhenfei Yin, Lu Sheng, Yu Qiao, Jing Shao:
Assessment of Multimodal Large Language Models in Alignment with Human Values. CoRR abs/2403.17830 (2024) - [i368]Yutong Chen, Yifan Zhan, Zhihang Zhong, Wei Wang, Xiao Sun, Yu Qiao, Yinqiang Zheng:
Within the Dynamic Context: Inertia-aware 3D Human Modeling with Pose Sequence. CoRR abs/2403.19160 (2024) - [i367]Zeren Chen, Zhelun Shi, Xiaoya Lu, Lehan He, Sucheng Qian, Haoshu Fang, Zhenfei Yin, Wanli Ouyang, Jing Shao, Yu Qiao, Cewu Lu, Lu Sheng:
RH20T-P: A Primitive-Level Robotic Dataset Towards Composable Generalization Agents. CoRR abs/2403.19622 (2024) - [i366]Shuo Liu, Kaining Ying, Hao Zhang, Yue Yang, Yuqi Lin, Tianle Zhang, Chuanhao Li, Yu Qiao, Ping Luo, Wenqi Shao, Kaipeng Zhang:
ConvBench: A Multi-Turn Conversation Evaluation Benchmark with Hierarchical Capability for Large Vision-Language Models. CoRR abs/2403.20194 (2024) - [i365]Lin Chen, Jinsong Li, Xiaoyi Dong, Pan Zhang, Yuhang Zang, Zehui Chen, Haodong Duan, Jiaqi Wang, Yu Qiao, Dahua Lin, Feng Zhao:
Are We on the Right Way for Evaluating Large Vision-Language Models? CoRR abs/2403.20330 (2024) - [i364]Bo Zou, Chao Yang, Yu Qiao, Chengbin Quan, Youjian Zhao:
LLaMA-Excitor: General Instruction Tuning via Indirect Feature Interaction. CoRR abs/2404.00913 (2024) - [i363]Bo Zou, Chao Yang, Yu Qiao, Chengbin Quan, Youjian Zhao:
VideoDistill: Language-aware Vision Distillation for Video Question Answering. CoRR abs/2404.00973 (2024) - [i362]Lirui Zhao, Yue Yang, Kaipeng Zhang, Wenqi Shao, Yuxin Zhang, Yu Qiao, Ping Luo, Rongrong Ji:
DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model. CoRR abs/2404.01342 (2024) - [i361]Hao Wu, Huabin Liu, Yu Qiao, Xiao Sun:
DIBS: Enhancing Dense Video Captioning with Unlabeled Videos via Pseudo Boundary Enrichment and Online Refinement. CoRR abs/2404.02755 (2024) - [i360]Weigao Sun, Zhen Qin, Dong Li, Xuyang Shen, Yu Qiao, Yiran Zhong:
Linear Attention Sequence Parallelism. CoRR abs/2404.02882 (2024) - [i359]Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Bin Wang, Linke Ouyang, Songyang Zhang, Haodong Duan, Wenwei Zhang, Yining Li, Hang Yan, Yang Gao, Zhe Chen, Xinyue Zhang, Wei Li, Jingwen Li, Wenhai Wang, Kai Chen, Conghui He, Xingcheng Zhang, Jifeng Dai, Yu Qiao, Dahua Lin, Jiaqi Wang:
InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD. CoRR abs/2404.06512 (2024) - [i358]Kaining Ying, Fanqing Meng, Jin Wang, Zhiqian Li, Han Lin, Yue Yang, Hao Zhang, Wenbo Zhang, Yuqi Lin, Shuo Liu, Jiayi Lei, Quanfeng Lu, Runjian Chen, Peng Xu, Renrui Zhang, Haozhe Zhang, Peng Gao, Yali Wang, Yu Qiao, Ping Luo, Kaipeng Zhang, Wenqi Shao:
MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI. CoRR abs/2404.16006 (2024) - [i357]Zhe Chen, Weiyun Wang, Hao Tian, Shenglong Ye, Zhangwei Gao, Erfei Cui, Wenwen Tong, Kongzhi Hu, Jiapeng Luo, Zheng Ma, Ji Ma, Jiaqi Wang, Xiaoyi Dong, Hang Yan, Hewei Guo, Conghui He, Botian Shi, Zhenjiang Jin, Chao Xu, Bin Wang, Xingjian Wei, Wei Li, Wenjian Zhang, Bo Zhang, Pinlong Cai, Licheng Wen, Xiangchao Yan, Min Dou, Lewei Lu, Xizhou Zhu, Tong Lu, Dahua Lin, Yu Qiao, Jifeng Dai, Wenhai Wang:
How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites. CoRR abs/2404.16821 (2024) - [i356]Ziyan Chen, Jingwen He, Xinqi Lin, Yu Qiao, Chao Dong:
Towards Real-world Video Face Restoration: A New Benchmark. CoRR abs/2404.19500 (2024) - [i355]Sirui Chen, Bo Peng, Meiqi Chen, Ruiqi Wang, Mengying Xu, Xingyu Zeng, Rui Zhao, Shengjie Zhao, Yu Qiao, Chaochao Lu:
Causal Evaluation of Language Models. CoRR abs/2405.00622 (2024) - [i354]Peng Gao, Le Zhuo, Dongyang Liu, Ruoyi Du, Xu Luo, Longtian Qiu, Yuhang Zhang, Chen Lin, Rongjie Huang, Shijie Geng, Renrui Zhang, Junlin Xi, Wenqi Shao, Zhengkai Jiang, Tianshuo Yang, Weicai Ye, He Tong, Jingwen He, Yu Qiao, Hongsheng Li:
Lumina-T2X: Transforming Text into Any Modality, Resolution, and Duration via Flow-based Large Diffusion Transformers. CoRR abs/2405.05945 (2024) - [i353]Chuanhao Li, Zhen Li, Chenchen Jing, Shuo Liu, Wenqi Shao, Yuwei Wu, Ping Luo, Yu Qiao, Kaipeng Zhang:
UDKAG: Augmenting Large Vision-Language Models with Up-to-Date Knowledge. CoRR abs/2405.14554 (2024) - [i352]Chongjie Si, Xuehui Wang, Xue Yang, Zhengqin Xu, Qingyun Li, Jifeng Dai, Yu Qiao, Xiaokang Yang, Wei Shen:
FLoRA: Low-Rank Core Space for N-dimension. CoRR abs/2405.14739 (2024) - [i351]Jianbiao Mei, Yukai Ma, Xuemeng Yang, Licheng Wen, Xinyu Cai, Xin Li, Daocheng Fu, Bo Zhang, Pinlong Cai, Min Dou, Botian Shi, Liang He, Yong Liu, Yu Qiao:
Continuously Learning, Adapting, and Improving: A Dual-Process Approach to Autonomous Driving. CoRR abs/2405.15324 (2024) - [i350]Zhanhui Zhou, Zhixuan Liu, Jie Liu, Zhichen Dong, Chao Yang, Yu Qiao:
Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models. CoRR abs/2405.19262 (2024) - [i349]Jia Zeng, Qingwen Bu, Bangjun Wang, Wenke Xia, Li Chen, Hao Dong, Haoming Song, Dong Wang, Di Hu, Ping Luo, Heming Cui, Bin Zhao, Xuelong Li, Yu Qiao, Hongyang Li:
Learning Manipulation by Predicting Interaction. CoRR abs/2406.00439 (2024) - [i348]Hao Wen, Zehuan Huang, Yaohui Wang, Xinyuan Chen, Yu Qiao, Lu Sheng:
Ouroboros3D: Image-to-3D Generation via 3D-aware Recursive Diffusion. CoRR abs/2406.03184 (2024) - [i347]Lin Chen, Xilin Wei, Jinsong Li, Xiaoyi Dong, Pan Zhang, Yuhang Zang, Zehui Chen, Haodong Duan, Bin Lin, Zhenyu Tang, Li Yuan, Yu Qiao, Dahua Lin, Feng Zhao, Jiaqi Wang:
ShareGPT4Video: Improving Video Understanding and Generation with Better Captions. CoRR abs/2406.04325 (2024) - [i346]Xizhou Zhu, Xue Yang, Zhaokai Wang, Hao Li, Wenhan Dou, Junqi Ge, Lewei Lu, Yu Qiao, Jifeng Dai:
Parameter-Inverted Image Pyramid Networks. CoRR abs/2406.04330 (2024) - [i345]Chenxin Tao, Xizhou Zhu, Shiqian Su, Lewei Lu, Changyao Tian, Xuan Luo, Gao Huang, Hongsheng Li, Yu Qiao, Jie Zhou, Jifeng Dai:
Learning 1D Causal Visual Representation with De-focus Attention Networks. CoRR abs/2406.04342 (2024) - [i344]Weiyun Wang, Shuibo Zhang, Yiming Ren, Yuchen Duan, Tiantong Li, Shuo Liu, Mengkang Hu, Zhe Chen, Kaipeng Zhang, Lewei Lu, Xizhou Zhu, Ping Luo, Yu Qiao, Jifeng Dai, Wenqi Shao, Wenhai Wang:
Needle In A Multimodal Haystack. CoRR abs/2406.07230 (2024) - [i343]Chenyu Yang, Xizhou Zhu, Jinguo Zhu, Weijie Su, Junjie Wang, Xuan Dong, Wenhai Wang, Lewei Lu, Bin Li, Jie Zhou, Yu Qiao, Jifeng Dai:
Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning. CoRR abs/2406.07543 (2024) - [i342]Jiannan Wu, Muyan Zhong, Sen Xing, Zeqiang Lai, Zhaoyang Liu, Wenhai Wang, Zhe Chen, Xizhou Zhu, Lewei Lu, Tong Lu, Ping Luo, Yu Qiao, Jifeng Dai:
VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks. CoRR abs/2406.08394 (2024) - [i341]Qingyun Li, Zhe Chen, Weiyun Wang, Wenhai Wang, Shenglong Ye, Zhenjiang Jin, Guanzhou Chen, Yinan He, Zhangwei Gao, Erfei Cui, Jiashuo Yu, Hao Tian, Jiasheng Zhou, Chao Xu, Bin Wang, Xingjian Wei, Wei Li, Wenjian Zhang, Bo Zhang, Pinlong Cai, Licheng Wen, Xiangchao Yan, Zhenxiang Li, Pei Chu, Yi Wang, Min Dou, Changyao Tian, Xizhou Zhu, Lewei Lu, Yushi Chen, Junjun He, Zhongying Tu, Tong Lu, Yali Wang, Limin Wang, Dahua Lin, Yu Qiao, Botian Shi, Conghui He, Jifeng Dai:
OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text. CoRR abs/2406.08418 (2024) - [i340]Quanfeng Lu, Wenqi Shao, Zitao Liu, Fanqing Meng, Boxuan Li, Botong Chen, Siyuan Huang, Kaipeng Zhang, Yu Qiao, Ping Luo:
GUI Odyssey: A Comprehensive Dataset for Cross-App GUI Navigation on Mobile Devices. CoRR abs/2406.08451 (2024) - [i339]Tianle Zhang, Langtian Ma, Yuchen Yan, Yuchen Zhang, Kai Wang, Yue Yang, Ziyao Guo, Wenqi Shao, Yang You, Yu Qiao, Ping Luo, Kaipeng Zhang:
Rethinking Human Evaluation Protocol for Text-to-Video Models: Enhancing Reliability, Reproducibility, and Practicality. CoRR abs/2406.08845 (2024) - [i338]Renqiu Xia, Song Mao, Xiangchao Yan, Hongbin Zhou, Bo Zhang, Haoyang Peng, Jiahao Pi, Daocheng Fu, Wenjie Wu, Hancheng Ye, Shiyang Feng, Bin Wang, Chao Xu, Conghui He, Pinlong Cai, Min Dou, Botian Shi, Sheng Zhou, Yongwei Wang, Bin Wang, Junchi Yan, Fei Wu, Yu Qiao:
DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Language Models. CoRR abs/2406.11633 (2024) - [i337]Fanqing Meng, Wenqi Shao, Lixin Luo, Yahong Wang, Yiran Chen, Quanfeng Lu, Yue Yang, Tianshuo Yang, Kaipeng Zhang, Yu Qiao, Ping Luo:
PhyBench: A Physical Commonsense Benchmark for Evaluating Text-to-Image Models. CoRR abs/2406.11802 (2024) - [i336]Ziyu Liu, Tao Chu, Yuhang Zang, Xilin Wei, Xiaoyi Dong, Pan Zhang, Zijian Liang, Yuanjun Xiong, Yu Qiao, Dahua Lin, Jiaqi Wang:
MMDU: A Multi-Turn Multi-Image Dialog Understanding Benchmark and Instruction-Tuning Dataset for LVLMs. CoRR abs/2406.11833 (2024) - [i335]Yongting Zhang, Lu Chen, Guodong Zheng, Yifeng Gao, Rui Zheng, Jinlan Fu, Zhenfei Yin, Senjie Jin, Yu Qiao, Xuanjing Huang, Feng Zhao, Tao Gui, Jing Shao:
SPA-VL: A Comprehensive Safety Preference Alignment Dataset for Vision Language Model. CoRR abs/2406.12030 (2024) - [i334]Zhen Huang, Zengzhi Wang, Shijie Xia, Xuefeng Li, Haoyang Zou, Ruijie Xu, Run-Ze Fan, Lyumanshan Ye, Ethan Chern, Yixin Ye, Yikai Zhang, Yuqing Yang, Ting Wu, Binjie Wang, Shichao Sun, Yang Xiao, Yiyuan Li, Fan Zhou, Steffi Chern, Yiwei Qin, Yan Ma, Jiadi Su, Yixiu Liu, Yuxiang Zheng, Shaoting Zhang, Dahua Lin, Yu Qiao, Pengfei Liu:
OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI. CoRR abs/2406.12753 (2024) - [i333]Baoqi Pei, Guo Chen, Jilan Xu, Yuping He, Yicheng Liu, Kanghua Pan, Yifei Huang, Yali Wang, Tong Lu, Limin Wang, Yu Qiao:
EgoVideo: Exploring Egocentric Foundation Model and Downstream Adaptation. CoRR abs/2406.18070 (2024) - [i332]Le Zhuo, Ruoyi Du, Han Xiao, Yangguang Li, Dongyang Liu, Rongjie Huang, Wenze Liu, Lirui Zhao, Fu-Yun Wang, Zhanyu Ma, Xu Luo, Zehan Wang, Kaipeng Zhang, Xiangyang Zhu, Si Liu, Xiangyu Yue, Dingning Liu, Wanli Ouyang, Ziwei Liu, Yu Qiao, Hongsheng Li, Peng Gao:
Lumina-Next: Making Lumina-T2X Stronger and Faster with Next-DiT. CoRR abs/2406.18583 (2024) - [i331]Pan Zhang, Xiaoyi Dong, Yuhang Zang, Yuhang Cao, Rui Qian, Lin Chen, Qipeng Guo, Haodong Duan, Bin Wang, Linke Ouyang, Songyang Zhang, Wenwei Zhang, Yining Li, Yang Gao, Peng Sun, Xinyue Zhang, Wei Li, Jingwen Li, Wenhai Wang, Hang Yan, Conghui He, Xingcheng Zhang, Kai Chen, Jifeng Dai, Yu Qiao, Dahua Lin, Jiaqi Wang:
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output. CoRR abs/2407.03320 (2024) - [i330]Jingwen He, Tianfan Xue, Dongyang Liu, Xinqi Lin, Peng Gao, Dahua Lin, Yu Qiao, Wanli Ouyang, Ziwei Liu:
VEnhancer: Generative Space-Time Enhancement for Video Generation. CoRR abs/2407.07667 (2024) - [i329]Wenshuo Peng, Kaipeng Zhang, Yue Yang, Hao Zhang, Yu Qiao:
Data Adaptive Traceback for Vision-Language Foundation Models in Image Classification. CoRR abs/2407.08787 (2024) - [i328]Hanqing Wang, Jiahe Chen, Wensi Huang, Qingwei Ben, Tai Wang, Boyu Mi, Tao Huang, Siheng Zhao, Yilun Chen, Sizhe Yang, Peizhou Cao, Wenye Yu, Zichao Ye, Jialun Li, Junfeng Long, Zirui Wang, Huiling Wang, Ying Zhao, Zhongying Tu, Yu Qiao, Dahua Lin, Jiangmiao Pang:
GRUtopia: Dream General Robots in a City at Scale. CoRR abs/2407.10943 (2024) - [i327]Mengzhao Chen, Wenqi Shao, Peng Xu, Jiahao Wang, Peng Gao, Kaipeng Zhang, Yu Qiao, Ping Luo:
EfficientQAT: Efficient Quantization-Aware Training for Large Language Models. CoRR abs/2407.11062 (2024) - [i326]Xuhong Wang, Haoyu Jiang, Yi Yu, Jingru Yu, Yilun Lin, Ping Yi, Yingchun Wang, Yu Qiao, Li Li, Fei-Yue Wang:
Building Intelligence Identification System via Large Language Model Watermarking: A Survey and Beyond. CoRR abs/2407.11100 (2024) - [i325]Yi Yu, Jingru Yu, Xuhong Wang, Juanjuan Li, Yilun Lin, Conghui He, Yanqing Yang, Yu Qiao, Li Li, Fei-Yue Wang:
Navigating the Data Trading Crossroads: An Interdisciplinary Survey. CoRR abs/2407.11466 (2024) - [i324]Shuo Cao, Yihao Liu, Wenlong Zhang, Yu Qiao, Chao Dong:
GRIDS: Grouped Multiple-Degradation Restoration with Image Degradation Similarity. CoRR abs/2407.12273 (2024) - [i323]Jie Zhang, Dongrui Liu, Chen Qian, Ziyue Gan, Yong Liu, Yu Qiao, Jing Shao:
The Better Angels of Machine Personality: How Personality Relates to LLM Safety. CoRR abs/2407.12344 (2024) - [i322]Rongkun Zheng, Lu Qi, Xi Chen, Yi Wang, Kun Wang, Yu Qiao, Hengshuang Zhao:
ViLLa: Video Reasoning Segmentation with Large Language Model. CoRR abs/2407.14500 (2024) - [i321]Xin Ma, Yaohui Wang, Gengyun Jia, Xinyuan Chen, Yuan-Fang Li, Cunjian Chen, Yu Qiao:
Cinemo: Consistent and Controllable Image Animation with Motion Diffusion Models. CoRR abs/2407.15642 (2024) - [i320]Yangzhou Liu, Yue Cao, Zhangwei Gao, Weiyun Wang, Zhe Chen, Wenhai Wang, Hao Tian, Lewei Lu, Xizhou Zhu, Tong Lu, Yu Qiao, Jifeng Dai:
MMInstruct: A High-Quality Multi-Modal Instruction Tuning Dataset with Extensive Diversity. CoRR abs/2407.15838 (2024) - [i319]Jingru Yu, Yi Yu, Xuhong Wang, Yilun Lin, Manzhi Yang, Yu Qiao, Fei-Yue Wang:
The Shadow of Fraud: The Emerging Danger of AI-powered Social Engineering and its Possible Cure. CoRR abs/2407.15912 (2024) - [i318]Lirui Zhao, Tianshuo Yang, Wenqi Shao, Yuxin Zhang, Yu Qiao, Ping Luo, Kaipeng Zhang, Rongrong Ji:
Diffree: Text-Guided Shape Free Object Inpainting with Diffusion Model. CoRR abs/2407.16982 (2024) - [i317]Xuemeng Yang, Licheng Wen, Yukai Ma, Jianbiao Mei, Xin Li, Tiantian Wei, Wenjie Lei, Daocheng Fu, Pinlong Cai, Min Dou, Botian Shi, Liang He, Yong Liu, Yu Qiao:
DriveArena: A Closed-loop Generative Simulation Platform for Autonomous Driving. CoRR abs/2408.00415 (2024) - [i316]Dongyang Liu, Shitian Zhao, Le Zhuo, Weifeng Lin, Yu Qiao, Hongsheng Li, Peng Gao:
Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining. CoRR abs/2408.02657 (2024) - [i315]Fanqing Meng, Jin Wang, Chuanhao Li, Quanfeng Lu, Hao Tian, Jiaqi Liao, Xizhou Zhu, Jifeng Dai, Yu Qiao, Ping Luo, Kaipeng Zhang, Wenqi Shao:
MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models. CoRR abs/2408.02718 (2024) - [i314]Zihan Li, Diping Song, Zefeng Yang, Deming Wang, Fei Li, Xiulan Zhang, Paul E. Kinahan, Yu Qiao:
VisionUnite: A Vision-Language Foundation Model for Ophthalmology Enhanced with Clinical Knowledge. CoRR abs/2408.02865 (2024) - [i313]Pengcheng Chen, Jin Ye, Guoan Wang, Yanjun Li, Zhongying Deng, Wei Li, Tianbin Li, Haodong Duan, Ziyan Huang, Yanzhou Su, Benyou Wang, Shaoting Zhang, Bin Fu, Jianfei Cai, Bohan Zhuang, Eric J. Seibel, Junjun He, Yu Qiao:
GMAI-MMBench: A Comprehensive Multimodal Evaluation Benchmark Towards General Medical AI. CoRR abs/2408.03361 (2024) - [i312]Xiangyu Chen, Yihao Liu, Yuandong Pu, Wenlong Zhang, Jiantao Zhou, Yu Qiao, Chao Dong:
Learning A Low-Level Vision Generalist via Visual Task Prompt. CoRR abs/2408.08601 (2024) - [i311]Yanbo Ding, Shaobin Zhuang, Kunchang Li, Zhengrong Yue, Yu Qiao, Yali Wang:
MUSES: 3D-Controllable Image Generation via Multi-Modal Agent Collaboration. CoRR abs/2408.10605 (2024) - [i310]Xiangtao Kong, Jinjin Gu, Yihao Liu, Wenlong Zhang, Xiangyu Chen, Yu Qiao, Chao Dong:
A Preliminary Exploration Towards General Image Restoration. CoRR abs/2408.15143 (2024) - [i309]Junyi Chen, Weicai Ye, Yifan Wang, Danpeng Chen, Di Huang, Wanli Ouyang, Guofeng Zhang, Yu Qiao, Tong He:
GigaGS: Scaling up Planar-Based 3D Gaussians for Large Scene Surface Reconstruction. CoRR abs/2409.06685 (2024) - [i308]Weifeng Lin, Xinyu Wei, Renrui Zhang, Le Zhuo, Shitian Zhao, Siyuan Huang, Junlin Xi, Yu Qiao, Peng Gao, Hongsheng Li:
PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions. CoRR abs/2409.15278 (2024) - [i307]Fuxian Huang, Qi Zhang, Shaopeng Zhai, Jie Wang, Tianyi Zhang, Haoran Zhang, Ming Zhou, Yu Liu, Yu Qiao:
CLSP: High-Fidelity Contrastive Language-State Pre-training for Agent State Representation. CoRR abs/2409.15806 (2024) - [i306]Bin Wang, Chao Xu, Xiaomeng Zhao, Linke Ouyang, Fan Wu, Zhiyuan Zhao, Rui Xu, Kaiwen Liu, Yuan Qu, Fukai Shang, Bo Zhang, Liqun Wei, Zhihao Sui, Wei Li, Botian Shi, Yu Qiao, Dahua Lin, Conghui He:
MinerU: An Open-Source Solution for Precise Document Content Extraction. CoRR abs/2409.18839 (2024) - [i305]Fanqing Meng, Jiaqi Liao, Xinyu Tan, Wenqi Shao, Quanfeng Lu, Kaipeng Zhang, Yu Cheng, Dianqi Li, Yu Qiao, Ping Luo:
Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation. CoRR abs/2410.05363 (2024) - [i304]Qingwen Bu, Hongyang Li, Li Chen, Jisong Cai, Jia Zeng, Heming Cui, Maoqing Yao, Yu Qiao:
Towards Synergistic, Generalized, and Efficient Dual-System for Robotic Manipulation. CoRR abs/2410.08001 (2024) - [i303]Yifan Zhan, Qingtian Zhu, Muyao Niu, Mingze Ma, Jiancheng Zhao, Zhihang Zhong, Xiao Sun, Yu Qiao, Yinqiang Zheng:
ToMiE: Towards Modular Growth in Enhanced SMPL Skeleton for 3D Human with Animatable Garments. CoRR abs/2410.08082 (2024) - [i302]Gen Luo, Xue Yang, Wenhan Dou, Zhaokai Wang, Jifeng Dai, Yu Qiao, Xizhou Zhu:
Mono-InternVL: Pushing the Boundaries of Monolithic Multimodal Large Language Models with Endogenous Visual Pre-training. CoRR abs/2410.08202 (2024) - [i301]Qibing Ren, Hao Li, Dongrui Liu, Zhanxu Xie, Xiaoya Lu, Yu Qiao, Lei Sha, Junchi Yan, Lizhuang Ma, Jing Shao:
Derail Yourself: Multi-turn LLM Jailbreak Attack through Self-discovered Clues. CoRR abs/2410.10700 (2024) - [i300]Ying Chen, Guoan Wang, Yuanfeng Ji, Yanjun Li, Jin Ye, Tianbin Li, Bin Zhang, Nana Pei, Rongshan Yu, Yu Qiao, Junjun He:
SlideChat: A Large Vision-Language Assistant for Whole-Slide Pathology Image Understanding. CoRR abs/2410.11761 (2024) - [i299]Yiwei Guo, Shaobin Zhuang, Kunchang Li, Yu Qiao, Yali Wang:
TransAgent: Transfer Vision-Language Foundation Models with Heterogeneous Agent Collaboration. CoRR abs/2410.12183 (2024) - [i298]Jie Zhang, Dongrui Liu, Chen Qian, Linfeng Zhang, Yong Liu, Yu Qiao, Jing Shao:
REEF: Representation Encoding Fingerprints for Large Language Models. CoRR abs/2410.14273 (2024) - [i297]Zhi Hou, Tianyi Zhang, Yuwen Xiong, Hengjun Pu, Chengyang Zhao, Ronglei Tong, Yu Qiao, Jifeng Dai, Yuntao Chen:
Diffusion Transformer Policy. CoRR abs/2410.15959 (2024) - [i296]Zhangwei Gao, Zhe Chen, Erfei Cui, Yiming Ren, Weiyun Wang, Jinguo Zhu, Hao Tian, Shenglong Ye, Junjun He, Xizhou Zhu, Lewei Lu, Tong Lu, Yu Qiao, Jifeng Dai, Wenhai Wang:
Mini-InternVL: A Flexible-Transfer Pocket Multimodal Model with 5% Parameters and 90% Performance. CoRR abs/2410.16261 (2024) - [i295]Kaiwen Zhu, Jinjin Gu, Zhiyuan You, Yu Qiao, Chao Dong:
An Intelligent Agentic System for Complex Image Restoration Problems. CoRR abs/2410.17809 (2024) - [i294]Hengwei Bian, Lingdong Kong, Haozhe Xie, Liang Pan, Yu Qiao, Ziwei Liu:
DynamicCity: Large-Scale LiDAR Generation from Dynamic Scenes. CoRR abs/2410.18084 (2024) - [i293]Zhengyao Lv, Chenyang Si, Junhao Song, Zhenyu Yang, Yu Qiao, Ziwei Liu, Kwan-Yee K. Wong:
FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality. CoRR abs/2410.19355 (2024) - [i292]Yu Qiao, Lina Gong, Yu Zhao, Yongwei Wang, Mingqiang Wei:
DeMuVGN: Effective Software Defect Prediction Model by Learning Multi-view Software Dependency via Graph Neural Networks. CoRR abs/2410.19550 (2024) - [i291]Xiangyu Zeng, Kunchang Li, Chenting Wang, Xinhao Li, Tianxiang Jiang, Ziang Yan, Songze Li, Yansong Shi, Zhengrong Yue, Yi Wang, Yali Wang, Yu Qiao, Limin Wang:
TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning. CoRR abs/2410.19702 (2024) - 2023
- [j92]Ruyun Hu, Lihao Fu, Yongcan Chen, Junyu Chen, Yu Qiao, Tong Si:
Protein engineering via Bayesian optimization-guided evolutionary algorithm and robotic experiments. Briefings Bioinform. 24(1) (2023) - [j91]Mingye Xu, Zhipeng Zhou, Yali Wang, Yu Qiao:
Towards robustness and generalization of point cloud representation: A geometry coding method and a large-scale object-level dataset. Comput. Vis. Media 10(1): 27-43 (2023) - [j90]Kaiyang Zhou, Ziwei Liu, Yu Qiao, Tao Xiang, Chen Change Loy:
Domain Generalization: A Survey. IEEE Trans. Pattern Anal. Mach. Intell. 45(4): 4396-4415 (2023) - [j89]Anran Liu, Yihao Liu, Jinjin Gu, Yu Qiao, Chao Dong:
Blind Image Super-Resolution: A Survey and Beyond. IEEE Trans. Pattern Anal. Mach. Intell. 45(5): 5461-5480 (2023) - [j88]Mingye Xu, Yali Wang, Yihao Liu, Tong He, Yu Qiao:
CP3: Unifying Point Cloud Completion by Pretrain-Prompt-Predict Paradigm. IEEE Trans. Pattern Anal. Mach. Intell. 45(8): 9583-9594 (2023) - [j87]Kunchang Li, Yali Wang, Junhao Zhang, Peng Gao, Guanglu Song, Yu Liu, Hongsheng Li, Yu Qiao:
UniFormer: Unifying Convolution and Self-Attention for Visual Recognition. IEEE Trans. Pattern Anal. Mach. Intell. 45(10): 12581-12600 (2023) - [j86]Yihao Liu, Hengyuan Zhao, Jinjin Gu, Yu Qiao, Chao Dong:
Evaluating the Generalization Ability of Super-Resolution Networks. IEEE Trans. Pattern Anal. Mach. Intell. 45(12): 14497-14513 (2023) - [j85]Weicong Su, Yali Wang, Kunchang Li, Peng Gao, Yu Qiao:
Hybrid token transformer for deep face recognition. Pattern Recognit. 139: 109443 (2023) - [j84]Shihua Li, Haobin Chen, Shijie Yu, Zhiqun He, Feng Zhu, Rui Zhao, Jie Chen, Yu Qiao:
COCAS+: Large-Scale Clothes-Changing Person Re-Identification With Clothes Templates. IEEE Trans. Circuits Syst. Video Technol. 33(4): 1839-1853 (2023) - [j83]Ming Li, Bin Fu, Zhengfu Zhang, Yu Qiao:
Character-Aware Sampling and Rectification for Scene Text Recognition. IEEE Trans. Multim. 25: 649-661 (2023) - [j82]Shixiang Wu, Chao Dong, Yu Qiao:
Blind Image Restoration Based on Cycle-Consistent Network. IEEE Trans. Multim. 25: 1111-1124 (2023) - [j81]Ming Li, Bin Fu, Han Chen, Junjun He, Yu Qiao:
Dual Relation Network for Scene Text Recognition. IEEE Trans. Multim. 25: 4094-4107 (2023) - [j80]Yihao Liu, Jingwen He, Xiangyu Chen, Zhengwen Zhang, Hengyuan Zhao, Chao Dong, Yu Qiao:
Very Lightweight Photo Retouching Network With Conditional Sequential Modulation. IEEE Trans. Multim. 25: 4638-4652 (2023) - [j79]Qitong Wang, Bin Fu, Ming Li, Junjun He, Xi Peng, Yu Qiao:
Region-Aware Arbitrary-Shaped Text Detection With Progressive Fusion. IEEE Trans. Multim. 25: 4718-4729 (2023) - [j78]Yu Qiao, Yuhao Liu, Ziqi Wei, Yuxin Wang, Qiang Cai, Guofeng Zhang, Xin Yang:
Hierarchical and Progressive Image Matting. ACM Trans. Multim. Comput. Commun. Appl. 19(2): 52:1-52:23 (2023) - [j77]Shidong Wang, Wei Zeng, Xi Chen, Yu Ye, Yu Qiao, Chi-Wing Fu:
ActFloor-GAN: Activity-Guided Adversarial Networks for Human-Centric Floorplan Design. IEEE Trans. Vis. Comput. Graph. 29(3): 1610-1624 (2023) - [c275]Jia Zeng, Li Chen, Hanming Deng, Lewei Lu, Junchi Yan, Yu Qiao, Hongyang Li:
Distilling Focal Knowledge from Imperfect Expert for 3D Object Detection. CVPR 2023: 992-1001 - [c274]Chenxin Tao, Xizhou Zhu, Weijie Su, Gao Huang, Bin Li, Jie Zhou, Yu Qiao, Xiaogang Wang, Jifeng Dai:
Siamese Image Modeling for Self-Supervised Vision Representation Learning. CVPR 2023: 2132-2141 - [c273]Hao Li, Jinguo Zhu, Xiaohu Jiang, Xizhou Zhu, Hongsheng Li, Chun Yuan, Xiaohua Wang, Yu Qiao, Xiaogang Wang, Wenhai Wang, Jifeng Dai:
Uni-Perceiver v2: A Generalist Model for Large-Scale Vision and Vision-Language Tasks. CVPR 2023: 2691-2700 - [c272]Jilan Xu, Junlin Hou, Yuejie Zhang, Rui Feng, Yi Wang, Yu Qiao, Weidi Xie:
Learning Open-Vocabulary Semantic Segmentation Models From Natural Language Supervision. CVPR 2023: 2935-2944 - [c271]Mingye Xu, Mutian Xu, Tong He, Wanli Ouyang, Yali Wang, Xiaoguang Han, Yu Qiao:
MM-3DScene: 3D Scene Understanding by Customizing Masked Modeling with Informative-Preserved Reconstruction and Self-Distilled Consistency. CVPR 2023: 4380-4390 - [c270]Runnan Chen, Youquan Liu, Lingdong Kong, Xinge Zhu, Yuexin Ma, Yikang Li, Yuenan Hou, Yu Qiao, Wenping Wang:
CLIP2Scene: Towards Label-efficient 3D Scene Understanding by CLIP. CVPR 2023: 7020-7030 - [c269]Bo Zhang, Jiakang Yuan, Botian Shi, Tao Chen, Yikang Li, Yu Qiao:
Uni3D: A Unified Baseline for Multi-Dataset 3D Object Detection. CVPR 2023: 9253-9262 - [c268]Xuyang Shen, Dong Li, Jinxing Zhou, Zhen Qin, Bowen He, Xiaodong Han, Aixuan Li, Yuchao Dai, Lingpeng Kong, Meng Wang, Yu Qiao, Yiran Zhong:
Fine-grained Audible Video Description. CVPR 2023: 10585-10596 - [c267]Wenhai Wang, Jifeng Dai, Zhe Chen, Zhenhang Huang, Zhiqi Li, Xizhou Zhu, Xiaowei Hu, Tong Lu, Lewei Lu, Hongsheng Li, Xiaogang Wang, Yu Qiao:
InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions. CVPR 2023: 14408-14419 - [c266]Limin Wang, Bingkun Huang, Zhiyu Zhao, Zhan Tong, Yinan He, Yi Wang, Yali Wang, Yu Qiao:
VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking. CVPR 2023: 14549-14560 - [c265]Renrui Zhang, Xiangfei Hu, Bohao Li, Siyuan Huang, Hanqiu Deng, Yu Qiao, Peng Gao, Hongsheng Li:
Prompt, Generate, Then Cache: Cascade of Foundation Models Makes Strong Few-Shot Learners. CVPR 2023: 15211-15222 - [c264]Jiakang Yuan, Bo Zhang, Xiangchao Yan, Tao Chen, Botian Shi, Yikang Li, Yu Qiao:
Bi3D: Bi-Domain Active Learning for Cross-Domain 3D Object Detection. CVPR 2023: 15599-15608 - [c263]Weijie Su, Xizhou Zhu, Chenxin Tao, Lewei Lu, Bin Li, Gao Huang, Yu Qiao, Xiaogang Wang, Jie Zhou, Jifeng Dai:
Towards All-in-One Pre-Training via Maximizing Multi-Modal Mutual Information. CVPR 2023: 15888-15899 - [c262]Xin Li, Tao Ma, Yuenan Hou, Botian Shi, Yuchen Yang, Youquan Liu, Xingjiao Wu, Qin Chen, Yikang Li, Yu Qiao, Liang He:
LoGoNet: Towards Accurate 3D Object Detection with Local-to-Global Cross- Modal Fusion. CVPR 2023: 17524-17534 - [c261]Zhaoyang Xia, Youquan Liu, Xin Li, Xinge Zhu, Yuexin Ma, Yikang Li, Yuenan Hou, Yu Qiao:
SCPNet: Semantic Scene Completion on Point Cloud. CVPR 2023: 17642-17651 - [c260]Chenyu Yang, Yuntao Chen, Hao Tian, Chenxin Tao, Xizhou Zhu, Zhaoxiang Zhang, Gao Huang, Hongyang Li, Yu Qiao, Lewei Lu, Jie Zhou, Jifeng Dai:
BEVFormer v2: Adapting Modern Image Backbones to Bird's-Eye-View Recognition via Perspective Supervision. CVPR 2023: 17830-17839 - [c259]Yihan Hu, Jiazhi Yang, Li Chen, Keyu Li, Chonghao Sima, Xizhou Zhu, Siqi Chai, Senyao Du, Tianwei Lin, Wenhai Wang, Lewei Lu, Xiaosong Jia, Qiang Liu, Jifeng Dai, Yu Qiao, Hongyang Li:
Planning-oriented Autonomous Driving. CVPR 2023: 17853-17862 - [c258]Jiaqi Xu, Xiaowei Hu, Lei Zhu, Qi Dou, Jifeng Dai, Yu Qiao, Pheng-Ann Heng:
Video Dehazing via a Multi-Range Temporal Alignment Network with Physical Prior. CVPR 2023: 18053-18062 - [c257]Yurui Zhu, Tianyu Wang, Xueyang Fu, Xuanyu Yang, Xin Guo, Jifeng Dai, Yu Qiao, Xiaowei Hu:
Learning Weather-General and Weather-Specific Features for Image Restoration Under Multiple Adverse Weather Conditions. CVPR 2023: 21747-21758 - [c256]Renrui Zhang, Liuhui Wang, Yu Qiao, Peng Gao, Hongsheng Li:
Learning 3D Representations from 2D Pre-Trained Models via Image-to-Point Masked Autoencoders. CVPR 2023: 21769-21780 - [c255]Xiangyu Chen, Xintao Wang, Jiantao Zhou, Yu Qiao, Chao Dong:
Activating More Pixels in Image Super-Resolution Transformer. CVPR 2023: 22367-22377 - [c254]Rui Tian, Zuxuan Wu, Qi Dai, Han Hu, Yu Qiao, Yu-Gang Jiang:
ResFormer: Scaling ViTs with Multi-Resolution Training. CVPR 2023: 22721-22731 - [c253]Hongwei Xue, Peng Gao, Hongyang Li, Yu Qiao, Hao Sun, Houqiang Li, Jiebo Luo:
Stare at What You See: Masked Image Modeling without Reconstruction. CVPR 2023: 22732-22741 - [c252]Yihao Liu, Jingwen He, Jinjin Gu, Xiangtao Kong, Yu Qiao, Chao Dong:
DegAE: A New Pretraining Paradigm for Low-Level Vision. CVPR 2023: 23292-23303 - [c251]Lingdong Kong, Youquan Liu, Runnan Chen, Yuexin Ma, Xinge Zhu, Yikang Li, Yuenan Hou, Yu Qiao, Ziwei Liu:
Rethinking Range View Representation for LiDAR Segmentation. ICCV 2023: 228-240 - [c250]Kunchang Li, Yali Wang, Yinan He, Yizhuo Li, Yi Wang, Limin Wang, Yu Qiao:
UniFormerV2: Unlocking the Potential of Image ViTs for Video Understanding. ICCV 2023: 1632-1643 - [c249]Tao Ma, Xuemeng Yang, Hongbin Zhou, Xin Li, Botian Shi, Junjie Liu, Yuchen Yang, Zhizheng Liu, Liang He, Yu Qiao, Yikang Li, Hongsheng Li:
DetZero: Rethinking Offboard 3D Object Detection with Long-term Sequential Point Clouds. ICCV 2023: 6713-6724 - [c248]Renrui Zhang, Han Qiu, Tai Wang, Ziyu Guo, Ziteng Cui, Yu Qiao, Hongsheng Li, Peng Gao:
MonoDETR: Depth-guided Transformer for Monocular 3D Object Detection. ICCV 2023: 9121-9132 - [c247]Zun Wang, Jialu Li, Yicong Hong, Yi Wang, Qi Wu, Mohit Bansal, Stephen Gould, Hao Tan, Yu Qiao:
Scaling Data Generation in Vision-and-Language Navigation. ICCV 2023: 11975-11986 - [c246]Mingfei Han, Yali Wang, Zhihui Li, Lina Yao, Xiaojun Chang, Yu Qiao:
HTML: Hybrid Temporal-scale Multimodal Learning Framework for Referring Video Object Segmentation. ICCV 2023: 13368-13377 - [c245]Bingkun Huang, Zhiyu Zhao, Guozhen Zhang, Yu Qiao, Limin Wang:
MGMAE: Motion Guided Masking for Video Masked Autoencoding. ICCV 2023: 13447-13458 - [c244]Lihe Yang, Zhen Zhao, Lei Qi, Yu Qiao, Yinghuan Shi, Hengshuang Zhao:
Shrinking Class Space for Enhanced Certainty in Semi-Supervised Learning. ICCV 2023: 16141-16150 - [c243]Mengzhao Chen, Wenqi Shao, Peng Xu, Mingbao Lin, Kaipeng Zhang, Fei Chao, Rongrong Ji, Yu Qiao, Ping Luo:
DiffRate : Differentiable Compression Rate for Efficient Vision Transformers. ICCV 2023: 17118-17128 - [c242]Kunchang Li, Yali Wang, Yizhuo Li, Yi Wang, Yinan He, Limin Wang, Yu Qiao:
Unmasked Teacher: Towards Training-Efficient Video Foundation Models. ICCV 2023: 19891-19903 - [c241]Youquan Liu, Runnan Chen, Xin Li, Lingdong Kong, Yuchen Yang, Zhaoyang Xia, Yeqi Bai, Xinge Zhu, Yuexin Ma, Yikang Li, Yu Qiao, Yuenan Hou:
UniSeg: A Unified Multi-Modal LiDAR Segmentation Network and the OpenPCSeg Codebase. ICCV 2023: 21605-21616 - [c240]Yu Qiao, Bo Dong, Ao Jin, Yu Fu, Seung-Hwan Baek, Felix Heide, Pieter Peers, Xiaopeng Wei, Xin Yang:
Multi-view Spectral Polarization Propagation for Video Glass Segmentation. ICCV 2023: 23161-23171 - [c239]Junting Pan, Ziyi Lin, Yuying Ge, Xiatian Zhu, Renrui Zhang, Yi Wang, Yu Qiao, Hongsheng Li:
Retrieving-to-Answer: Zero-Shot Video Question Answering with Frozen Large Language Models. ICCV (Workshops) 2023: 272-283 - [c238]Zhe Chen, Yuchen Duan, Wenhai Wang, Junjun He, Tong Lu, Jifeng Dai, Yu Qiao:
Vision Transformer Adapter for Dense Predictions. ICLR 2023 - [c237]Runjian Chen, Yao Mu, Runsen Xu, Wenqi Shao, Chenhan Jiang, Hang Xu, Yu Qiao, Zhenguo Li, Ping Luo:
CO3: Cooperative Unsupervised 3D Representation Learning for Autonomous Driving. ICLR 2023 - [c236]Penghao Wu, Li Chen, Hongyang Li, Xiaosong Jia, Junchi Yan, Yu Qiao:
Policy Pre-training for Autonomous Driving via Self-supervised Geometric Modeling. ICLR 2023 - [c235]Jiashuo Yu, Yaohui Wang, Xinyuan Chen, Xiao Sun, Yu Qiao:
Long-Term Rhythmic Video Soundtracker. ICML 2023: 40339-40353 - [c234]Yu Qiao, Hengyi Zhang, Pengfei Sun, Yuan Tian, Yong Guan, Zhenzhou Shao, Zhiping Shi:
Parallelizable Simple Recurrent Units with Hierarchical Memory. ICONIP (15) 2023: 380-392 - [c233]Licheng Wen, Daocheng Fu, Song Mao, Pinlong Cai, Min Dou, Yikang Li, Yu Qiao:
LimSim: A Long-Term Interactive Multi-Scenario Traffic Simulator. ITSC 2023: 1255-1262 - [c232]Yunkun Zhang, Jin Gao, Mu Zhou, Xiaosong Wang, Yu Qiao, Shaoting Zhang, Dequan Wang:
Text-Guided Foundation Model Adaptation for Pathological Image Classification. MICCAI (5) 2023: 272-282 - [c231]Hongjie Zhang, Yi Liu, Yali Wang, Limin Wang, Yu Qiao:
Learning Discriminative Feature Representation for Open Set Action Recognition. ACM Multimedia 2023: 7696-7705 - [c230]Jinjin Gu, Xianzheng Ma, Xiangtao Kong, Yu Qiao, Chao Dong:
Networks are Slacking Off: Understanding Generalization Problem in Image Deraining. NeurIPS 2023 - [c229]Linyan Huang, Zhiqi Li, Chonghao Sima, Wenhai Wang, Jingdong Wang, Yu Qiao, Hongyang Li:
Leveraging Vision-Centric Multi-Modal Expertise for 3D Object Detection. NeurIPS 2023 - [c228]Fanqing Meng, Wenqi Shao, Zhanglin Peng, Chonghe Jiang, Kaipeng Zhang, Yu Qiao, Ping Luo:
Foundation Model is Efficient Multimodal Multitask Model Selector. NeurIPS 2023 - [c227]Yao Mu, Qinglong Zhang, Mengkang Hu, Wenhai Wang, Mingyu Ding, Jun Jin, Bin Wang, Jifeng Dai, Yu Qiao, Ping Luo:
EmbodiedGPT: Vision-Language Pre-Training via Embodied Chain of Thought. NeurIPS 2023 - [c226]Keqiang Sun, Junting Pan, Yuying Ge, Hao Li, Haodong Duan, Xiaoshi Wu, Renrui Zhang, Aojun Zhou, Zipeng Qin, Yi Wang, Jifeng Dai, Yu Qiao, Limin Wang, Hongsheng Li:
JourneyDB: A Benchmark for Generative Image Understanding. NeurIPS 2023 - [c225]Wenhai Wang, Zhe Chen, Xiaokang Chen, Jiannan Wu, Xizhou Zhu, Gang Zeng, Ping Luo, Tong Lu, Jie Zhou, Yu Qiao, Jifeng Dai:
VisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks. NeurIPS 2023 - [c224]Jiakang Yuan, Bo Zhang, Xiangchao Yan, Botian Shi, Tao Chen, Yikang Li, Yu Qiao:
AD-PT: Autonomous Driving Pre-Training with Large-scale Point Cloud Dataset. NeurIPS 2023 - [c223]Wenlong Zhang, Xiaohui Li, Guangyuan Shi, Xiangyu Chen, Yu Qiao, Xiaoyun Zhang, Xiao-Ming Wu, Chao Dong:
Real-World Image Super-Resolution as Multi-Task Learning. NeurIPS 2023 - [c222]Rongkun Zheng, Lu Qi, Xi Chen, Yi Wang, Kun Wang, Yu Qiao, Hengshuang Zhao:
TMT-VIS: Taxonomy-aware Multi-dataset Joint Training for Video Instance Segmentation. NeurIPS 2023 - [i290]Penghao Wu, Li Chen, Hongyang Li, Xiaosong Jia, Junchi Yan, Yu Qiao:
Policy Pre-training for End-to-end Autonomous Driving via Self-supervised Geometric Modeling. CoRR abs/2301.01006 (2023) - [i289]Runnan Chen, Youquan Liu, Lingdong Kong, Xinge Zhu, Yuexin Ma, Yikang Li, Yuenan Hou, Yu Qiao, Wenping Wang:
CLIP2Scene: Towards Label-efficient 3D Scene Understanding by CLIP. CoRR abs/2301.04926 (2023) - [i288]Jilan Xu, Junlin Hou, Yuejie Zhang, Rui Feng, Yi Wang, Yu Qiao, Weidi Xie:
Learning Open-vocabulary Semantic Segmentation Models From Natural Language Supervision. CoRR abs/2301.09121 (2023) - [i287]Renrui Zhang, Xiangfei Hu, Bohao Li, Siyuan Huang, Hanqiu Deng, Hongsheng Li, Yu Qiao, Peng Gao:
Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners. CoRR abs/2303.02151 (2023) - [i286]Xin Li, Tao Ma, Yuenan Hou, Botian Shi, Yuchen Yang, Youquan Liu, Xingjiao Wu, Qin Chen, Yikang Li, Yu Qiao, Liang He:
LoGoNet: Towards Accurate 3D Object Detection with Local-to-Global Cross-Modal Fusion. CoRR abs/2303.03595 (2023) - [i285]Lingdong Kong, Youquan Liu, Runnan Chen, Yuexin Ma, Xinge Zhu, Yikang Li, Yuenan Hou, Yu Qiao, Ziwei Liu:
Rethinking Range View Representation for LiDAR Segmentation. CoRR abs/2303.05367 (2023) - [i284]Ziteng Cui, Lin Gu, Xiao Sun, Yu Qiao, Tatsuya Harada:
Aleth-NeRF: Low-light Condition View Synthesis with Concealing Fields. CoRR abs/2303.05807 (2023) - [i283]Jiakang Yuan, Bo Zhang, Xiangchao Yan, Tao Chen, Botian Shi, Yikang Li, Yu Qiao:
Bi3D: Bi-domain Active Learning for Cross-domain 3D Object Detection. CoRR abs/2303.05886 (2023) - [i282]Bo Zhang, Jiakang Yuan, Botian Shi, Tao Chen, Yikang Li, Yu Qiao:
Uni3D: A Unified Baseline for Multi-dataset 3D Object Detection. CoRR abs/2303.06880 (2023) - [i281]Zhaoyang Xia, Youquan Liu, Xin Li, Xinge Zhu, Yuexin Ma, Yikang Li, Yuenan Hou, Yu Qiao:
SCPNet: Semantic Scene Completion on Point Cloud. CoRR abs/2303.06884 (2023) - [i280]Jiaqi Xu, Xiaowei Hu, Lei Zhu, Qi Dou, Jifeng Dai, Yu Qiao, Pheng-Ann Heng:
Video Dehazing via a Multi-Range Temporal Alignment Network with Physical Prior. CoRR abs/2303.09757 (2023) - [i279]Xuyang Shen, Dong Li, Jinxing Zhou, Zhen Qin, Bowen He, Xiaodong Han, Aixuan Li, Yuchao Dai, Lingpeng Kong, Meng Wang, Yu Qiao, Yiran Zhong:
Fine-grained Audible Video Description. CoRR abs/2303.15616 (2023) - [i278]Kunchang Li, Yali Wang, Yizhuo Li, Yi Wang, Yinan He, Limin Wang, Yu Qiao:
Unmasked Teacher: Towards Training-Efficient Video Foundation Models. CoRR abs/2303.16058 (2023) - [i277]Renrui Zhang, Jiaming Han, Aojun Zhou, Xiangfei Hu, Shilin Yan, Pan Lu, Hongsheng Li, Peng Gao, Yu Qiao:
LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attention. CoRR abs/2303.16199 (2023) - [i276]Limin Wang, Bingkun Huang, Zhiyu Zhao, Zhan Tong, Yinan He, Yi Wang, Yali Wang, Yu Qiao:
VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking. CoRR abs/2303.16727 (2023) - [i275]Tianyu Li, Li Chen, Xiangwei Geng, Huijie Wang, Yang Li, Zhenbo Liu, Shengyin Jiang, Yuting Wang, Hang Xu, Chunjing Xu, Feng Wen, Ping Luo, Junchi Yan, Wei Zhang, Xiaogang Wang, Yu Qiao, Hongyang Li:
Topology Reasoning for Driving Scenes. CoRR abs/2304.05277 (2023) - [i274]Ziyan Huang, Haoyu Wang, Zhongying Deng, Jin Ye, Yanzhou Su, Hui Sun, Junjun He, Yun Gu, Lixu Gu, Shaoting Zhang, Yu Qiao:
STU-Net: Scalable and Transferable Medical Image Segmentation Models Empowered by Large-Scale Supervised Pre-training. CoRR abs/2304.06716 (2023) - [i273]Xiaoliang Ju, Yiyang Sun, Yiming Hao, Yikang Li, Yu Qiao, Hongsheng Li:
Perception Imitation: Towards Synthesis-free Simulator for Autonomous Vehicles. CoRR abs/2304.09365 (2023) - [i272]Huijie Wang, Zhenbo Liu, Yang Li, Tianyu Li, Li Chen, Chonghao Sima, Yuting Wang, Shengyin Jiang, Feng Wen, Hang Xu, Ping Luo, Junchi Yan, Wei Zhang, Jun Yao, Yu Qiao, Hongyang Li:
Road Genome: A Topology Reasoning Benchmark for Scene Understanding in Autonomous Driving. CoRR abs/2304.10440 (2023) - [i271]Zeyu Lu, Chengyue Wu, Xinyuan Chen, Yaohui Wang, Lei Bai, Yu Qiao, Xihui Liu:
Hierarchical Diffusion Autoencoders and Disentangled Image Manipulation. CoRR abs/2304.11829 (2023) - [i270]Peng Gao, Jiaming Han, Renrui Zhang, Ziyi Lin, Shijie Geng, Aojun Zhou, Wei Zhang, Pan Lu, Conghui He, Xiangyu Yue, Hongsheng Li, Yu Qiao:
LLaMA-Adapter V2: Parameter-Efficient Visual Instruction Model. CoRR abs/2304.15010 (2023) - [i269]Jiashuo Yu, Yaohui Wang, Xinyuan Chen, Xiao Sun, Yu Qiao:
Long-Term Rhythmic Video Soundtracker. CoRR abs/2305.01319 (2023) - [i268]Yaohui Wang, Xin Ma, Xinyuan Chen, Antitza Dantcheva, Bo Dai, Yu Qiao:
LEO: Generative Latent Image Animator for Human Video Synthesis. CoRR abs/2305.03989 (2023) - [i267]Mingzhou Liu, Xinwei Sun, Yu Qiao, Yizhou Wang:
Causal Discovery with Unobserved Variables: A Proxy Variable Approach. CoRR abs/2305.05281 (2023) - [i266]Zhaoyang Liu, Yinan He, Wenhai Wang, Weiyun Wang, Yi Wang, Shoufa Chen, Qinglong Zhang, Zeqiang Lai, Yang Yang, Qingyun Li, Jiashuo Yu, Kunchang Li, Zhe Chen, Xue Yang, Xizhou Zhu, Yali Wang, Limin Wang, Ping Luo, Jifeng Dai, Yu Qiao:
InternGPT: Solving Vision-Centric Tasks by Interacting with Chatbots Beyond Language. CoRR abs/2305.05662 (2023) - [i265]Kunchang Li, Yinan He, Yi Wang, Yizhuo Li, Wenhai Wang, Ping Luo, Yali Wang, Limin Wang, Yu Qiao:
VideoChat: Chat-Centric Video Understanding. CoRR abs/2305.06355 (2023) - [i264]Wenhai Wang, Zhe Chen, Xiaokang Chen, Jiannan Wu, Xizhou Zhu, Gang Zeng, Ping Luo, Tong Lu, Jie Zhou, Yu Qiao, Jifeng Dai:
VisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks. CoRR abs/2305.11175 (2023) - [i263]Siyuan Huang, Zhengkai Jiang, Hao Dong, Yu Qiao, Peng Gao, Hongsheng Li:
Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model. CoRR abs/2305.11176 (2023) - [i262]Guo Chen, Yin-Dong Zheng, Jiahao Wang, Jilan Xu, Yifei Huang, Junting Pan, Yi Wang, Yali Wang, Yu Qiao, Tong Lu, Limin Wang:
VideoLLM: Modeling Video Sequence with Large Language Models. CoRR abs/2305.13292 (2023) - [i261]Yao Mu, Qinglong Zhang, Mengkang Hu, Wenhai Wang, Mingyu Ding, Jun Jin, Bin Wang, Jifeng Dai, Yu Qiao, Ping Luo:
EmbodiedGPT: Vision-Language Pre-Training via Embodied Chain of Thought. CoRR abs/2305.15021 (2023) - [i260]Jinjin Gu, Xianzheng Ma, Xiangtao Kong, Yu Qiao, Chao Dong:
Networks are Slacking Off: Understanding Generalization Problem in Image Deraining. CoRR abs/2305.15134 (2023) - [i259]Shilin Yan, Renrui Zhang, Ziyu Guo, Wenchao Chen, Wei Zhang, Hongyang Li, Yu Qiao, Zhongjiang He, Peng Gao:
Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation. CoRR abs/2305.16318 (2023) - [i258]Xizhou Zhu, Yuntao Chen, Hao Tian, Chenxin Tao, Weijie Su, Chenyu Yang, Gao Huang, Bin Li, Lewei Lu, Xiaogang Wang, Yu Qiao, Zhaoxiang Zhang, Jifeng Dai:
Ghost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge and Memory. CoRR abs/2305.17144 (2023) - [i257]Mengzhao Chen, Wenqi Shao, Peng Xu, Mingbao Lin, Kaipeng Zhang, Fei Chao, Rongrong Ji, Yu Qiao, Ping Luo:
DiffRate : Differentiable Compression Rate for Efficient Vision Transformers. CoRR abs/2305.17997 (2023) - [i256]Xiaoliang Ju, Zhaoyang Huang, Yijin Li, Guofeng Zhang, Yu Qiao, Hongsheng Li:
DiffRoom: Diffusion-based High-Quality 3D Room Reconstruction and Generation with Occupancy Prior. CoRR abs/2306.00519 (2023) - [i255]Jiakang Yuan, Bo Zhang, Xiangchao Yan, Tao Chen, Botian Shi, Yikang Li, Yu Qiao:
AD-PT: Autonomous Driving Pre-Training with Large-scale Point Cloud Dataset. CoRR abs/2306.00612 (2023) - [i254]Zeqiang Lai, Yuchen Duan, Jifeng Dai, Ziheng Li, Ying Fu, Hongsheng Li, Yu Qiao, Wenhai Wang:
Denoising Diffusion Semantic Segmentation with Mask Prior Modeling. CoRR abs/2306.01721 (2023) - [i253]Tao Ma, Xuemeng Yang, Hongbin Zhou, Xin Li, Botian Shi, Junjie Liu, Yuchen Yang, Zhizheng Liu, Liang He, Yu Qiao, Yikang Li, Hongsheng Li:
DetZero: Rethinking Offboard 3D Object Detection with Long-term Sequential Point Clouds. CoRR abs/2306.06023 (2023) - [i252]Peng Xu, Wenqi Shao, Kaipeng Zhang, Peng Gao, Shuo Liu, Meng Lei, Fanqing Meng, Siyuan Huang, Yu Qiao, Ping Luo:
LVLM-eHub: A Comprehensive Evaluation Benchmark for Large Vision-Language Models. CoRR abs/2306.09265 (2023) - [i251]Dequan Wang, Xiaosong Wang, Lilong Wang, Mengzhang Li, Qian Da, Xiaoqiang Liu, Xiangyu Gao, Jun Shen, Junjun He, Tian Shen, Qi Duan, Jie Zhao, Kang Li, Yu Qiao, Shaoting Zhang:
MedFMC: A Real-world Dataset and Benchmark For Foundation Model Adaptation in Medical Image Classification. CoRR abs/2306.09579 (2023) - [i250]Yue Yang, Kaipeng Zhang, Yuying Ge, Wenqi Shao, Zeyue Xue, Yu Qiao, Ping Luo:
Align, Adapt and Inject: Sound-guided Unified Image Generation. CoRR abs/2306.11504 (2023) - [i249]Junting Pan, Ziyi Lin, Yuying Ge, Xiatian Zhu, Renrui Zhang, Yi Wang, Yu Qiao, Hongsheng Li:
Retrieving-to-Answer: Zero-Shot Video Question Answering with Frozen Large Language Models. CoRR abs/2306.11732 (2023) - [i248]Junting Pan, Keqiang Sun, Yuying Ge, Hao Li, Haodong Duan, Xiaoshi Wu, Renrui Zhang, Aojun Zhou, Zipeng Qin, Yi Wang, Jifeng Dai, Yu Qiao, Hongsheng Li:
JourneyDB: A Benchmark for Generative Image Understanding. CoRR abs/2307.00716 (2023) - [i247]Yuwei Guo, Ceyuan Yang, Anyi Rao, Yaohui Wang, Yu Qiao, Dahua Lin, Bo Dai:
AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning. CoRR abs/2307.04725 (2023) - [i246]Licheng Wen, Daocheng Fu, Song Mao, Pinlong Cai, Min Dou, Yikang Li, Yu Qiao:
LimSim: A Long-term Interactive Multi-scenario Traffic Simulator. CoRR abs/2307.06648 (2023) - [i245]Yi Wang, Yinan He, Yizhuo Li, Kunchang Li, Jiashuo Yu, Xin Ma, Xinyuan Chen, Yaohui Wang, Ping Luo, Ziwei Liu, Yali Wang, Limin Wang, Yu Qiao:
InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation. CoRR abs/2307.06942 (2023) - [i244]Daocheng Fu, Xin Li, Licheng Wen, Min Dou, Pinlong Cai, Botian Shi, Yu Qiao:
Drive Like a Human: Rethinking Autonomous Driving with Large Language Models. CoRR abs/2307.07162 (2023) - [i243]Yiyuan Zhang, Kaixiong Gong, Kaipeng Zhang, Hongsheng Li, Yu Qiao, Wanli Ouyang, Xiangyu Yue:
Meta-Transformer: A Unified Framework for Multimodal Learning. CoRR abs/2307.10802 (2023) - [i242]Yunkun Zhang, Jin Gao, Mu Zhou, Xiaosong Wang, Yu Qiao, Shaoting Zhang, Dequan Wang:
Text-guided Foundation Model Adaptation for Pathological Image Classification. CoRR abs/2307.14901 (2023) - [i241]Zhen Qin, Dong Li, Weigao Sun, Weixuan Sun, Xuyang Shen, Xiaodong Han, Yunshen Wei, Baohong Lv, Fei Yuan, Xiao Luo, Yu Qiao, Yiran Zhong:
Scaling TransNormer to 175 Billion Parameters. CoRR abs/2307.14995 (2023) - [i240]Zun Wang, Jialu Li, Yicong Hong, Yi Wang, Qi Wu, Mohit Bansal, Stephen Gould, Hao Tan, Yu Qiao:
Scaling Data Generation in Vision-and-Language Navigation. CoRR abs/2307.15644 (2023) - [i239]Weiyun Wang, Min Shi, Qingyun Li, Wenhai Wang, Zhenhang Huang, Linjie Xing, Zhe Chen, Hao Li, Xizhou Zhu, Zhiguo Cao, Yushi Chen, Tong Lu, Jifeng Dai, Yu Qiao:
The All-Seeing Project: Towards Panoptic Visual Recognition and Understanding of the Open World. CoRR abs/2308.01907 (2023) - [i238]Wenqi Shao, Yutao Hu, Peng Gao, Meng Lei, Kaipeng Zhang, Fanqing Meng, Peng Xu, Siyuan Huang, Hongsheng Li, Yu Qiao, Ping Luo:
Tiny LVLM-eHub: Early Multimodal Experiments with Bard. CoRR abs/2308.03729 (2023) - [i237]Fanqing Meng, Wenqi Shao, Zhanglin Peng, Chonghe Jiang, Kaipeng Zhang, Yu Qiao, Ping Luo:
Foundation Model is Efficient Multimodal Multitask Model Selector. CoRR abs/2308.06262 (2023) - [i236]Lihe Yang, Zhen Zhao, Lei Qi, Yu Qiao, Yinghuan Shi, Hengshuang Zhao:
Shrinking Class Space for Enhanced Certainty in Semi-Supervised Learning. CoRR abs/2308.06777 (2023) - [i235]Bingkun Huang, Zhiyu Zhao, Guozhen Zhang, Yu Qiao, Limin Wang:
MGMAE: Motion Guided Masking for Video Masked Autoencoding. CoRR abs/2308.10794 (2023) - [i234]Wenqi Shao, Mengzhao Chen, Zhaoyang Zhang, Peng Xu, Lirui Zhao, Zhiqian Li, Kaipeng Zhang, Peng Gao, Yu Qiao, Ping Luo:
OmniQuant: Omnidirectionally Calibrated Quantization for Large Language Models. CoRR abs/2308.13137 (2023) - [i233]Xinqi Lin, Jingwen He, Ziyan Chen, Zhaoyang Lyu, Ben Fei, Bo Dai, Wanli Ouyang, Yu Qiao, Chao Dong:
DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior. CoRR abs/2308.15070 (2023) - [i232]Junlong Cheng, Jin Ye, Zhongying Deng, Jianpin Chen, Tianbin Li, Haoyu Wang, Yanzhou Su, Ziyan Huang, Jilong Chen, Lei Jiang, Hui Sun, Junjun He, Shaoting Zhang, Min Zhu, Yu Qiao:
SAM-Med2D. CoRR abs/2308.16184 (2023) - [i231]Wenlong Zhang, Xiaohui Li, Xiangyu Chen, Yu Qiao, Xiao-Ming Wu, Chao Dong:
SEAL: A Framework for Systematic Evaluation of Real-World Super-Resolution. CoRR abs/2309.03020 (2023) - [i230]Jiaming Han, Renrui Zhang, Wenqi Shao, Peng Gao, Peng Xu, Han Xiao, Kaipeng Zhang, Chris Liu, Song Wen, Ziyu Guo, Xudong Lu, Shuai Ren, Yafei Wen, Xiaoxin Chen, Xiangyu Yue, Hongsheng Li, Yu Qiao:
ImageBind-LLM: Multi-modality Instruction Tuning. CoRR abs/2309.03905 (2023) - [i229]Ziyan Huang, Zhongying Deng, Jin Ye, Haoyu Wang, Yanzhou Su, Tianbin Li, Hui Sun, Junlong Cheng, Jianpin Chen, Junjun He, Yun Gu, Shaoting Zhang, Lixu Gu, Yu Qiao:
A-Eval: A Benchmark for Cross-Dataset Evaluation of Abdominal Multi-Organ Segmentation. CoRR abs/2309.03906 (2023) - [i228]Xiangyu Chen, Zheyuan Li, Zhengwen Zhang, Jimmy S. Ren, Yihao Liu, Jingwen He, Yu Qiao, Jiantao Zhou, Chao Dong:
Towards Efficient SDRTV-to-HDRTV by Learning from Image Formation. CoRR abs/2309.04084 (2023) - [i227]Xiangyu Chen, Xintao Wang, Wenlong Zhang, Xiangtao Kong, Yu Qiao, Jiantao Zhou, Chao Dong:
HAT: Hybrid Attention Transformer for Image Restoration. CoRR abs/2309.05239 (2023) - [i226]Bo Zhang, Xinyu Cai, Jiakang Yuan, Donglin Yang, Jianfei Guo, Xiangchao Yan, Renqiu Xia, Botian Shi, Min Dou, Tao Chen, Si Liu, Junchi Yan, Yu Qiao:
ReSimAD: Zero-Shot 3D Domain Transfer for Autonomous Driving with Source Reconstruction and Target Simulation. CoRR abs/2309.05527 (2023) - [i225]Youquan Liu, Runnan Chen, Xin Li, Lingdong Kong, Yuchen Yang, Zhaoyang Xia, Yeqi Bai, Xinge Zhu, Yuexin Ma, Yikang Li, Yu Qiao, Yuenan Hou:
UniSeg: A Unified Multi-Modal LiDAR Segmentation Network and the OpenPCSeg Codebase. CoRR abs/2309.05573 (2023) - [i224]Xiangchao Yan, Runjian Chen, Bo Zhang, Jiakang Yuan, Xinyu Cai, Botian Shi, Wenqi Shao, Junchi Yan, Ping Luo, Yu Qiao:
SPOT: Scalable 3D Pre-training via Occupancy Prediction for Autonomous Driving. CoRR abs/2309.10527 (2023) - [i223]Renqiu Xia, Bo Zhang, Haoyang Peng, Ning Liao, Peng Ye, Botian Shi, Junchi Yan, Yu Qiao:
StructChart: Perception, Structuring, Reasoning for Visual Chart Understanding. CoRR abs/2309.11268 (2023) - [i222]Yaohui Wang, Xinyuan Chen, Xin Ma, Shangchen Zhou, Ziqi Huang, Yi Wang, Ceyuan Yang, Yinan He, Jiashuo Yu, Peiqing Yang, Yuwei Guo, Tianxing Wu, Chenyang Si, Yuming Jiang, Cunjian Chen, Chen Change Loy, Bo Dai, Dahua Lin, Yu Qiao, Ziwei Liu:
LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion Models. CoRR abs/2309.15103 (2023) - [i221]Pan Zhang, Xiaoyi Dong, Bin Wang, Yuhang Cao, Chao Xu, Linke Ouyang, Zhiyuan Zhao, Shuangrui Ding, Songyang Zhang, Haodong Duan, Wenwei Zhang, Hang Yan, Xinyue Zhang, Wei Li, Jingwen Li, Kai Chen, Conghui He, Xingcheng Zhang, Yu Qiao, Dahua Lin, Jiaqi Wang:
InternLM-XComposer: A Vision-Language Large Model for Advanced Text-image Comprehension and Composition. CoRR abs/2309.15112 (2023) - [i220]Licheng Wen, Daocheng Fu, Xin Li, Xinyu Cai, Tao Ma, Pinlong Cai, Min Dou, Botian Shi, Liang He, Yu Qiao:
DiLu: A Knowledge-Driven Approach to Autonomous Driving with Large Language Models. CoRR abs/2309.16292 (2023) - [i219]Mingzhou Liu, Xinwei Sun, Ching-Wen Lee, Yu Qiao, Yizhou Wang:
Exploring Counterfactual Alignment Loss towards Human-centered AI. CoRR abs/2310.01766 (2023) - [i218]Zhanhui Zhou, Jie Liu, Chao Yang, Jing Shao, Yu Liu, Xiangyu Yue, Wanli Ouyang, Yu Qiao:
Beyond One-Preference-for-All: Multi-Objective Direct Preference Optimization for Language Models. CoRR abs/2310.03708 (2023) - [i217]Hao Zhang, Kaipeng Zhang, Lumin Xu, Shenqi Lai, Wenqi Shao, Nanning Zheng, Ping Luo, Yu Qiao:
Language-driven Open-Vocabulary Keypoint Detection for Animal Body and Face. CoRR abs/2310.05056 (2023) - [i216]Ning Liao, Shaofeng Zhang, Renqiu Xia, Bo Zhang, Min Cao, Yu Qiao, Junchi Yan:
REVO-LION: Evaluating and Refining Vision-Language Instruction Tuning Datasets. CoRR abs/2310.06594 (2023) - [i215]Zeqiang Lai, Xizhou Zhu, Jifeng Dai, Yu Qiao, Wenhai Wang:
Mini-DALLE3: Interactive Text to Image by Prompting Large Language Models. CoRR abs/2310.07653 (2023) - [i214]Bo Peng, Xinyuan Chen, Yaohui Wang, Chaochao Lu, Yu Qiao:
ConditionVideo: Training-Free Condition-Guided Text-to-Video Generation. CoRR abs/2310.07697 (2023) - [i213]Mengkang Hu, Yao Mu, Xinmiao Yu, Mingyu Ding, Shiguang Wu, Wenqi Shao, Qiguang Chen, Bin Wang, Yu Qiao, Ping Luo:
Tree-Planner: Efficient Close-loop Task Planning with Large Language Models. CoRR abs/2310.08582 (2023) - [i212]Haoyi Zhu, Honghui Yang, Xiaoyang Wu, Di Huang, Sha Zhang, Xianglong He, Tong He, Hengshuang Zhao, Chunhua Shen, Yu Qiao, Wanli Ouyang:
PonderV2: Pave the Way for 3D Foundation Model with A Universal Pre-training Paradigm. CoRR abs/2310.08586 (2023) - [i211]Yihao Liu, Xiangyu Chen, Xianzheng Ma, Xintao Wang, Jiantao Zhou, Yu Qiao, Chao Dong:
Unifying Image Processing as Visual Prompting Question Answering. CoRR abs/2310.10513 (2023) - [i210]Xiangyu Chen, Zheyuan Li, Yuandong Pu, Yihao Liu, Jiantao Zhou, Yu Qiao, Chao Dong:
A Comparative Study of Image Restoration Networks for General Backbone Network Design. CoRR abs/2310.11881 (2023) - [i209]Haoyu Wang, Sizheng Guo, Jin Ye, Zhongying Deng, Junlong Cheng, Tianbin Li, Jianpin Chen, Yanzhou Su, Ziyan Huang, Yiqing Shen, Bin Fu, Shaoting Zhang, Junjun He, Yu Qiao:
SAM-Med3D. CoRR abs/2310.15161 (2023) - [i208]Linyan Huang, Zhiqi Li, Chonghao Sima, Wenhai Wang, Jingdong Wang, Yu Qiao, Hongyang Li:
Leveraging Vision-Centric Multi-Modal Expertise for 3D Object Detection. CoRR abs/2310.15670 (2023) - [i207]Zhaoyang Liu, Zeqiang Lai, Zhangwei Gao, Erfei Cui, Zhiheng Li, Xizhou Zhu, Lewei Lu, Qifeng Chen, Yu Qiao, Jifeng Dai, Wenhai Wang:
ControlLLM: Augment Language Models with Tools by Searching on Graphs. CoRR abs/2310.17796 (2023) - [i206]Yizhuo Li, Kunchang Li, Yinan He, Yi Wang, Yali Wang, Limin Wang, Yu Qiao, Ping Luo:
Harvest Video Foundation Models via Efficient Post-Pretraining. CoRR abs/2310.19554 (2023) - [i205]Xinyuan Chen, Yaohui Wang, Lingjun Zhang, Shaobin Zhuang, Xin Ma, Jiashuo Yu, Yali Wang, Dahua Lin, Yu Qiao, Ziwei Liu:
SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction. CoRR abs/2310.20700 (2023) - [i204]Zeren Chen, Ziqin Wang, Zhen Wang, Huayang Liu, Zhenfei Yin, Si Liu, Lu Sheng, Wanli Ouyang, Yu Qiao, Jing Shao:
Octavius: Mitigating Task Interference in MLLMs via MoE. CoRR abs/2311.02684 (2023) - [i203]Zhelun Shi, Zhipin Wang, Hongxing Fan, Zhenfei Yin, Lu Sheng, Yu Qiao, Jing Shao:
ChEF: A Comprehensive Evaluation Framework for Standardized Assessment of Multimodal Large Language Models. CoRR abs/2311.02692 (2023) - [i202]Zhiyu Zhao, Bingkun Huang, Sen Xing, Gangshan Wu, Yu Qiao, Limin Wang:
Asymmetric Masked Distillation for Pre-Training Small Foundation Models. CoRR abs/2311.03149 (2023) - [i201]Licheng Wen, Xuemeng Yang, Daocheng Fu, Xiaofeng Wang, Pinlong Cai, Xin Li, Tao Ma, Yingxuan Li, Linran Xu, Dengke Shang, Zheng Zhu, Shaoyan Sun, Yeqi Bai, Xinyu Cai, Min Dou, Shuanglu Hu, Botian Shi, Yu Qiao:
On the Road with GPT-4V(ision): Early Explorations of Visual-Language Model on Autonomous Driving. CoRR abs/2311.05332 (2023) - [i200]Ziyi Lin, Chris Liu, Renrui Zhang, Peng Gao, Longtian Qiu, Han Xiao, Han Qiu, Chen Lin, Wenqi Shao, Keqin Chen, Jiaming Han, Siyuan Huang, Yichi Zhang, Xuming He, Hongsheng Li, Yu Qiao:
SPHINX: The Joint Mixing of Weights, Tasks, and Visual Embeddings for Multi-modal Large Language Models. CoRR abs/2311.07575 (2023) - [i199]Zhihang Zhong, Gurunandan Krishnan, Xiao Sun, Yu Qiao, Sizhuo Ma, Jian Wang:
Clearer Frames, Anytime: Resolving Velocity Ambiguity in Video Frame Interpolation. CoRR abs/2311.08007 (2023) - [i198]Jin Ye, Junlong Cheng, Jianpin Chen, Zhongying Deng, Tianbin Li, Haoyu Wang, Yanzhou Su, Ziyan Huang, Jilong Chen, Lei Jiang, Hui Sun, Min Zhu, Shaoting Zhang, Junjun He, Yu Qiao:
SA-Med2D-20M Dataset: Segment Anything in 2D Medical Imaging with 20 Million masks. CoRR abs/2311.11969 (2023) - [i197]Yangyang Xu, Shengfeng He, Wenqi Shao, Kwan-Yee K. Wong, Yu Qiao, Ping Luo:
DiffusionMat: Alpha Matting as Sequential Refinement Learning. CoRR abs/2311.13535 (2023) - [i196]Yu Yi, Xue Yang, Qingyun Li, Feipeng Da, Junchi Yan, Jifeng Dai, Yu Qiao:
Point2RBox: Combine Knowledge from Synthetic Visual Patterns for End-to-end Oriented Object Detection with Single Point Supervision. CoRR abs/2311.14758 (2023) - [i195]Yufei Wang, Wenhan Yang, Xinyuan Chen, Yaohui Wang, Lanqing Guo, Lap-Pui Chau, Ziwei Liu, Yu Qiao, Alex C. Kot, Bihan Wen:
SinSR: Diffusion-Based Image Super-Resolution in a Single Step. CoRR abs/2311.14760 (2023) - [i194]Kunchang Li, Yali Wang, Yinan He, Yizhuo Li, Yi Wang, Yi Liu, Zun Wang, Jilan Xu, Guo Chen, Ping Luo, Limin Wang, Yu Qiao:
MVBench: A Comprehensive Multi-modal Video Understanding Benchmark. CoRR abs/2311.17005 (2023) - [i193]Xin Liu, Yichen Zhu, Yunshi Lan, Chao Yang, Yu Qiao:
Query-Relevant Images Jailbreak Large Multi-Modal Models. CoRR abs/2311.17600 (2023) - [i192]Ziqi Huang, Yinan He, Jiashuo Yu, Fan Zhang, Chenyang Si, Yuming Jiang, Yuanhan Zhang, Tianxing Wu, Qingyang Jin, Nattapol Chanpaisit, Yaohui Wang, Xinyuan Chen, Limin Wang, Dahua Lin, Yu Qiao, Ziwei Liu:
VBench: Comprehensive Benchmark Suite for Video Generative Models. CoRR abs/2311.17982 (2023) - [i191]Yanqing Liu, Kai Wang, Wenqi Shao, Ping Luo, Yu Qiao, Mike Zheng Shou, Kaipeng Zhang, Yang You:
MLLMs-Augmented Visual-Language Representation Learning. CoRR abs/2311.18765 (2023) - [i190]Yuming Jiang, Tianxing Wu, Shuai Yang, Chenyang Si, Dahua Lin, Yu Qiao, Chen Change Loy, Ziwei Liu:
VideoBooth: Diffusion-based Video Generation with Image Prompts. CoRR abs/2312.00777 (2023) - [i189]Hongyang Li, Yang Li, Huijie Wang, Jia Zeng, Pinlong Cai, Huilin Xu, Dahua Lin, Junchi Yan, Feng Xu, Lu Xiong, Jingdong Wang, Futang Zhu, Kai Yan, Chunjing Xu, Tiancai Wang, Beipeng Mu, Shaoqing Ren, Zhihui Peng, Yu Qiao:
Open-sourced Data Ecosystem in Autonomous Driving: the Present and Future. CoRR abs/2312.03408 (2023) - [i188]Jiaming Han, Kaixiong Gong, Yiyuan Zhang, Jiaqi Wang, Kaipeng Zhang, Dahua Lin, Yu Qiao, Peng Gao, Xiangyu Yue:
OneLLM: One Framework to Align All Modalities with Language. CoRR abs/2312.03700 (2023) - [i187]Xin Li, Yeqi Bai, Pinlong Cai, Licheng Wen, Daocheng Fu, Bo Zhang, Xuemeng Yang, Xinyu Cai, Tao Ma, Jianfei Guo, Xing Gao, Min Dou, Yikang Li, Botian Shi, Yong Liu, Liang He, Yu Qiao:
Towards Knowledge-driven Autonomous Driving. CoRR abs/2312.04316 (2023) - [i186]Hongjie Zhang, Yi Liu, Lu Dong, Yifei Huang, Zhen-Hua Ling, Yali Wang, Limin Wang, Yu Qiao:
MoVQA: A Benchmark of Versatile Question-Answering for Long-Form Movie Understanding. CoRR abs/2312.04817 (2023) - [i185]Rongkun Zheng, Lu Qi, Xi Chen, Yi Wang, Kun Wang, Yu Qiao, Hengshuang Zhao:
TMT-VIS: Taxonomy-aware Multi-dataset Joint Training for Video Instance Segmentation. CoRR abs/2312.06630 (2023) - [i184]Zehuan Huang, Hao Wen, Junting Dong, Yaohui Wang, Yangguang Li, Xinyuan Chen, Yan-Pei Cao, Ding Liang, Yu Qiao, Bo Dai, Lu Sheng:
EpiDiff: Enhancing Multi-View Synthesis via Localized Epipolar-Constrained Diffusion. CoRR abs/2312.06725 (2023) - [i183]Yuchen Yang, Yu Qiao, Xiao Sun:
Mask as Supervision: Leveraging Unified Mask Information for Unsupervised 3D Pose Estimation. CoRR abs/2312.07051 (2023) - [i182]Yiran Qin, Enshen Zhou, Qichang Liu, Zhenfei Yin, Lu Sheng, Ruimao Zhang, Yu Qiao, Jing Shao:
MP5: A Multi-modal Open-ended Embodied System in Minecraft via Active Perception. CoRR abs/2312.07472 (2023) - [i181]Ziteng Cui, Lin Gu, Xiao Sun, Xianzheng Ma, Yu Qiao, Tatsuya Harada:
Aleth-NeRF: Illumination Adaptive NeRF with Concealing Field Assumption. CoRR abs/2312.09093 (2023) - [i180]Hao Li, Xue Yang, Zhaokai Wang, Xizhou Zhu, Jie Zhou, Yu Qiao, Xiaogang Wang, Hongsheng Li, Lewei Lu, Jifeng Dai:
Auto MC-Reward: Automated Dense Reward Design with Large Language Models for Minecraft. CoRR abs/2312.09238 (2023) - [i179]Wenhai Wang, Jiangwei Xie, Chuanyang Hu, Haoming Zou, Jianan Fan, Wenwen Tong, Yang Wen, Silei Wu, Hanming Deng, Zhiqi Li, Hao Tian, Lewei Lu, Xizhou Zhu, Xiaogang Wang, Yu Qiao, Jifeng Dai:
DriveMLM: Aligning Multi-Modal Large Language Models with Behavioral Planning States for Autonomous Driving. CoRR abs/2312.09245 (2023) - [i178]Xiaoyang Wu, Li Jiang, Peng-Shuai Wang, Zhijian Liu, Xihui Liu, Yu Qiao, Wanli Ouyang, Tong He, Hengshuang Zhao:
Point Transformer V3: Simpler, Faster, Stronger. CoRR abs/2312.10035 (2023) - [i177]Xu Liu, Tong Zhou, Yuanxin Wang, Yuping Wang, Qinjingwen Cao, Weizhi Du, Yonghuan Yang, Junjun He, Yu Qiao, Yiqing Shen:
Towards the Unification of Generative and Discriminative Visual Foundation Model: A Survey. CoRR abs/2312.10163 (2023) - [i176]Siran Chen, Yue Ma, Yu Qiao, Yali Wang:
M-BEV: Masked BEV Perception for Robust Autonomous Driving. CoRR abs/2312.12144 (2023) - [i175]Lingjun Zhang, Xinyuan Chen, Yaohui Wang, Yue Lu, Yu Qiao:
Brush Your Text: Synthesize Any Scene Text on Images via Diffusion Model. CoRR abs/2312.12232 (2023) - [i174]Yuanfu Wang, Chao Yang, Ying Wen, Yu Liu, Yu Qiao:
Critic-Guided Decision Transformer for Offline Reinforcement Learning. CoRR abs/2312.13716 (2023) - [i173]Zhe Chen, Jiannan Wu, Wenhai Wang, Weijie Su, Guo Chen, Sen Xing, Muyan Zhong, Qinglong Zhang, Xizhou Zhu, Lewei Lu, Bin Li, Ping Luo, Tong Lu, Yu Qiao, Jifeng Dai:
InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks. CoRR abs/2312.14238 (2023) - 2022
- [j76]Diping Song, Fei Li, Cheng Li, Jian Xiong, Junjun He, Xiulan Zhang, Yu Qiao:
Asynchronous feature regularization and cross-modal distillation for OCT based glaucoma diagnosis. Comput. Biol. Medicine 151(Part): 106283 (2022) - [j75]Xiaoxing Zeng, Zhelun Wu, Xiaojiang Peng, Yu Qiao:
Joint 3D facial shape reconstruction and texture completion from a single image. Comput. Vis. Media 8(2): 239-256 (2022) - [j74]Fei Li, Diping Song, Han Chen, Jian Xiong, Xingyi Li, Hua Zhong, Guangxian Tang, Sujie Fan, Dennis S. C. Lam, Weihua Pan, Yajuan Zheng, Ying Li, Guoxiang Qu, Junjun He, Zhe Wang, Ling Jin, Rouxi Zhou, Yunhe Song, Yi Sun, Weijing Cheng, Chunman Yang, Yazhi Fan, Yingjie Li, Hengli Zhang, Ye Yuan, Yang Xu, Yunfan Xiong, Lingfei Jin, Aiguo Lv, Lingzhi Niu, Yuhong Liu, Shaoli Li, Jiani Zhang, Linda M. Zangwill, Alejandro F. Frangi, Tin Aung, Ching-Yu Cheng, Yu Qiao, Xiulan Zhang, Daniel S. W. Ting:
Author Correction: Development and clinical deployment of a smartphone-based visual field deep learning system for glaucoma detection. npj Digit. Medicine 5 (2022) - [j73]Wenlong Zhang, Yihao Liu, Chao Dong, Yu Qiao:
RankSRGAN: Super Resolution Generative Adversarial Networks With Learning to Rank. IEEE Trans. Pattern Anal. Mach. Intell. 44(10): 7149-7166 (2022) - [j72]Jingwen He, Chao Dong, Yihao Liu, Yu Qiao:
Interactive Multi-Dimension Modulation for Image Restoration. IEEE Trans. Pattern Anal. Mach. Intell. 44(12): 9363-9379 (2022) - [j71]Qing Li, Xiaojiang Peng, Yu Qiao, Qi Hao:
Unsupervised person re-identification with multi-label learning guided self-paced clustering. Pattern Recognit. 125: 108521 (2022) - [j70]Weijian Ruan, Yiran Tao, Linjun Ruan, Xiujun Shu, Yu Qiao:
Temporal Weighting Appearance-Aligned Network for Nighttime Video Retrieval. IEEE Signal Process. Lett. 29: 2008-2012 (2022) - [j69]Haiwei Wu, Jiantao Zhou, Jinyu Tian, Jun Liu, Yu Qiao:
Robust Image Forgery Detection Against Transmission Over Online Social Networks. IEEE Trans. Inf. Forensics Secur. 17: 443-456 (2022) - [j68]Yi Liu, Limin Wang, Yali Wang, Xiao Ma, Yu Qiao:
FineAction: A Fine-Grained Video Dataset for Temporal Action Localization. IEEE Trans. Image Process. 31: 6937-6950 (2022) - [j67]Yuhao Liu, Jiake Xie, Yu Qiao, Yong Tang, Xin Yang:
Prior-Induced Information Alignment for Image Matting. IEEE Trans. Multim. 24: 2727-2738 (2022) - [c221]Yu Qiao, Jincheng Zhu, Chengjiang Long, Zeyao Zhang, Yuxin Wang, Zhenjun Du, Xin Yang:
CPRAL: Collaborative Panoptic-Regional Active Learning for Semantic Segmentation. AAAI 2022: 2108-2116 - [c220]Ziteng Cui, Kunchang Li, Lin Gu, Shenghan Su, Peng Gao, Zhengkai Jiang, Yu Qiao, Tatsuya Harada:
You Only Need 90K Parameters to Adapt Light: a Light Weight Transformer for Image Enhancement and Exposure Correction. BMVC 2022: 238 - [c219]Teli Ma, Shijie Geng, Mengmeng Wang, Sheng Xu, Hongsheng Li, Baochang Zhang, Peng Gao, Yu Qiao:
Unleashing the Potential of Vision-Language Models for Long-Tailed Visual Recognition. BMVC 2022: 481 - [c218]Yu Qiao, Ziqi Wei, Yuhao Liu, Yuxin Wang, Dongsheng Zhou, Qiang Zhang, Xin Yang:
Wider and Higher: Intensive Integration and Global Foreground Perception for Image Matting. CGI 2022: 541-553 - [c217]Xiaosong Jia, Li Chen, Penghao Wu, Jia Zeng, Junchi Yan, Hongyang Li, Yu Qiao:
Towards Capturing the Temporal Dynamics for Trajectory Prediction: a Coarse-to-Fine Approach. CoRL 2022: 910-920 - [c216]Zheyuan Li, Yingqi Liu, Xiangyu Chen, Haoming Cai, Jinjin Gu, Yu Qiao, Chao Dong:
Blueprint Separable Residual Network for Efficient Image Super-Resolution. CVPR Workshops 2022: 832-842 - [c215]Yawei Li, Kai Zhang, Radu Timofte, Luc Van Gool, Fangyuan Kong, Mingxi Li, Songwei Liu, Zongcai Du, Ding Liu, Chenhui Zhou, Jingyi Chen, Qingrui Han, Zheyuan Li, Yingqi Liu, Xiangyu Chen, Haoming Cai, Yu Qiao, Chao Dong, Long Sun, Jinshan Pan, Yi Zhu, Zhikai Zong, Xiaoxiao Liu, Zheng Hui, Tao Yang, Peiran Ren, Xuansong Xie, Xian-Sheng Hua, Yanbo Wang, Xiaozhong Ji, Chuming Lin, Donghao Luo, Ying Tai, Chengjie Wang, Zhizhong Zhang, Yuan Xie, Shen Cheng, Ziwei Luo, Lei Yu, Zhihong Wen, Qi Wu, Youwei Li, Haoqiang Fan, Jian Sun, Shuaicheng Liu, Yuanfei Huang, Meiguang Jin, Hua Huang, Jing Liu, Xinjian Zhang, Yan Wang, Lingshun Long, Gen Li, Yuanfan Zhang, Zuowei Cao, Lei Sun, Panaetov Alexander, Yucong Wang, Minjie Cai, Li Wang, Lu Tian, Zheyuan Wang, Hongbing Ma, Jie Liu, Chao Chen, Yidong Cai, Jie Tang, Gangshan Wu, Weiran Wang, Shirui Huang, Honglei Lu, Huan Liu, Keyan Wang, Jun Chen, Shi Chen, Yuchun Miao, Zimo Huang, Lefei Zhang, Mustafa Ayazoglu, Wei Xiong, Chengyi Xiong, Fei Wang, Hao Li, Ruimian Wen, Zhijing Yang, Wenbin Zou, Weixin Zheng, Tian Ye, Yuncheng Zhang, Xiangzhen Kong, Aditya Arora, Syed Waqas Zamir, Salman H. Khan, Munawar Hayat, Fahad Shahbaz Khan, Dandan Gao, Dengwen Zhou, Qian Ning, Jingzhu Tang, Han Huang, Yufei Wang, Zhangheng Peng, Haobo Li, Wenxue Guan, Shenghua Gong, Xin Li, Jun Liu, Wanjun Wang, Kun Zeng, Hanjiang Lin, Xinyu Chen, Jinsheng Fang:
NTIRE 2022 Challenge on Efficient Super-Resolution: Methods and Results. CVPR Workshops 2022: 1061-1101 - [c214]Mingfei Han, David Junhao Zhang, Yali Wang, Rui Yan, Lina Yao, Xiaojun Chang, Yu Qiao:
Dual-AI: Dual-path Actor Interaction Learning for Group Activity Recognition. CVPR 2022: 2980-2989 - [c213]Xiangtao Kong, Xina Liu, Jinjin Gu, Yu Qiao, Chao Dong:
Reflash Dropout in Image Super-Resolution. CVPR 2022: 5992-6002 - [c212]Renrui Zhang, Ziyu Guo, Wei Zhang, Kunchang Li, Xupeng Miao, Bin Cui, Yu Qiao, Peng Gao, Hongsheng Li:
PointCLIP: Point Cloud Understanding by CLIP. CVPR 2022: 8542-8552 - [c211]Mengzhe He, Yali Wang, Jiaxi Wu, Yiru Wang, Hanqing Li, Bo Li, Weihao Gan, Wei Wu, Yu Qiao:
Cross Domain Object Detection by Target-Perceived Dual Branch Distillation. CVPR 2022: 9560-9570 - [c210]Zhiqi Li, Wenhai Wang, Hongyang Li, Enze Xie, Chonghao Sima, Tong Lu, Yu Qiao, Jifeng Dai:
BEVFormer: Learning Bird's-Eye-View Representation from Multi-camera Images via Spatiotemporal Transformers. ECCV (9) 2022: 1-18 - [c209]Sheng Xu, Yanjing Li, Tiancheng Wang, Teli Ma, Baochang Zhang, Peng Gao, Yu Qiao, Jinhu Lü, Guodong Guo:
Recurrent Bilinear Optimization for Binary Neural Networks. ECCV (24) 2022: 19-35 - [c208]Changyao Tian, Wenhai Wang, Xizhou Zhu, Jifeng Dai, Yu Qiao:
VL-LTR: Learning Class-wise Visual-Linguistic Representation for Long-Tailed Visual Recognition. ECCV (25) 2022: 73-91 - [c207]David Junhao Zhang, Kunchang Li, Yali Wang, Yunpeng Chen, Shashwat Chandra, Yu Qiao, Luoqi Liu, Mike Zheng Shou:
MorphMLP: An Efficient MLP-Like Backbone for Spatial-Temporal Representation Learning. ECCV (35) 2022: 230-248 - [c206]Lin Zhou, Haoming Cai, Jinjin Gu, Zheyuan Li, Yingqi Liu, Xiangyu Chen, Yu Qiao, Chao Dong:
Efficient Image Super-Resolution Using Vast-Receptive-Field Attention. ECCV Workshops (2) 2022: 256-272 - [c205]Yi Wang, Menghan Xia, Lu Qi, Jing Shao, Yu Qiao:
PalGAN: Image Colorization with Palette Generative Adversarial Networks. ECCV (15) 2022: 271-288 - [c204]Ziyi Lin, Shijie Geng, Renrui Zhang, Peng Gao, Gerard de Melo, Xiaogang Wang, Jifeng Dai, Yu Qiao, Hongsheng Li:
Frozen CLIP Models are Efficient Video Learners. ECCV (35) 2022: 388-404 - [c203]Zhuofan Zong, Kunchang Li, Guanglu Song, Yali Wang, Yu Qiao, Biao Leng, Yu Liu:
Self-slimmed Vision Transformer. ECCV (11) 2022: 432-448 - [c202]Renrui Zhang, Wei Zhang, Rongyao Fang, Peng Gao, Kunchang Li, Jifeng Dai, Yu Qiao, Hongsheng Li:
Tip-Adapter: Training-Free Adaption of CLIP for Few-Shot Classification. ECCV (35) 2022: 493-510 - [c201]Yinan He, Gengshi Huang, Siyu Chen, Jianing Teng, Kun Wang, Zhenfei Yin, Lu Sheng, Ziwei Liu, Yu Qiao, Jing Shao:
X-Learner: Learning Cross Sources and Tasks for Universal Visual Representation. ECCV (26) 2022: 509-528 - [c200]Li Chen, Chonghao Sima, Yang Li, Zehan Zheng, Jiajie Xu, Xiangwei Geng, Hongyang Li, Conghui He, Jianping Shi, Yu Qiao, Junchi Yan:
PersFormer: 3D Lane Detection via Perspective Transformer and the OpenLane Benchmark. ECCV (38) 2022: 550-567 - [c199]Kunchang Li, Yali Wang, Peng Gao, Guanglu Song, Yu Liu, Hongsheng Li, Yu Qiao:
UniFormer: Unified Transformer for Efficient Spatial-Temporal Representation Learning. ICLR 2022 - [c198]Yi Liu, Xuan Zhang, Ying Li, Guixin Liang, Yabing Jiang, Lixia Qiu, Haiping Tang, Fei Xie, Wei Yao, Yi Dai, Yu Qiao, Yali Wang:
VideoPipe 2022 Challenge: Real-World Video Understanding for Urban Pipe Inspection. ICPR 2022: 4967-4973 - [c197]Bin Wang, Yu Qiao, Dahua Lin, Stephen D. H. Yang, Weijia Li:
Cycle-Consistent Learning for Weakly Supervised Semantic Segmentation. HCMA@MM 2022: 7-13 - [c196]Yue Ma, Yali Wang, Yue Wu, Ziyu Lyu, Siran Chen, Xiu Li, Yu Qiao:
Visual Knowledge Graph for Human Action Reasoning in Videos. ACM Multimedia 2022: 4132-4141 - [c195]Peng Gao, Teli Ma, Hongsheng Li, Ziyi Lin, Jifeng Dai, Yu Qiao:
MCMAE: Masked Convolution Meets Masked Autoencoders. NeurIPS 2022 - [c194]Penghao Wu, Xiaosong Jia, Li Chen, Junchi Yan, Hongyang Li, Yu Qiao:
Trajectory-guided Control Prediction for End-to-end Autonomous Driving: A Simple yet Strong Baseline. NeurIPS 2022 - [c193]Renrui Zhang, Ziyu Guo, Peng Gao, Rongyao Fang, Bin Zhao, Dong Wang, Yu Qiao, Hongsheng Li:
Point-M2AE: Multi-scale Masked Autoencoders for Hierarchical Point Cloud Pre-training. NeurIPS 2022 - [i172]Kunchang Li, Yali Wang, Peng Gao, Guanglu Song, Yu Liu, Hongsheng Li, Yu Qiao:
UniFormer: Unified Transformer for Efficient Spatiotemporal Representation Learning. CoRR abs/2201.04676 (2022) - [i171]Mingye Xu, Zhipeng Zhou, Hongbin Xu, Yali Wang, Yu Qiao:
CP-Net: Contour-Perturbed Reconstruction Network for Self-Supervised Point Cloud Learning. CoRR abs/2201.08215 (2022) - [i170]Kunchang Li, Yali Wang, Junhao Zhang, Peng Gao, Guanglu Song, Yu Liu, Hongsheng Li, Yu Qiao:
UniFormer: Unifying Convolution and Self-attention for Visual Recognition. CoRR abs/2201.09450 (2022) - [i169]Kexue Fu, Peng Gao, Renrui Zhang, Hongsheng Li, Yu Qiao, Manning Wang:
Distillation with Contrast is All You Need for Self-Supervised Point Cloud Representation Learning. CoRR abs/2202.04241 (2022) - [i168]Yuanhan Zhang, Qinghong Sun, Yichun Zhou, Zexin He, Zhenfei Yin, Kun Wang, Lu Sheng, Yu Qiao, Jing Shao, Ziwei Liu:
Bamboo: Building Mega-Scale Vision Dataset Continually with Human-Machine Synergy. CoRR abs/2203.07845 (2022) - [i167]Yinan He, Gengshi Huang, Siyu Chen, Jianing Teng, Wang Kun, Zhenfei Yin, Lu Sheng, Ziwei Liu, Yu Qiao, Jing Shao:
X-Learner: Learning Cross Sources and Tasks for Universal Visual Representation. CoRR abs/2203.08764 (2022) - [i166]Li Chen, Chonghao Sima, Yang Li, Zehan Zheng, Jiajie Xu, Xiangwei Geng, Hongyang Li, Conghui He, Jianping Shi, Yu Qiao, Junchi Yan:
PersFormer: 3D Lane Detection via Perspective Transformer and the OpenLane Benchmark. CoRR abs/2203.11089 (2022) - [i165]Renrui Zhang, Han Qiu, Tai Wang, Xuanzhuo Xu, Ziyu Guo, Yu Qiao, Peng Gao, Hongsheng Li:
MonoDETR: Depth-aware Transformer for Monocular 3D Object Detection. CoRR abs/2203.13310 (2022) - [i164]Kexue Fu, Peng Gao, Shaolei Liu, Renrui Zhang, Yu Qiao, Manning Wang:
POS-BERT: Point Cloud One-Stage BERT Pre-Training. CoRR abs/2204.00989 (2022) - [i163]Mingfei Han, David Junhao Zhang, Yali Wang, Rui Yan, Lina Yao, Xiaojun Chang, Yu Qiao:
Dual-AI: Dual-path Actor Interaction Learning for Group Activity Recognition. CoRR abs/2204.02148 (2022) - [i162]Mengzhe He, Yali Wang, Jiaxi Wu, Yiru Wang, Hanqing Li, Bo Li, Weihao Gan, Wei Wu, Yu Qiao:
Cross Domain Object Detection by Target-Perceived Dual Branch Distillation. CoRR abs/2205.01291 (2022) - [i161]Peng Gao, Teli Ma, Hongsheng Li, Ziyi Lin, Jifeng Dai, Yu Qiao:
ConvMAE: Masked Convolution Meets Masked Autoencoders. CoRR abs/2205.03892 (2022) - [i160]Yawei Li, Kai Zhang, Radu Timofte, Luc Van Gool, Fangyuan Kong, Mingxi Li, Songwei Liu, Zongcai Du, Ding Liu, Chenhui Zhou, Jingyi Chen, Qingrui Han, Zheyuan Li, Yingqi Liu, Xiangyu Chen, Haoming Cai, Yu Qiao, Chao Dong, Long Sun, Jinshan Pan, Yi Zhu, Zhikai Zong, Xiaoxiao Liu, Zheng Hui, Tao Yang, Peiran Ren, Xuansong Xie, Xian-Sheng Hua, Yanbo Wang, Xiaozhong Ji, Chuming Lin, Donghao Luo, Ying Tai, Chengjie Wang, Zhizhong Zhang, Yuan Xie, Shen Cheng, Ziwei Luo, Lei Yu, Zhihong Wen, Qi Wu, Youwei Li, Haoqiang Fan, Jian Sun, Shuaicheng Liu, Yuanfei Huang, Meiguang Jin, Hua Huang, Jing Liu, Xinjian Zhang, Yan Wang, Lingshun Long, Gen Li, Yuanfan Zhang, Zuowei Cao, Lei Sun, Panaetov Alexander, Yucong Wang, Minjie Cai, Li Wang, Lu Tian, Zheyuan Wang, Hongbing Ma, Jie Liu, Chao Chen, Yidong Cai, et al.:
NTIRE 2022 Challenge on Efficient Super-Resolution: Methods and Results. CoRR abs/2205.05675 (2022) - [i159]Zheyuan Li, Yingqi Liu, Xiangyu Chen, Haoming Cai, Jinjin Gu, Yu Qiao, Chao Dong:
Blueprint Separable Residual Network for Efficient Image Super-Resolution. CoRR abs/2205.05996 (2022) - [i158]Yihao Liu, Hengyuan Zhao, Jinjin Gu, Yu Qiao, Chao Dong:
Evaluating the Generalization Ability of Super-Resolution Networks. CoRR abs/2205.07019 (2022) - [i157]Zhe Chen, Yuchen Duan, Wenhai Wang, Junjun He, Tong Lu, Jifeng Dai, Yu Qiao:
Vision Transformer Adapter for Dense Predictions. CoRR abs/2205.08534 (2022) - [i156]Renrui Zhang, Ziyu Guo, Peng Gao, Rongyao Fang, Bin Zhao, Dong Wang, Yu Qiao, Hongsheng Li:
Point-M2AE: Multi-scale Masked Autoencoders for Hierarchical Point Cloud Pre-training. CoRR abs/2205.14401 (2022) - [i155]Ziteng Cui, Kunchang Li, Lin Gu, Shenghan Su, Peng Gao, Zhengkai Jiang, Yu Qiao, Tatsuya Harada:
Illumination Adaptive Transformer. CoRR abs/2205.14871 (2022) - [i154]Chenxin Tao, Xizhou Zhu, Gao Huang, Yu Qiao, Xiaogang Wang, Jifeng Dai:
Siamese Image Modeling for Self-Supervised Vision Representation Learning. CoRR abs/2206.01204 (2022) - [i153]Penghao Wu, Xiaosong Jia, Li Chen, Junchi Yan, Hongyang Li, Yu Qiao:
Trajectory-guided Control Prediction for End-to-end Autonomous Driving: A Simple yet Strong Baseline. CoRR abs/2206.08129 (2022) - [i152]Li Chen, Tutian Tang, Zhitian Cai, Yang Li, Penghao Wu, Hongyang Li, Jianping Shi, Junchi Yan, Yu Qiao:
Level 2 Autonomous Driving on a Single Device: Diving into the Devils of Openpilot. CoRR abs/2206.08176 (2022) - [i151]Mingye Xu, Yali Wang, Yihao Liu, Yu Qiao:
CP3: Unifying Point Cloud Completion by Pretrain-Prompt-Predict Paradigm. CoRR abs/2207.05359 (2022) - [i150]Renrui Zhang, Zhang Wei, Rongyao Fang, Peng Gao, Kunchang Li, Jifeng Dai, Yu Qiao, Hongsheng Li:
Tip-Adapter: Training-free Adaption of CLIP for Few-shot Classification. CoRR abs/2207.09519 (2022) - [i149]Yuexin Ma, Tai Wang, Xuyang Bai, Huitong Yang, Yuenan Hou, Yaming Wang, Yu Qiao, Ruigang Yang, Dinesh Manocha, Xinge Zhu:
Vision-Centric BEV Perception: A Survey. CoRR abs/2208.02797 (2022) - [i148]Ziyi Lin, Shijie Geng, Renrui Zhang, Peng Gao, Gerard de Melo, Xiaogang Wang, Jifeng Dai, Yu Qiao, Hongsheng Li:
Frozen CLIP Models are Efficient Video Learners. CoRR abs/2208.03550 (2022) - [i147]Sheng Xu, Yanjing Li, Tiancheng Wang, Teli Ma, Baochang Zhang, Peng Gao, Yu Qiao, Jinhu Lv, Guodong Guo:
Recurrent Bilinear Optimization for Binary Neural Networks. CoRR abs/2209.01542 (2022) - [i146]Hongyang Li, Chonghao Sima, Jifeng Dai, Wenhai Wang, Lewei Lu, Huijie Wang, Enze Xie, Zhiqi Li, Hanming Deng, Hao Tian, Xizhou Zhu, Li Chen, Yulu Gao, Xiangwei Geng, Jia Zeng, Yang Li, Jiazhi Yang, Xiaosong Jia, Bohan Yu, Yu Qiao, Dahua Lin, Si Liu, Junchi Yan, Jianping Shi, Ping Luo:
Delving into the Devils of Bird's-eye-view Perception: A Review, Evaluation and Recipe. CoRR abs/2209.05324 (2022) - [i145]Renrui Zhang, Hanqiu Deng, Bohao Li, Wei Zhang, Hao Dong, Hongsheng Li, Peng Gao, Yu Qiao:
Collaboration of Pre-trained Models Makes Better Few-shot Learner. CoRR abs/2209.12255 (2022) - [i144]Boyu Chen, Yu Qiao, Yali Wang:
Low-Resolution Action Recognition for Tiny Actions Challenge. CoRR abs/2209.14711 (2022) - [i143]Lin Zhou, Haoming Cai, Jinjin Gu, Zheyuan Li, Yingqi Liu, Xiangyu Chen, Yu Qiao, Chao Dong:
Efficient Image Super-Resolution using Vast-Receptive-Field Attention. CoRR abs/2210.05960 (2022) - [i142]Yu Qiao, Yuhao Liu, Ziqi Wei, Yuxin Wang, Qiang Cai, Guofeng Zhang, Xin Yang:
Hierarchical and Progressive Image Matting. CoRR abs/2210.06906 (2022) - [i141]Yu Qiao, Ziqi Wei, Yuhao Liu, Yuxin Wang, Dongsheng Zhou, Qiang Zhang, Xin Yang:
Wider and Higher: Intensive Integration and Global Foreground Perception for Image Matting. CoRR abs/2210.06919 (2022) - [i140]Yi Liu, Xuan Zhang, Ying Li, Guixin Liang, Yabing Jiang, Lixia Qiu, Haiping Tang, Fei Xie, Wei Yao, Yi Dai, Yu Qiao, Yali Wang:
VideoPipe 2022 Challenge: Real-World Video Understanding for Urban Pipe Inspection. CoRR abs/2210.11158 (2022) - [i139]Yi Wang, Menghan Xia, Lu Qi, Jing Shao, Yu Qiao:
PalGAN: Image Colorization with Palette Generative Adversarial Networks. CoRR abs/2210.11204 (2022) - [i138]Wenhai Wang, Jifeng Dai, Zhe Chen, Zhenhang Huang, Zhiqi Li, Xizhou Zhu, Xiaowei Hu, Tong Lu, Lewei Lu, Hongsheng Li, Xiaogang Wang, Yu Qiao:
InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions. CoRR abs/2211.05778 (2022) - [i137]Jifeng Dai, Min Shi, Weiyun Wang, Sitong Wu, Linjie Xing, Wenhai Wang, Xizhou Zhu, Lewei Lu, Jie Zhou, Xiaogang Wang, Yu Qiao, Xiaowei Hu:
Demystify Transformers & Convolutions in Modern Image Deep Networks. CoRR abs/2211.05781 (2022) - [i136]Hongwei Xue, Peng Gao, Hongyang Li, Yu Qiao, Hao Sun, Houqiang Li, Jiebo Luo:
Stare at What You See: Masked Image Modeling without Reconstruction. CoRR abs/2211.08887 (2022) - [i135]Guo Chen, Sen Xing, Zhe Chen, Yi Wang, Kunchang Li, Yizhuo Li, Yi Liu, Jiahao Wang, Yin-Dong Zheng, Bingkun Huang, Zhiyu Zhao, Junting Pan, Yifei Huang, Zun Wang, Jiashuo Yu, Yinan He, Hongjie Zhang, Tong Lu, Yali Wang, Limin Wang, Yu Qiao:
InternVideo-Ego4D: A Pack of Champion Solutions to Ego4D Challenges. CoRR abs/2211.09529 (2022) - [i134]Kunchang Li, Yali Wang, Yinan He, Yizhuo Li, Yi Wang, Limin Wang, Yu Qiao:
UniFormerV2: Spatiotemporal Learning by Arming Image ViTs with Video UniFormer. CoRR abs/2211.09552 (2022) - [i133]Weijie Su, Xizhou Zhu, Chenxin Tao, Lewei Lu, Bin Li, Gao Huang, Yu Qiao, Xiaogang Wang, Jie Zhou, Jifeng Dai:
Towards All-in-one Pre-training via Maximizing Multi-modal Mutual Information. CoRR abs/2211.09807 (2022) - [i132]Hao Li, Jinguo Zhu, Xiaohu Jiang, Xizhou Zhu, Hongsheng Li, Chun Yuan, Xiaohua Wang, Yu Qiao, Xiaogang Wang, Wenhai Wang, Jifeng Dai:
Uni-Perceiver v2: A Generalist Model for Large-Scale Vision and Vision-Language Tasks. CoRR abs/2211.09808 (2022) - [i131]Chenyu Yang, Yuntao Chen, Hao Tian, Chenxin Tao, Xizhou Zhu, Zhaoxiang Zhang, Gao Huang, Hongyang Li, Yu Qiao, Lewei Lu, Jie Zhou, Jifeng Dai:
BEVFormer v2: Adapting Modern Image Backbones to Bird's-Eye-View Recognition via Perspective Supervision. CoRR abs/2211.10439 (2022) - [i130]Rui Tian, Zuxuan Wu, Qi Dai, Han Hu, Yu Qiao, Yu-Gang Jiang:
ResFormer: Scaling ViTs with Multi-Resolution Training. CoRR abs/2212.00776 (2022) - [i129]Qihuang Zhong, Liang Ding, Yibing Zhan, Yu Qiao, Yonggang Wen, Li Shen, Juhua Liu, Baosheng Yu, Bo Du, Yixin Chen, Xinbo Gao, Chunyan Miao, Xiaoou Tang, Dacheng Tao:
Toward Efficient Language Model Pretraining and Downstream Adaptation via Self-Evolution: A Case Study on SuperGLUE. CoRR abs/2212.01853 (2022) - [i128]Yi Wang, Kunchang Li, Yizhuo Li, Yinan He, Bingkun Huang, Zhiyu Zhao, Hongjie Zhang, Jilan Xu, Yi Liu, Zun Wang, Sen Xing, Guo Chen, Junting Pan, Jiashuo Yu, Yali Wang, Limin Wang, Yu Qiao:
InternVideo: General Video Foundation Models via Generative and Discriminative Learning. CoRR abs/2212.03191 (2022) - [i127]Haibin He, Xinyuan Chen, Chaoyue Wang, Juhua Liu, Bo Du, Dacheng Tao, Yu Qiao:
Diff-Font: Diffusion Model for Robust One-Shot Font Generation. CoRR abs/2212.05895 (2022) - [i126]Renrui Zhang, Liuhui Wang, Yu Qiao, Peng Gao, Hongsheng Li:
Learning 3D Representations from 2D Pre-trained Models via Image-to-Point Masked Autoencoders. CoRR abs/2212.06785 (2022) - [i125]Mingye Xu, Mutian Xu, Tong He, Wanli Ouyang, Yali Wang, Xiaoguang Han, Yu Qiao:
MM-3DScene: 3D Scene Understanding by Customizing Masked Modeling with Informative-Preserved Reconstruction and Self-Distilled Consistency. CoRR abs/2212.09948 (2022) - [i124]Yihan Hu, Jiazhi Yang, Li Chen, Keyu Li, Chonghao Sima, Xizhou Zhu, Siqi Chai, Senyao Du, Tianwei Lin, Wenhai Wang, Lewei Lu, Xiaosong Jia, Qiang Liu, Jifeng Dai, Yu Qiao, Hongyang Li:
Goal-oriented Autonomous Driving. CoRR abs/2212.10156 (2022) - [i123]Ben Fei, Siyuan Huang, Jiakang Yuan, Botian Shi, Bo Zhang, Tao Chen, Min Dou, Yu Qiao:
ADAS: A Simple Active-and-Adaptive Baseline for Cross-Domain 3D Semantic Segmentation. CoRR abs/2212.10390 (2022) - 2021
- [j66]Junjun He, Cheng Li, Jin Ye, Yu Qiao, Lixu Gu:
Multi-label ocular disease classification with a dense correlation deep neural network. Biomed. Signal Process. Control. 63: 102167 (2021) - [j65]Junjun He, Cheng Li, Jin Ye, Yu Qiao, Lixu Gu:
Self-speculation of clinical features based on knowledge distillation for accurate ocular disease classification. Biomed. Signal Process. Control. 67: 102491 (2021) - [j64]Lifang Wu, Qi Wang, Meng Jian, Yu Qiao, Boxuan Zhao:
A Comprehensive Review of Group Activity Recognition in Videos. Int. J. Autom. Comput. 18(3): 334-350 (2021) - [j63]Wen Wang, Xiaojiang Peng, Yanzhou Su, Yu Qiao, Jian Cheng:
TTPP: Temporal Transformer with Progressive Prediction for efficient action anticipation. Neurocomputing 438: 270-279 (2021) - [j62]Xiaoxing Zeng, Ruyun Hu, Wu Shi, Yu Qiao:
Multi-view self-supervised learning for 3D facial texture reconstruction from single image. Image Vis. Comput. 115: 104311 (2021) - [j61]Linwei Zhu, Yun Zhang, Shiqi Wang, Sam Kwong, Xin Jin, Yu Qiao:
Deep Learning-Based Chroma Prediction for Intra Versatile Video Coding. IEEE Trans. Circuits Syst. Video Technol. 31(8): 3168-3181 (2021) - [j60]Junhao Zhang, Yali Wang, Zhipeng Zhou, Tianyu Luan, Zhe Wang, Yu Qiao:
Learning Dynamical Human-Joint Affinity for 3D Pose Estimation in Videos. IEEE Trans. Image Process. 30: 7914-7925 (2021) - [j59]Kaiyang Zhou, Yongxin Yang, Yu Qiao, Tao Xiang:
Domain Adaptive Ensemble Learning. IEEE Trans. Image Process. 30: 8008-8018 (2021) - [j58]Diping Song, Bin Fu, Fei Li, Jian Xiong, Junjun He, Xiulan Zhang, Yu Qiao:
Deep Relation Transformer for Diagnosing Glaucoma With Optical Coherence Tomography and Visual Field Function. IEEE Trans. Medical Imaging 40(9): 2392-2402 (2021) - [j57]Peiqin Zhuang, Yali Wang, Yu Qiao:
Wildfish++: A Comprehensive Fish Benchmark for Multimedia Research. IEEE Trans. Multim. 23: 3603-3617 (2021) - [j56]Xin Yang, Yu Qiao, Shaozhe Chen, Shengfeng He, Baocai Yin, Qiang Zhang, Xiaopeng Wei, Rynson W. H. Lau:
Smart Scribbles for Image Matting. ACM Trans. Multim. Comput. Commun. Appl. 16(4): 121:1-121:21 (2021) - [c192]Pei Liu, Guorun Yang, Peixuan Li, Zhe Wang, Jianping Shi, Zhidong Deng, Yu Qiao:
MP-Mono: Monocular 3D Detection Using Multiple Priors for Autonomous Driving. 3DV 2021: 535-544 - [c191]Tianyu Luan, Yali Wang, Junhao Zhang, Zhe Wang, Zhipeng Zhou, Yu Qiao:
PC-HMR: Pose Calibration for 3D Human Mesh Recovery from 2D Images/Videos. AAAI 2021: 2269-2276 - [c190]Haisheng Su, Weihao Gan, Wei Wu, Yu Qiao, Junjie Yan:
BSN++: Complementary Boundary Regressor with Scale-Balanced Relation Modeling for Temporal Action Proposal Generation. AAAI 2021: 2602-2610 - [c189]Hongbin Xu, Zhipeng Zhou, Yu Qiao, Wenxiong Kang, Qiuxia Wu:
Self-supervised Multi-view Stereo via Effective Co-Segmentation and Data-Augmentation. AAAI 2021: 3030-3038 - [c188]Mingye Xu, Zhipeng Zhou, Junhao Zhang, Yu Qiao:
Investigate Indistinguishable Points in Semantic Segmentation of 3D Point Cloud. AAAI 2021: 3047-3055 - [c187]Mutian Xu, Junhao Zhang, Zhipeng Zhou, Mingye Xu, Xiaojuan Qi, Yu Qiao:
Learning Geometry-Disentangled Representation for Complementary Understanding of 3D Object Point Cloud. AAAI 2021: 3056-3064 - [c186]Haoming Cai, Jingwen He, Yu Qiao, Chao Dong:
Toward Interactive Modulation for Photo-Realistic Image Restoration. CVPR Workshops 2021: 294-303 - [c185]Xiangyu Chen, Yihao Liu, Zhengwen Zhang, Yu Qiao, Chao Dong:
HDRUNet: Single Image HDR Reconstruction With Denoising and Dequantization. CVPR Workshops 2021: 354-363 - [c184]Zhiwu Qing, Haisheng Su, Weihao Gan, Dongliang Wang, Wei Wu, Xiang Wang, Yu Qiao, Junjie Yan, Changxin Gao, Nong Sang:
Temporal Context Aggregation Network for Temporal Action Proposal Refinement. CVPR 2021: 485-494 - [c183]Zhi Hou, Baosheng Yu, Yu Qiao, Xiaojiang Peng, Dacheng Tao:
Affordance Transfer Learning for Human-Object Interaction Detection. CVPR 2021: 495-504 - [c182]Jinjin Gu, Haoming Cai, Chao Dong, Jimmy S. Ren, Yu Qiao, Shuhang Gu, Radu Timofte:
NTIRE 2021 Challenge on Perceptual Image Quality Assessment. CVPR Workshops 2021: 677-690 - [c181]Xiao Zhang, Yixiao Ge, Yu Qiao, Hongsheng Li:
Refining Pseudo Labels With Clustering Consensus Over Generations for Unsupervised Object Re-Identification. CVPR 2021: 3436-3445 - [c180]Xiangtao Kong, Hengyuan Zhao, Yu Qiao, Chao Dong:
ClassSR: A General Framework to Accelerate Super-Resolution Networks by Data Characteristic. CVPR 2021: 12016-12025 - [c179]Zhi Hou, Baosheng Yu, Yu Qiao, Xiaojiang Peng, Dacheng Tao:
Detecting Human-Object Interaction via Fabricated Compositional Learning. CVPR 2021: 14646-14655 - [c178]Xiangyu Chen, Zhengwen Zhang, Jimmy S. Ren, Lynhoo Tian, Yu Qiao, Chao Dong:
A New Journey from SDRTV to HDRTV. ICCV 2021: 4480-4489 - [c177]Hongbin Xu, Zhipeng Zhou, Yali Wang, Wenxiong Kang, Baigui Sun, Hao Li, Yu Qiao:
Digging into Uncertainty in Self-supervised Multi-view Stereo. ICCV 2021: 6058-6067 - [c176]Yuhao Liu, Jiake Xie, Xiao Shi, Yu Qiao, Yujie Huang, Yong Tang, Xin Yang:
Tripartite Information Mining and Integration for Image Matting. ICCV 2021: 7535-7544 - [c175]Kunchang Li, Xianhang Li, Yali Wang, Jun Wang, Yu Qiao:
CT-Net: Channel Tensorization Network for Video Classification. ICLR 2021 - [c174]Kaiyang Zhou, Yongxin Yang, Yu Qiao, Tao Xiang:
Domain Generalization with MixStyle. ICLR 2021 - [c173]Cheng Li, Jin Ye, Junjun He, Shanshan Wang, Lixu Gu, Yu Qiao:
Collaborative Multi-View Convolutions With Gating For Accurate And Fast Volumetric Medical Image Segmentation. ISBI 2021: 571-574 - [c172]Junjun He, Jin Ye, Cheng Li, Diping Song, Wanli Chen, Shanshan Wang, Lixu Gu, Yu Qiao:
Group Shift Pointwise Convolution for Volumetric Medical Image Segmentation. MICCAI (3) 2021: 48-58 - [c171]Zijie Chen, Cheng Li, Junjun He, Jin Ye, Diping Song, Shanshan Wang, Lixu Gu, Yu Qiao:
A Novel Hybrid Convolutional Neural Network for Accurate Organ Segmentation in 3D Head and Neck CT Images. MICCAI (1) 2021: 569-578 - [i122]Yu Qiao, Yuhao Liu, Qiang Zhu, Xin Yang, Yuxin Wang, Qiang Zhang, Xiaopeng Wei:
Multi-scale Information Assembly for Image Matting. CoRR abs/2101.02391 (2021) - [i121]Kaiyang Zhou, Ziwei Liu, Yu Qiao, Tao Xiang, Chen Change Loy:
Domain Generalization: A Survey. CoRR abs/2103.02503 (2021) - [i120]Xiangtao Kong, Hengyuan Zhao, Yu Qiao, Chao Dong:
ClassSR: A General Framework to Accelerate Super-Resolution Networks by Data Characteristic. CoRR abs/2103.04039 (2021) - [i119]Qing Li, Xiaojiang Peng, Yu Qiao, Qi Hao:
Unsupervised Person Re-Identification with Multi-Label Learning Guided Self-Paced Clustering. CoRR abs/2103.04580 (2021) - [i118]Zhi Hou, Baosheng Yu, Yu Qiao, Xiaojiang Peng, Dacheng Tao:
Detecting Human-Object Interaction via Fabricated Compositional Learning. CoRR abs/2103.08214 (2021) - [i117]Tianyu Luan, Yali Wang, Junhao Zhang, Zhe Wang, Zhipeng Zhou, Yu Qiao:
PC-HMR: Pose Calibration for 3D Human Mesh Recovery from 2D Images/Videos. CoRR abs/2103.09009 (2021) - [i116]Mingye Xu, Zhipeng Zhou, Junhao Zhang, Yu Qiao:
Investigate Indistinguishable Points in Semantic Segmentation of 3D Point Cloud. CoRR abs/2103.10339 (2021) - [i115]Zhiwu Qing, Haisheng Su, Weihao Gan, Dongliang Wang, Wei Wu, Xiang Wang, Yu Qiao, Junjie Yan, Changxin Gao, Nong Sang:
Temporal Context Aggregation Network for Temporal Action Proposal Refinement. CoRR abs/2103.13141 (2021) - [i114]Xin Yang, Yu Qiao, Shaozhe Chen, Shengfeng He, Baocai Yin, Qiang Zhang, Xiaopeng Wei, Rynson W. H. Lau:
Smart Scribbles for Image Mating. CoRR abs/2103.17062 (2021) - [i113]Kaiyang Zhou, Yongxin Yang, Yu Qiao, Tao Xiang:
Domain Generalization with MixStyle. CoRR abs/2104.02008 (2021) - [i112]Zhi Hou, Baosheng Yu, Yu Qiao, Xiaojiang Peng, Dacheng Tao:
Affordance Transfer Learning for Human-Object Interaction Detection. CoRR abs/2104.02867 (2021) - [i111]Hongbin Xu, Zhipeng Zhou, Yu Qiao, Wenxiong Kang, Qiuxia Wu:
Self-supervised Multi-view Stereo via Effective Co-Segmentation and Data-Augmentation. CoRR abs/2104.05374 (2021) - [i110]Yihao Liu, Jingwen He, Xiangyu Chen, Zhengwen Zhang, Hengyuan Zhao, Chao Dong, Yu Qiao:
Very Lightweight Photo Retouching Network with Conditional Sequential Modulation. CoRR abs/2104.06279 (2021) - [i109]Jinjin Gu, Haoming Cai, Chao Dong, Jimmy S. Ren, Yu Qiao, Shuhang Gu, Radu Timofte, Manri Cheon, Sung-Jun Yoon, Byungyeon Kang, Junwoo Lee, Qing Zhang, Haiyang Guo, Yi Bin, Yuqing Hou, Hengliang Luo, Jingyu Guo, Zirui Wang, Hai Wang, Wenming Yang, Qingyan Bai, Shuwei Shi, Weihao Xia, Mingdeng Cao, Jiahao Wang, Yifan Chen, Yujiu Yang, Yang Li, Tao Zhang, Longtao Feng, Yiting Liao, Junlin Li, William Thong, José Costa Pereira, Ales Leonardis, Steven McDonagh, Kele Xu, Lehan Yang, Hengxing Cai, Pengfei Sun, Seyed Mehdi Ayyoubzadeh, Ali Royat, Sid Ahmed Fezza, Dounia Hammou, Wassim Hamidouche, Sewoong Ahn, Gwangjin Yoon, Koki Tsubota, Hiroaki Akutsu, Kiyoharu Aizawa:
NTIRE 2021 Challenge on Perceptual Image Quality Assessment. CoRR abs/2105.03072 (2021) - [i108]Haoming Cai, Jingwen He, Yu Qiao, Chao Dong:
Toward Interactive Modulation for Photo-Realistic Image Restoration. CoRR abs/2105.03085 (2021) - [i107]Shijie Yu, Dapeng Chen, Rui Zhao, Haobin Chen, Yu Qiao:
Neighbourhood-guided Feature Reconstruction for Occluded Person Re-Identification. CoRR abs/2105.07345 (2021) - [i106]Yi Liu, Limin Wang, Xiao Ma, Yali Wang, Yu Qiao:
FineAction: A Fined Video Dataset for Temporal Action Localization. CoRR abs/2105.11107 (2021) - [i105]Shijie Yu, Feng Zhu, Dapeng Chen, Rui Zhao, Haobin Chen, Shixiang Tang, Jinguo Zhu, Yu Qiao:
Multiple Domain Experts Collaborative Learning: Multi-Source Domain Generalization For Person Re-Identification. CoRR abs/2105.12355 (2021) - [i104]Xiangyu Chen, Yihao Liu, Zhengwen Zhang, Yu Qiao, Chao Dong:
HDRUNet: Single Image HDR Reconstruction with Denoising and Dequantization. CoRR abs/2105.13084 (2021) - [i103]Haisheng Su, Jinyuan Feng, Dongliang Wang, Weihao Gan, Wei Wu, Yu Qiao:
TSI: Temporal Saliency Integration for Video Action Recognition. CoRR abs/2106.01088 (2021) - [i102]Kunchang Li, Xianhang Li, Yali Wang, Jun Wang, Yu Qiao:
CT-Net: Channel Tensorization Network for Video Classification. CoRR abs/2106.01603 (2021) - [i101]Peng Gao, Shijie Geng, Yu Qiao, Xiaogang Wang, Jifeng Dai, Hongsheng Li:
Scalable Transformers for Neural Machine Translation. CoRR abs/2106.02242 (2021) - [i100]Xiao Zhang, Yixiao Ge, Yu Qiao, Hongsheng Li:
Refining Pseudo Labels with Clustering Consensus over Generations for Unsupervised Object Re-identification. CoRR abs/2106.06133 (2021) - [i99]Yuhao Liu, Jiake Xie, Yu Qiao, Yong Tang, Xin Yang:
Prior-Induced Information Alignment for Image Matting. CoRR abs/2106.14439 (2021) - [i98]Kaiyang Zhou, Yongxin Yang, Yu Qiao, Tao Xiang:
MixStyle Neural Networks for Domain Generalization and Adaptation. CoRR abs/2107.02053 (2021) - [i97]Anran Liu, Yihao Liu, Jinjin Gu, Yu Qiao, Chao Dong:
Blind Image Super-Resolution: A Survey and Beyond. CoRR abs/2107.03055 (2021) - [i96]Wenlong Zhang, Yihao Liu, Chao Dong, Yu Qiao:
RankSRGAN: Super Resolution Generative Adversarial Networks with Learning to Rank. CoRR abs/2107.09427 (2021) - [i95]Haisheng Su, Peiqin Zhuang, Yukun Li, Dongliang Wang, Weihao Gan, Wei Wu, Yu Qiao:
Transferable Knowledge-Based Multi-Granularity Aggregation Network for Temporal Action Localization: Submission to ActivityNet Challenge 2021. CoRR abs/2107.12618 (2021) - [i94]Yihao Liu, Anran Liu, Jinjin Gu, Zhipeng Zhang, Wenhao Wu, Yu Qiao, Chao Dong:
Discovering "Semantics" in Super-Resolution Networks. CoRR abs/2108.00406 (2021) - [i93]Xiangyu Chen, Zhengwen Zhang, Jimmy S. Ren, Lynhoo Tian, Yu Qiao, Chao Dong:
A New Journey from SDRTV to HDRTV. CoRR abs/2108.07978 (2021) - [i92]Hongbin Xu, Zhipeng Zhou, Yali Wang, Wenxiong Kang, Baigui Sun, Hao Li, Yu Qiao:
Digging into Uncertainty in Self-supervised Multi-view Stereo. CoRR abs/2108.12966 (2021) - [i91]Junhao Zhang, Yali Wang, Zhipeng Zhou, Tianyu Luan, Zhe Wang, Yu Qiao:
Learning Dynamical Human-Joint Affinity for 3D Pose Estimation in Videos. CoRR abs/2109.07353 (2021) - [i90]Junjun He, Jin Ye, Cheng Li, Diping Song, Wanli Chen, Shanshan Wang, Lixu Gu, Yu Qiao:
Group Shift Pointwise Convolution for Volumetric Medical Image Segmentation. CoRR abs/2109.12629 (2021) - [i89]Zijie Chen, Cheng Li, Junjun He, Jin Ye, Diping Song, Shanshan Wang, Lixu Gu, Yu Qiao:
A Novel Hybrid Convolutional Neural Network for Accurate Organ Segmentation in 3D Head and Neck CT Images. CoRR abs/2109.12634 (2021) - [i88]Peng Gao, Shijie Geng, Renrui Zhang, Teli Ma, Rongyao Fang, Yongfeng Zhang, Hongsheng Li, Yu Qiao:
CLIP-Adapter: Better Vision-Language Models with Feature Adapters. CoRR abs/2110.04544 (2021) - [i87]Yihao Liu, Hengyuan Zhao, Kelvin C. K. Chan, Xintao Wang, Chen Change Loy, Yu Qiao, Chao Dong:
Temporally Consistent Video Colorization with Deep Feature Propagation and Self-regularization Learning. CoRR abs/2110.04562 (2021) - [i86]Shidong Wang, Wei Zeng, Xi Chen, Yu Ye, Yu Qiao, Chi-Wing Fu:
ActFloor-GAN: Activity-Guided Adversarial Networks for Human-Centric Floorplan Design. CoRR abs/2111.03545 (2021) - [i85]Renrui Zhang, Rongyao Fang, Wei Zhang, Peng Gao, Kunchang Li, Jifeng Dai, Yu Qiao, Hongsheng Li:
Tip-Adapter: Training-free CLIP-Adapter for Better Vision-Language Modeling. CoRR abs/2111.03930 (2021) - [i84]Jing Shao, Siyu Chen, Yangguang Li, Kun Wang, Zhenfei Yin, Yinan He, Jianing Teng, Qinghong Sun, Mengya Gao, Jihao Liu, Gengshi Huang, Guanglu Song, Yichao Wu, Yuming Huang, Fenggang Liu, Huan Peng, Shuo Qin, Chengyu Wang, Yujie Wang, Conghui He, Ding Liang, Yu Liu, Fengwei Yu, Junjie Yan, Dahua Lin, Xiaogang Wang, Yu Qiao:
INTERN: A New Learning Paradigm Towards General Vision. CoRR abs/2111.08687 (2021) - [i83]David Junhao Zhang, Kunchang Li, Yunpeng Chen, Yali Wang, Shashwat Chandra, Yu Qiao, Luoqi Liu, Mike Zheng Shou:
MorphMLP: A Self-Attention Free, MLP-Like Backbone for Image and Video. CoRR abs/2111.12527 (2021) - [i82]Zhuofan Zong, Kunchang Li, Guanglu Song, Yali Wang, Yu Qiao, Biao Leng, Yu Liu:
Self-slimmed Vision Transformer. CoRR abs/2111.12624 (2021) - [i81]Changyao Tian, Wenhai Wang, Xizhou Zhu, Xiaogang Wang, Jifeng Dai, Yu Qiao:
VL-LTR: Learning Class-wise Visual-Linguistic Representation for Long-Tailed Visual Recognition. CoRR abs/2111.13579 (2021) - [i80]Teli Ma, Shijie Geng, Mengmeng Wang, Jing Shao, Jiasen Lu, Hongsheng Li, Peng Gao, Yu Qiao:
A Simple Long-Tailed Recognition Baseline via Vision-Language Model. CoRR abs/2111.14745 (2021) - [i79]Renrui Zhang, Ziyu Guo, Wei Zhang, Kunchang Li, Xupeng Miao, Bin Cui, Yu Qiao, Peng Gao, Hongsheng Li:
PointCLIP: Point Cloud Understanding by CLIP. CoRR abs/2112.02413 (2021) - [i78]Liang Pan, Tong Wu, Zhongang Cai, Ziwei Liu, Xumin Yu, Yongming Rao, Jiwen Lu, Jie Zhou, Mingye Xu, Xiaoyuan Luo, Kexue Fu, Peng Gao, Manning Wang, Yali Wang, Yu Qiao, Junsheng Zhou, Xin Wen, Peng Xiang, Yu-Shen Liu, Zhizhong Han, Yuanjie Yan, Junyi An, Lifa Zhu, Changwei Lin, Dongrui Liu, Xin Li, Francisco Gómez Fernández, Qinlong Wang, Yang Yang:
Multi-View Partial (MVP) Point Cloud Challenge 2021 on Completion and Registration: Methods and Results. CoRR abs/2112.12053 (2021) - [i77]Xiangtao Kong, Xina Liu, Jinjin Gu, Yu Qiao, Chao Dong:
Reflash Dropout in Image Super-Resolution. CoRR abs/2112.12089 (2021) - 2020
- [j55]Yuping Ye, Zhan Song, Junguang Guo, Yu Qiao:
SIAT-3DFE: A High-Resolution 3D Facial Expression Dataset. IEEE Access 8: 48205-48211 (2020) - [j54]Yu Qiao, Yuhao Liu, Qiang Zhu, Xin Yang, Yuxin Wang, Qiang Zhang, Xiaopeng Wei:
Multi-scale Information Assembly for Image Matting. Comput. Graph. Forum 39(7): 565-574 (2020) - [j53]Jiaze Wang, Xiaojiang Peng, Yu Qiao:
Cascade multi-head attention networks for action recognition. Comput. Vis. Image Underst. 192: 102898 (2020) - [j52]Qing Li, Xiaojiang Peng, Liangliang Cao, Wenbin Du, Hao Xing, Yu Qiao, Qiang Peng:
Product image recognition with guidance learning and noisy supervision. Comput. Vis. Image Underst. 196: 102963 (2020) - [j51]Xiaoxing Zeng, Xiaojiang Peng, Yali Wang, Yu Qiao:
Finding hard faces with better proposals and classifier. Mach. Vis. Appl. 31(7): 61 (2020) - [j50]Fei Li, Diping Song, Han Chen, Jian Xiong, Xingyi Li, Hua Zhong, Guangxian Tang, Sujie Fan, Dennis S. C. Lam, Weihua Pan, Yajuan Zheng, Ying Li, Guoxiang Qu, Junjun He, Zhe Wang, Ling Jin, Rouxi Zhou, Yunhe Song, Yi Sun, Weijing Cheng, Chunman Yang, Yazhi Fan, Yingjie Li, Hengli Zhang, Ye Yuan, Yang Xu, Yunfan Xiong, Lingfei Jin, Aiguo Lv, Lingzhi Niu, Yuhong Liu, Shaoli Li, Jiani Zhang, Linda M. Zangwill, Alejandro F. Frangi, Tin Aung, Ching-Yu Cheng, Yu Qiao, Xiulan Zhang, Daniel S. W. Ting:
Development and clinical deployment of a smartphone-based visual field deep learning system for glaucoma detection. npj Digit. Medicine 3 (2020) - [j49]Qing Li, Xiaojiang Peng, Yu Qiao, Qiang Peng:
Learning label correlations for multi-label image recognition with graph networks. Pattern Recognit. Lett. 138: 378-384 (2020) - [j48]Hao Chen, Yali Wang, Guoyou Wang, Xiang Bai, Yu Qiao:
Progressive Object Transfer Detection. IEEE Trans. Image Process. 29: 986-1000 (2020) - [j47]Kai Wang, Xiaojiang Peng, Jianfei Yang, Debin Meng, Yu Qiao:
Region Attention Networks for Pose and Occlusion Robust Facial Expression Recognition. IEEE Trans. Image Process. 29: 4057-4069 (2020) - [j46]Xianyu Chen, Yali Wang, Jianzhuang Liu, Yu Qiao:
DID: Disentangling-Imprinting-Distilling for Continuous Low-Shot Detection. IEEE Trans. Image Process. 29: 7765-7778 (2020) - [j45]Haidong Lan, Jintao Meng, Christian Hundt, Bertil Schmidt, Minwen Deng, Xiaoning Wang, Weiguo Liu, Yu Qiao, Shengzhong Feng:
FeatherCNN: Fast Inference Computation with TensorGEMM on ARM Architectures. IEEE Trans. Parallel Distributed Syst. 31(3): 580-594 (2020) - [c170]Jing Li, Jing Xu, Fangwei Zhong, Xiangyu Kong, Yu Qiao, Yizhou Wang:
Pose-Assisted Multi-Camera Collaboration for Active Object Tracking. AAAI 2020: 759-766 - [c169]Yu Dong, Yihao Liu, He Zhang, Shifeng Chen, Yu Qiao:
FD-GAN: Generative Adversarial Networks with Fusion-Discriminator for Single Image Dehazing. AAAI 2020: 10729-10736 - [c168]Bin Fu, Junjun He, Zhengfu Zhang, Yu Qiao:
Dynamic Sampling Network for Semantic Segmentation. AAAI 2020: 10794-10801 - [c167]Mingye Xu, Zhipeng Zhou, Yu Qiao:
Geometry Sharing Network for 3D Point Cloud Classification and Segmentation. AAAI 2020: 12500-12507 - [c166]Ze Yang, Yali Wang, Xianyu Chen, Jianzhuang Liu, Yu Qiao:
Context-Transformer: Tackling Object Confusion for Few-Shot Detection. AAAI 2020: 12653-12660 - [c165]Peiqin Zhuang, Yali Wang, Yu Qiao:
Learning Attentive Pairwise Interaction for Fine-Grained Classification. AAAI 2020: 13130-13137 - [c164]Kuan Xu, Chilin Fu, Xiaolu Zhang, Cen Chen, Ya-Lin Zhang, Wenge Rong, Zujie Wen, Jun Zhou, Xiaolong Li, Yu Qiao:
aDMSCN: A Novel Perspective for User Intent Prediction in Customer Service Bots. CIKM 2020: 2853-2860 - [c163]Xianhang Li, Yali Wang, Zhipeng Zhou, Yu Qiao:
SmallBigNet: Integrating Core and Contextual Views for Video Classification. CVPR 2020: 1089-1098 - [c162]Sijie Ji, Kai Wang, Xiaojiang Peng, Jianfei Yang, Zhaoyang Zeng, Yu Qiao:
Multiple Transfer Learning and Multi-label Balanced Training Strategies for Facial AU Detection In the Wild. CVPR Workshops 2020: 1657-1661 - [c161]Shijie Yu, Shihua Li, Dapeng Chen, Rui Zhao, Junjie Yan, Yu Qiao:
COCAS: A Large-Scale Clothes Changing Person Dataset for Re-Identification. CVPR 2020: 3397-3406 - [c160]Shuai Bai, Zhiqun He, Yu Qiao, Hanzhe Hu, Wei Wu, Junjie Yan:
Adaptive Dilated Network With Self-Correction Supervision for Counting. CVPR 2020: 4593-4602 - [c159]Wu Shi, Yu Qiao:
Fast Texture Synthesis via Pseudo Optimizer. CVPR 2020: 5497-5506 - [c158]Kai Wang, Xiaojiang Peng, Jianfei Yang, Shijian Lu, Yu Qiao:
Suppressing Uncertainties for Large-Scale Facial Expression Recognition. CVPR 2020: 6896-6905 - [c157]Yu Qiao, Yuhao Liu, Xin Yang, Dongsheng Zhou, Mingliang Xu, Qiang Zhang, Xiaopeng Wei:
Attention-Guided Hierarchical Structure Aggregation for Image Matting. CVPR 2020: 13673-13682 - [c156]Kai Zhang, Martin Danelljan, Yawei Li, Radu Timofte, Jie Liu, Jie Tang, Gangshan Wu, Yu Zhu, Xiangyu He, Wenjie Xu, Chenghua Li, Cong Leng, Jian Cheng, Guangyang Wu, Wenyi Wang, Xiaohong Liu, Hengyuan Zhao, Xiangtao Kong, Jingwen He, Yu Qiao, Chao Dong, Xiaotong Luo, Liang Chen, Jiangtao Zhang, Maitreya Suin, Kuldeep Purohit, A. N. Rajagopalan, Xiaochuan Li, Zhiqiang Lang, Jiangtao Nie, Wei Wei, Lei Zhang, Abdul Muqeet, Jiwon Hwang, Subin Yang, Jung Heum Kang, Sung-Ho Bae, Yongwoo Kim, Yanyun Qu, Geun-Woo Jeon, Jun-Ho Choi, Jun-Hyuk Kim, Jong-Seok Lee, Steven Marty, Éric Marty, Dongliang Xiong, Siang Chen, Lin Zha, Jiande Jiang, Xinbo Gao, Wen Lu, Haicheng Wang, Vineeth Bhaskara, Alex Levinshtein, Stavros Tsogkas, Allan D. Jepson, Xiangzhen Kong, Tongtong Zhao, Shanshan Zhao, Hrishikesh P. S, Densen Puthussery, C. V. Jiji, Nan Nan, Shuai Liu, Jie Cai, Zibo Meng, Jiaming Ding, Chiu Man Ho, Xuehui Wang, Qiong Yan, Yuzhi Zhao, Long Chen, Long Sun, Wenhao Wang, Zhenbing Liu, Rushi Lan, Rao Muhammad Umer, Christian Micheloni:
AIM 2020 Challenge on Efficient Super-Resolution: Methods and Results. ECCV Workshops (3) 2020: 5-40 - [c155]Sanghyun Son, Jaerin Lee, Seungjun Nah, Radu Timofte, Kyoung Mu Lee, Yihao Liu, Liangbin Xie, Siyao Li, Wenxiu Sun, Yu Qiao, Chao Dong, Woonsung Park, Wonyong Seo, Munchurl Kim, Wenhao Zhang, Pablo Navarrete Michelini, Kazutoshi Akita, Norimichi Ukita:
AIM 2020 Challenge on Video Temporal Super-Resolution. ECCV Workshops (4) 2020: 23-40 - [c154]Yihao Liu, Liangbin Xie, Siyao Li, Wenxiu Sun, Yu Qiao, Chao Dong:
Enhanced Quadratic Video Interpolation. ECCV Workshops (4) 2020: 41-56 - [c153]Jingwen He, Chao Dong, Yu Qiao:
Interactive Multi-dimension Modulation with Dynamic Controllable Residual Learning for Image Restoration. ECCV (20) 2020: 53-68 - [c152]Hengyuan Zhao, Xiangtao Kong, Jingwen He, Yu Qiao, Chao Dong:
Efficient Image Super-Resolution Using Pixel Attention. ECCV Workshops (3) 2020: 56-72 - [c151]Xiao Zhang, Rui Zhao, Yu Qiao, Hongsheng Li:
RBF-Softmax: Learning Deep Representative Prototypes with Radial Basis Function Softmax. ECCV (26) 2020: 296-311 - [c150]Mingfei Han, Yali Wang, Xiaojun Chang, Yu Qiao:
Mining Inter-Video Proposal Relations for Video Object Detection. ECCV (21) 2020: 431-446 - [c149]Zhi Hou, Xiaojiang Peng, Yu Qiao, Dacheng Tao:
Visual Compositional Learning for Human-Object Interaction Detection. ECCV (15) 2020: 584-600 - [c148]Jin Ye, Junjun He, Xiaojiang Peng, Wenhao Wu, Yu Qiao:
Attention-Driven Dynamic Graph Convolutional Network for Multi-label Image Recognition. ECCV (21) 2020: 649-665 - [c147]Jingwen He, Yihao Liu, Yu Qiao, Chao Dong:
Conditional Sequential Modulation for Efficient Global Image Retouching. ECCV (13) 2020: 679-695 - [c146]Kaisiyuan Wang, Qianyi Wu, Linsen Song, Zhuoqian Yang, Wayne Wu, Chen Qian, Ran He, Yu Qiao, Chen Change Loy:
MEAD: A Large-Scale Audio-Visual Dataset for Emotional Talking-Face Generation. ECCV (21) 2020: 700-717 - [c145]Jianbo Liu, Junjun He, Yu Qiao, Jimmy S. Ren, Hongsheng Li:
Learning to Predict Context-Adaptive Convolution for Semantic Segmentation. ECCV (25) 2020: 769-786 - [c144]Xiaojiang Peng, Kai Wang, Zhaoyang Zeng, Qing Li, Jianfei Yang, Yu Qiao:
Suppressing Mislabeled Data via Grouping and Self-attention. ECCV (16) 2020: 786-802 - [c143]Xingyu Fan, Zhongying Deng, Kai Wang, Xiaojiang Peng, Yu Qiao:
Learning Discriminative Representation For Facial Expression Recognition From Uncertainties. ICIP 2020: 903-907 - [c142]Cheng Li, Jin Ye, Junjun He, Shanshan Wang, Yu Qiao, Lixu Gu:
Dense Correlation Network for Automated Multi-Label Ocular Disease Detection with Paired Color Fundus Photographs. ISBI 2020: 1-4 - [c141]Junjun He, Cheng Li, Jin Ye, Shanshan Wang, Yu Qiao, Lixu Gu:
Classification of Ocular Diseases Employing Attention-Based Unilateral and Bilateral Feature Weighting and Fusion. ISBI 2020: 1258-1261 - [i76]Jing Li, Jing Xu, Fangwei Zhong, Xiangyu Kong, Yu Qiao, Yizhou Wang:
Pose-Assisted Multi-Camera Collaboration for Active Object Tracking. CoRR abs/2001.05161 (2020) - [i75]Yu Dong, Yihao Liu, He Zhang, Shifeng Chen, Yu Qiao:
FD-GAN: Generative Adversarial Networks with Fusion-discriminator for Single Image Dehazing. CoRR abs/2001.06968 (2020) - [i74]Wen Wang, Xiaojiang Peng, Yu Qiao, Jian Cheng:
A Comprehensive Study on Temporal Modeling for Online Action Detection. CoRR abs/2001.07501 (2020) - [i73]Hao Chen, Yali Wang, Guoyou Wang, Xiang Bai, Yu Qiao:
Progressive Object Transfer Detection. CoRR abs/2002.04741 (2020) - [i72]Peiqin Zhuang, Yali Wang, Yu Qiao:
Learning Attentive Pairwise Interaction for Fine-Grained Classification. CoRR abs/2002.10191 (2020) - [i71]Kai Wang, Xiaojiang Peng, Jianfei Yang, Shijian Lu, Yu Qiao:
Suppressing Uncertainties for Large-Scale Facial Expression Recognition. CoRR abs/2002.10392 (2020) - [i70]Zhanzhan Cheng, Yunlu Xu, Mingjian Cheng, Yu Qiao, Shiliang Pu, Yi Niu, Fei Wu:
Refined Gate: A Simple and Effective Gating Mechanism for Recurrent Units. CoRR abs/2002.11338 (2020) - [i69]Wen Wang, Xiaojiang Peng, Yanzhou Su, Yu Qiao, Jian Cheng:
TTPP: Temporal Transformer with Progressive Prediction for Efficient Action Anticipation. CoRR abs/2003.03530 (2020) - [i68]Ze Yang, Yali Wang, Xianyu Chen, Jianzhuang Liu, Yu Qiao:
Context-Transformer: Tackling Object Confusion for Few-Shot Detection. CoRR abs/2003.07304 (2020) - [i67]Kaiyang Zhou, Yongxin Yang, Yu Qiao, Tao Xiang:
Domain Adaptive Ensemble Learning. CoRR abs/2003.07325 (2020) - [i66]Jianbo Liu, Junjun He, Jimmy S. Ren, Yu Qiao, Hongsheng Li:
Learning to Predict Context-adaptive Convolution for Semantic Segmentation. CoRR abs/2004.08222 (2020) - [i65]Shijie Yu, Shihua Li, Dapeng Chen, Rui Zhao, Junjie Yan, Yu Qiao:
COCAS: A Large-Scale Clothes Changing Person Dataset for Re-identification. CoRR abs/2005.07862 (2020) - [i64]Xianhang Li, Yali Wang, Zhipeng Zhou, Yu Qiao:
SmallBigNet: Integrating Core and Contextual Views for Video Classification. CoRR abs/2006.14582 (2020) - [i63]Zhi Hou, Xiaojiang Peng, Yu Qiao, Dacheng Tao:
Visual Compositional Learning for Human-Object Interaction Detection. CoRR abs/2007.12407 (2020) - [i62]Ruicheng Feng, Weipeng Guan, Yu Qiao, Chao Dong:
Exploring Multi-Scale Feature Propagation and Communication for Image Super Resolution. CoRR abs/2008.00239 (2020) - [i61]Yihao Liu, Liangbin Xie, Siyao Li, Wenxiu Sun, Yu Qiao, Chao Dong:
Enhanced Quadratic Video Interpolation. CoRR abs/2009.04642 (2020) - [i60]Haisheng Su, Jing Su, Dongliang Wang, Weihao Gan, Wei Wu, Mengmeng Wang, Junjie Yan, Yu Qiao:
Collaborative Distillation in the Parameter and Spectrum Domains for Video Action Recognition. CoRR abs/2009.06902 (2020) - [i59]Kai Zhang, Martin Danelljan, Yawei Li, Radu Timofte, Jie Liu, Jie Tang, Gangshan Wu, Yu Zhu, Xiangyu He, Wenjie Xu, Chenghua Li, Cong Leng, Jian Cheng, Guangyang Wu, Wenyi Wang, Xiaohong Liu, Hengyuan Zhao, Xiangtao Kong, Jingwen He, Yu Qiao, Chao Dong, Jiangtao Zhang, Maitreya Suin, Kuldeep Purohit, A. N. Rajagopalan, Xiaochuan Li, Zhiqiang Lang, Jiangtao Nie, Wei Wei, Lei Zhang, Abdul Muqeet, Jiwon Hwang, Subin Yang, Jung Heum Kang, Sung-Ho Bae, Yongwoo Kim, Liang Chen, Xiaotong Luo, Yanyun Qu, Geun-Woo Jeon, Jun-Ho Choi, Jun-Hyuk Kim, Jong-Seok Lee, Steven Marty, Éric Marty, Dongliang Xiong, Siang Chen, Lin Zha, Jiande Jiang, Xinbo Gao, Wen Lu, Haicheng Wang, Vineeth Bhaskara, Alex Levinshtein, Stavros Tsogkas, Allan D. Jepson, Xiangzhen Kong, Tongtong Zhao, Shanshan Zhao, Hrishikesh P. S, Densen Puthussery, C. Victor Jiji, Nan Nan, Shuai Liu, Jie Cai, Zibo Meng, Jiaming Ding, Chiu Man Ho, Xuehui Wang, Qiong Yan, Yuzhi Zhao, Long Chen, Long Sun, Wenhao Wang, Zhenbing Liu, Rushi Lan, Rao Muhammad Umer, Christian Micheloni:
AIM 2020 Challenge on Efficient Super-Resolution: Methods and Results. CoRR abs/2009.06943 (2020) - [i58]Haisheng Su, Weihao Gan, Wei Wu, Junjie Yan, Yu Qiao:
BSN++: Complementary Boundary Regressor with Scale-Balanced Relation Modeling for Temporal Action Proposal Generation. CoRR abs/2009.07641 (2020) - [i57]Jingwen He, Yihao Liu, Yu Qiao, Chao Dong:
Conditional Sequential Modulation for Efficient Global Image Retouching. CoRR abs/2009.10390 (2020) - [i56]Hengyuan Zhao, Xiangtao Kong, Jingwen He, Yu Qiao, Chao Dong:
Efficient Image Super-Resolution Using Pixel Attention. CoRR abs/2010.01073 (2020) - [i55]Xiaojiang Peng, Kai Wang, Zhaoyang Zeng, Qing Li, Jianfei Yang, Yu Qiao:
Suppressing Mislabeled Data via Grouping and Self-Attention. CoRR abs/2010.15603 (2020) - [i54]Jin Ye, Junjun He, Xiaojiang Peng, Wenhao Wu, Yu Qiao:
Attention-Driven Dynamic Graph Convolutional Network for Multi-Label Image Recognition. CoRR abs/2012.02994 (2020) - [i53]Mutian Xu, Junhao Zhang, Zhipeng Zhou, Mingye Xu, Xiaojuan Qi, Yu Qiao:
Learning Geometry-Disentangled Representation for Complementary Understanding of 3D Object Point Cloud. CoRR abs/2012.10921 (2020) - [i52]Hengshun Zhou, Debin Meng, Yuanyuan Zhang, Xiaojiang Peng, Jun Du, Kai Wang, Yu Qiao:
Exploring Emotion Features and Fusion Strategies for Audio-Video Emotion Recognition. CoRR abs/2012.13912 (2020)
2010 – 2019
- 2019
- [j44]Fei Li, Zhe Wang, Guoxiang Qu, Diping Song, Ye Yuan, Yang Xu, Kai Gao, Guangwei Luo, Zegu Xiao, Dennis S. C. Lam, Hua Zhong, Yu Qiao, Xiulan Zhang:
Correction to: Automatic differentiation of Glaucoma visual field from non-glaucoma visual field using deep convolutional neural network. BMC Medical Imaging 19(1): 40:1 (2019) - [j43]Yandong Wen, Kaipeng Zhang, Zhifeng Li, Yu Qiao:
A Comprehensive Study on Center Loss for Deep Face Recognition. Int. J. Comput. Vis. 127(6-7): 668-683 (2019) - [j42]Yanpeng Cao, Dayan Guan, Weilin Huang, Jiangxin Yang, Yanlong Cao, Yu Qiao:
Pedestrian detection with unsupervised multispectral feature learning using deep neural networks. Inf. Fusion 46: 206-217 (2019) - [j41]Jianhan Mei, Ziming Wu, Xiang Chen, Yu Qiao, Henghui Ding, Xudong Jiang:
DeepDeblur: text image recovery from blur to sharp. Multim. Tools Appl. 78(13): 18869-18885 (2019) - [j40]Limin Wang, Yuanjun Xiong, Zhe Wang, Yu Qiao, Dahua Lin, Xiaoou Tang, Luc Van Gool:
Temporal Segment Networks for Action Recognition in Videos. IEEE Trans. Pattern Anal. Mach. Intell. 41(11): 2740-2755 (2019) - [j39]Hanyu Peng, Junjun He, Shifeng Chen, Yali Wang, Yu Qiao:
Dual-supervised attention network for deep cross-modal hashing. Pattern Recognit. Lett. 128: 333-339 (2019) - [j38]Wenjuan Gong, Bin Zhang, Chaoqi Wang, Hanbing Yue, Chuantao Li, Linjie Xing, Yu Qiao, Weishan Zhang, Faming Gong:
A Literature Review: Geometric Methods and Their Applications in Human-Related Analysis. Sensors 19(12): 2809 (2019) - [j37]Zhongying Deng, Xiaojiang Peng, Zhifeng Li, Yu Qiao:
Mutual Component Convolutional Neural Networks for Heterogeneous Face Recognition. IEEE Trans. Image Process. 28(6): 3102-3114 (2019) - [c140]Zhongying Deng, Xiaojiang Peng, Yu Qiao:
Residual Compensation Networks for Heterogeneous Face Recognition. AAAI 2019: 8239-8246 - [c139]Ruicheng Feng, Jinjin Gu, Yu Qiao, Chao Dong:
Suppressing Model Overfitting for Image Super-Resolution Networks. CVPR Workshops 2019: 1964-1973 - [c138]Jianrui Cai, Shuhang Gu, Radu Timofte, Lei Zhang, Xiao Liu, Yukang Ding, Dongliang He, Chao Li, Yi Fu, Shilei Wen, Ruicheng Feng, Jinjin Gu, Yu Qiao, Chao Dong, Dongwon Park, Se Young Chun, Sanghoon Yoon, Junhyung Kwak, Donghee Son, Syed Waqas Zamir, Aditya Arora, Salman H. Khan, Fahad Shahbaz Khan, Ling Shao, Zhengping Wei, Lei Liu, Hong Cai, Darui Li, Fujie Gao, Zheng Hui, Xiumei Wang, Xinbo Gao, Guoan Cheng, Ai Matsune, Qiuyu Li, Leilei Zhu, Huaijuan Zang, Shu Zhan, Yajun Qiu, Ruxin Wang, Jiawei Li, Yongcheng Jing, Mingli Song, Pengju Liu, Kai Zhang, Jingdong Liu, Jiye Liu, Hongzhi Zhang, Wangmeng Zuo, Wenyi Tang, Jing Liu, Youngjung Kim, Changyeop Shin, Minbeom Kim, Sungho Kim, Pablo Navarrete Michelini, Hanwen Liu, Dan Zhu, Xuan Xu, Xin Li, Furui Bai, Xiaopeng Sun, Lin Zha, Yuanfei Huang, Wen Lu, Yanpeng Cao, Du Chen, Zewei He, Anshun Sun, Siliang Tang, Hongfei Fan, Xiang Li, Guo Li, Wenjie Zhang, Yumei Zhang, Qingwen He, Jinghui Qin, Lishan Huang, Yukai Shi, Pengxu Wei, Wushao Wen, Liang Lin, Jun Yu, Guochen Xie, Mengyan Li, Rong Chen, Xiaotong Luo, Chen Hong, Yanyun Qu, Cuihua Li, Zhi-Song Liu, Li-Wen Wang, Chu-Tak Li, Can Zhao, Bowen Li, Chung-Chi Tsai, Shang-Chih Chuang, Joonhee Choi, Joonsoo Kim, Xiaoyun Jiang, Ze Pan, Qunbo Lv, Zheng Tan, Peidong He:
NTIRE 2019 Challenge on Real Image Super-Resolution: Methods and Results. CVPR Workshops 2019: 2211-2223 - [c137]Weihe Zhang, Yali Wang, Yu Qiao:
MetaCleaner: Learning to Hallucinate Clean Representations for Noisy-Labeled Visual Recognition. CVPR 2019: 7373-7382 - [c136]Junjun He, Zhongying Deng, Lei Zhou, Yali Wang, Yu Qiao:
Adaptive Pyramid Context Network for Semantic Segmentation. CVPR 2019: 7519-7528 - [c135]An Yan, Yali Wang, Zhifeng Li, Yu Qiao:
PA3D: Pose-Action 3D Machine for Video Recognition. CVPR 2019: 7922-7931 - [c134]Xiao Zhang, Rui Zhao, Junjie Yan, Mengya Gao, Yu Qiao, Xiaogang Wang, Hongsheng Li:
P2SGrad: Refined Gradients for Optimizing Deep Face Models. CVPR 2019: 9906-9914 - [c133]Xiao Zhang, Rui Zhao, Yu Qiao, Xiaogang Wang, Hongsheng Li:
AdaCos: Adaptively Scaling Cosine Logits for Effectively Learning Deep Face Representations. CVPR 2019: 10823-10832 - [c132]Jingwen He, Chao Dong, Yu Qiao:
Modulating Image Restoration With Continual Levels via Adaptive Feature Modification Layers. CVPR 2019: 11056-11064 - [c131]Xiaoxing Zeng, Xiaojiang Peng, Yu Qiao:
DF2Net: A Dense-Fine-Finer Network for Detailed 3D Face Reconstruction. ICCV 2019: 2315-2324 - [c130]Wenlong Zhang, Yihao Liu, Chao Dong, Yu Qiao:
RankSRGAN: Generative Adversarial Networks With Ranker for Image Super-Resolution. ICCV 2019: 3096-3105 - [c129]Junjun He, Zhongying Deng, Yu Qiao:
Dynamic Multi-Scale Filters for Semantic Segmentation. ICCV 2019: 3561-3571 - [c128]Jin Ye, Xiaojiang Peng, Yu Qiao, Hao Xing, Junli Li, Rongrong Ji:
Visual-Textual Sentiment Analysis in Product Reviews. ICIP 2019: 869-873 - [c127]Debin Meng, Xiaojiang Peng, Kai Wang, Yu Qiao:
Frame Attention Networks for Facial Expression Recognition in Videos. ICIP 2019: 3866-3870 - [c126]Kai Wang, Jianfei Yang, Da Guo, Kaipeng Zhang, Xiaojiang Peng, Yu Qiao:
Bootstrap Model Ensemble and Rank Loss for Engagement Intensity Regression. ICMI 2019: 551-556 - [c125]Da Guo, Kai Wang, Jianfei Yang, Kaipeng Zhang, Xiaojiang Peng, Yu Qiao:
Exploring Regularizations with Face, Body and Image Cues for Group Cohesion Prediction. ICMI 2019: 557-561 - [c124]Hengshun Zhou, Debin Meng, Yuanyuan Zhang, Xiaojiang Peng, Jun Du, Kai Wang, Yu Qiao:
Exploring Emotion Features and Fusion Strategies for Audio-Video Emotion Recognition. ICMI 2019: 562-566 - [c123]Wanli Chen, Yue Zhang, Junjun He, Yu Qiao, Yifan Chen, Hongjian Shi, Ed X. Wu, Xiaoying Tang:
Prostate Segmentation using 2D Bridged U-net. IJCNN 2019: 1-7 - [c122]Zhanyu Wang, Zhe Wang, Guoxiang Qu, Fei Li, Ye Yuan, Dennis S. C. Lam, Xiulan Zhang, Yue Zhang, Yu Qiao:
Intelligent Glaucoma Diagnosis Via Active Learning And Adversarial Data Augmentation. ISBI 2019: 1234-1237 - [c121]Muchao Ye, Xiaojiang Peng, Weihao Gan, Wei Wu, Yu Qiao:
AnoPCN: Video Anomaly Detection via Deep Predictive Coding Network. ACM Multimedia 2019: 1805-1813 - [c120]Jiangyu Lai, Lanqing Guo, Yu Qiao, Xiaolong Chen, Zhengfu Zhang, Canping Liu, Ying Li, Bin Fu:
Robust Text Line Detection in Equipment Nameplate Images. ROBIO 2019: 889-894 - [c119]Xiaolong Chen, Zhengfu Zhang, Yu Qiao, Jiangyu Lai, Jian Jiang, Zeyu Zhang, Bin Fu:
Orientation Robust Scene Text Recognition in Natural Scene. ROBIO 2019: 901-906 - [c118]Xiaolong Chen, Zhengfu Zhang, Yu Qiao, Pu Zhang, Lanqing Guo, Wenrui Chen, Chen Chen, Bin Fu:
The Equipment Nameplate Dataset for Scene Text Detection and Recognition∗. ROBIO 2019: 907-912 - [i51]Jingwen He, Chao Dong, Yu Qiao:
Modulating Image Restoration with Continual Levels via Adaptive Feature Modification Layers. CoRR abs/1904.08118 (2019) - [i50]Xiao Zhang, Rui Zhao, Yu Qiao, Xiaogang Wang, Hongsheng Li:
AdaCos: Adaptively Scaling Cosine Logits for Effectively Learning Deep Face Representations. CoRR abs/1905.00292 (2019) - [i49]Xiao Zhang, Rui Zhao, Junjie Yan, Mengya Gao, Yu Qiao, Xiaogang Wang, Hongsheng Li:
P2SGrad: Refined Gradients for Optimizing Deep Face Models. CoRR abs/1905.02479 (2019) - [i48]Kai Wang, Xiaojiang Peng, Jianfei Yang, Debin Meng, Yu Qiao:
Region Attention Networks for Pose and Occlusion Robust Facial Expression Recognition. CoRR abs/1905.04075 (2019) - [i47]Ruicheng Feng, Jinjin Gu, Yu Qiao, Chao Dong:
Suppressing Model Overfitting for Image Super-Resolution Networks. CoRR abs/1906.04809 (2019) - [i46]Debin Meng, Xiaojiang Peng, Kai Wang, Yu Qiao:
frame attention networks for facial expression recognition in videos. CoRR abs/1907.00193 (2019) - [i45]Kai Wang, Jianfei Yang, Da Guo, Kaipeng Zhang, Xiaojiang Peng, Yu Qiao:
Bootstrap Model Ensemble and Rank Loss for Engagement Intensity Regression. CoRR abs/1907.03422 (2019) - [i44]Qing Li, Xiaojiang Peng, Liangliang Cao, Wenbin Du, Hao Xing, Yu Qiao:
Product Image Recognition with Guidance Learning and Noisy Supervision. CoRR abs/1907.11384 (2019) - [i43]Wenlong Zhang, Yihao Liu, Chao Dong, Yu Qiao:
RankSRGAN: Generative Adversarial Networks with Ranker for Image Super-Resolution. CoRR abs/1908.06382 (2019) - [i42]Qing Li, Xiaojiang Peng, Yu Qiao, Qiang Peng:
Learning Category Correlations for Multi-label Image Recognition with Graph Networks. CoRR abs/1909.13005 (2019) - [i41]Jingwen He, Chao Dong, Yu Qiao:
Multi-Dimension Modulation for Image Restoration with Dynamic Controllable Residual Learning. CoRR abs/1912.05293 (2019) - [i40]Mingye Xu, Zhipeng Zhou, Yu Qiao:
Geometry Sharing Network for 3D Point Cloud Classification and Segmentation. CoRR abs/1912.10644 (2019) - 2018
- [j36]Fei Li, Zhe Wang, Guoxiang Qu, Diping Song, Ye Yuan, Yang Xu, Kai Gao, Guangwei Luo, Zegu Xiao, Dennis S. C. Lam, Hua Zhong, Yu Qiao, Xiulan Zhang:
Automatic differentiation of Glaucoma visual field from non-glaucoma visual filed using deep convolutional neural network. BMC Medical Imaging 18(1): 35:1-35:7 (2018) - [j35]Limin Wang, Zhe Wang, Yu Qiao, Luc Van Gool:
Transferring Deep Object and Scene Representations for Event Recognition in Still Images. Int. J. Comput. Vis. 126(2-4): 390-409 (2018) - [j34]Lei Xiang, Qian Wang, Dong Nie, Lichi Zhang, Xiyao Jin, Yu Qiao, Dinggang Shen:
Deep embedding convolutional neural network for synthesizing CT image from T1-Weighted MR image. Medical Image Anal. 47: 31-44 (2018) - [j33]Wenbin Du, Yali Wang, Yu Qiao:
Recurrent Spatial-Temporal Attention Network for Action Recognition in Videos. IEEE Trans. Image Process. 27(3): 1347-1360 (2018) - [j32]Bowen Zhang, Limin Wang, Zhe Wang, Yu Qiao, Hanli Wang:
Real-Time Action Recognition With Deeply Transferred Motion Vector CNNs. IEEE Trans. Image Process. 27(5): 2326-2339 (2018) - [c117]Hao Chen, Yali Wang, Guoyou Wang, Yu Qiao:
LSTD: A Low-Shot Transfer Detector for Object Detection. AAAI 2018: 2836-2843 - [c116]Kaiyang Zhou, Yu Qiao, Tao Xiang:
Deep Reinforcement Learning for Unsupervised Video Summarization With Diversity-Representativeness Reward. AAAI 2018: 7582-7589 - [c115]Xiaoyu Yue, Zhanghui Kuang, Zhaoyang Zhang, Zhenfang Chen, Pan He, Yu Qiao, Wei Zhang:
Boosting up Scene Text Detectors with Guided CNN. BMVC 2018: 307 - [c114]Shixiang Wu, Tianqi Fan, Chao Dong, Yu Qiao:
RDS-Denoiser: a Detail-preserving Convolutional Neural Network for Image Denoising. CBS 2018: 127-132 - [c113]Tong He, Zhi Tian, Weilin Huang, Chunhua Shen, Yu Qiao, Changming Sun:
An End-to-End TextSpotter With Explicit Alignment and Attention. CVPR 2018: 5020-5029 - [c112]Yali Wang, Lei Zhou, Yu Qiao:
Temporal Hallucinating for Action Recognition With Few Still Images. CVPR 2018: 5314-5322 - [c111]Xuebo Liu, Ding Liang, Shi Yan, Dagui Chen, Yu Qiao, Junjie Yan:
FOTS: Fast Oriented Text Spotting With a Unified Network. CVPR 2018: 5676-5685 - [c110]Xintao Wang, Ke Yu, Shixiang Wu, Jinjin Gu, Yihao Liu, Chao Dong, Yu Qiao, Chen Change Loy:
ESRGAN: Enhanced Super-Resolution Generative Adversarial Networks. ECCV Workshops (5) 2018: 63-79 - [c109]Yifan Xu, Tianqi Fan, Mingye Xu, Long Zeng, Yu Qiao:
SpiderCNN: Deep Learning on Point Sets with Parameterized Convolutional Filters. ECCV (8) 2018: 90-105 - [c108]Kaipeng Zhang, Zhanpeng Zhang, Chia-Wen Cheng, Winston H. Hsu, Yu Qiao, Wei Liu, Tong Zhang:
Super-Identity Convolutional Neural Network for Face Hallucination. ECCV (11) 2018: 196-211 - [c107]Dian Shao, Yu Xiong, Yue Zhao, Qingqiu Huang, Yu Qiao, Dahua Lin:
Find and Focus: Retrieve and Localize Video Events with Natural Language Queries. ECCV (9) 2018: 202-218 - [c106]Andrey Ignatov, Radu Timofte, Thang Van Vu, Tung Minh Luu, Trung X. Pham, Cao Van Nguyen, Yongwoo Kim, Jae-Seok Choi, Munchurl Kim, Jie Huang, Jiewen Ran, Chen Xing, Xingguang Zhou, Pengfei Zhu, Mingrui Geng, Yawei Li, Eirikur Agustsson, Shuhang Gu, Luc Van Gool, Etienne de Stoutz, Nikolay Kobyshev, Kehui Nie, Yan Zhao, Gen Li, Tong Tong, Qinquan Gao, Hanwen Liu, Pablo Navarrete Michelini, Dan Zhu, Hu Fengshuo, Zheng Hui, Xiumei Wang, Lirui Deng, Rang Meng, Jinghui Qin, Yukai Shi, Wushao Wen, Liang Lin, Ruicheng Feng, Shixiang Wu, Chao Dong, Yu Qiao, Subeesh Vasu, Thekke Madam Nimisha, Praveen Kandula, A. N. Rajagopalan, Jie Liu, Cheolkon Jung:
PIRM Challenge on Perceptual Image Enhancement on Smartphones: Report. ECCV Workshops (5) 2018: 315-333 - [c105]Jianfei Yang, Kai Wang, Xiaojiang Peng, Yu Qiao:
Deep Recurrent Multi-instance Learning with Spatio-temporal Features for Engagement Intensity Prediction. ICMI 2018: 594-598 - [c104]Kai Wang, Xiaoxing Zeng, Jianfei Yang, Debin Meng, Kaipeng Zhang, Xiaojiang Peng, Yu Qiao:
Cascade Attention Networks For Group Emotion Recognition with Face, Body and Image Cues. ICMI 2018: 640-645 - [c103]Wei Zhao, Benyou Wang, Jianbo Ye, Min Yang, Zhou Zhao, Ruotian Luo, Yu Qiao:
A Multi-task Learning Approach for Image Captioning. IJCAI 2018: 1205-1211 - [c102]Fei Li, Zhe Wang, Guoxiang Qu, Yu Qiao, Xiulan Zhang:
Visual Field Based Automatic Diagnosis of Glaucoma Using Deep Convolutional Neural Network. COMPAY/OMIA@MICCAI 2018: 285-293 - [c101]Guoxiang Qu, Wenwei Zhang, Zhe Wang, Xing Dai, Jianping Shi, Junjun He, Fei Li, Xiulan Zhang, Yu Qiao:
StripNet: Towards Topology Consistent Strip Structure Segmentation. ACM Multimedia 2018: 283-291 - [c100]Peiqin Zhuang, Yali Wang, Yu Qiao:
WildFish: A Large Benchmark for Fish Recognition in the Wild. ACM Multimedia 2018: 1301-1309 - [c99]Zhe Wang, Xiaoyi Liu, Limin Wang, Yu Qiao, Xiaohui Xie, Charless C. Fowlkes:
Structured Triplet Learning with POS-Tag Guided Attention for Visual Question Answering. WACV 2018: 1888-1896 - [i39]Kaiyang Zhou, Yu Qiao:
Deep Reinforcement Learning for Unsupervised Video Summarization with Diversity-Representativeness Reward. CoRR abs/1801.00054 (2018) - [i38]Xuebo Liu, Ding Liang, Shi Yan, Dagui Chen, Yu Qiao, Junjie Yan:
FOTS: Fast Oriented Text Spotting with a Unified Network. CoRR abs/1801.01671 (2018) - [i37]Zhe Wang, Xiaoyi Liu, Liangjian Chen, Limin Wang, Yu Qiao, Xiaohui Xie, Charless C. Fowlkes:
Structured Triplet Learning with POS-tag Guided Attention for Visual Question Answering. CoRR abs/1801.07853 (2018) - [i36]Hao Chen, Yali Wang, Guoyou Wang, Yu Qiao:
LSTD: A Low-Shot Transfer Detector for Object Detection. CoRR abs/1803.01529 (2018) - [i35]Tong He, Zhi Tian, Weilin Huang, Chunhua Shen, Yu Qiao, Changming Sun:
An end-to-end TextSpotter with Explicit Alignment and Attention. CoRR abs/1803.03474 (2018) - [i34]Yifan Xu, Tianqi Fan, Mingye Xu, Long Zeng, Yu Qiao:
SpiderCNN: Deep Learning on Point Sets with Parameterized Convolutional Filters. CoRR abs/1803.11527 (2018) - [i33]Xiaoyu Yue, Zhanghui Kuang, Zhaoyang Zhang, Zhenfang Chen, Pan He, Yu Qiao, Wei Zhang:
Boosting up Scene Text Detectors with Guided CNN. CoRR abs/1805.04132 (2018) - [i32]Wanli Chen, Yue Zhang, Junjun He, Yu Qiao, Yifan Chen, Hongjian Shi, Xiaoying Tang:
W-net: Bridged U-net for 2D Medical Image Segmentation. CoRR abs/1807.04459 (2018) - [i31]Xintao Wang, Ke Yu, Shixiang Wu, Jinjin Gu, Yihao Liu, Chao Dong, Chen Change Loy, Yu Qiao, Xiaoou Tang:
ESRGAN: Enhanced Super-Resolution Generative Adversarial Networks. CoRR abs/1809.00219 (2018) - [i30]Andrey Ignatov, Radu Timofte, Thang Van Vu, Tung Minh Luu, Trung X. Pham, Cao Van Nguyen, Yongwoo Kim, Jae-Seok Choi, Munchurl Kim, Jie Huang, Jiewen Ran, Chen Xing, Xingguang Zhou, Pengfei Zhu, Mingrui Geng, Yawei Li, Eirikur Agustsson, Shuhang Gu, Luc Van Gool, Etienne de Stoutz, Nikolay Kobyshev, Kehui Nie, Yan Zhao, Gen Li, Tong Tong, Qinquan Gao, Hanwen Liu, Pablo Navarrete Michelini, Dan Zhu, Hu Fengshuo, Zheng Hui, Xiumei Wang, Lirui Deng, Rang Meng, Jinghui Qin, Yukai Shi, Wushao Wen, Liang Lin, Ruicheng Feng, Shixiang Wu, Chao Dong, Yu Qiao, Subeesh Vasu, Thekke Madam Nimisha, Praveen Kandula, A. N. Rajagopalan, Jie Liu, Cheolkon Jung:
PIRM Challenge on Perceptual Image Enhancement on Smartphones: Report. CoRR abs/1810.01641 (2018) - [i29]Kaipeng Zhang, Zhanpeng Zhang, Chia-Wen Cheng, Winston H. Hsu, Yu Qiao, Wei Liu, Tong Zhang:
Super-Identity Convolutional Neural Network for Face Hallucination. CoRR abs/1811.02328 (2018) - 2017
- [j31]Peng-peng Zhang, Yu Qiao, Shengzheng Wang, Jie Yang, Yuemin Zhu:
A robust coherent point drift approach based on rotation invariant shape context. Neurocomputing 219: 455-473 (2017) - [j30]Yongqiang Gao, Weilin Huang, Yu Qiao:
Learning multiple local binary descriptors for image matching. Neurocomputing 266: 239-246 (2017) - [j29]Lei Xiang, Yu Qiao, Dong Nie, Le An, Weili Lin, Qian Wang, Dinggang Shen:
Deep auto-context convolutional neural networks for standard-dose PET image estimation from low-dose PET/MRI. Neurocomputing 267: 406-416 (2017) - [j28]Sheng Guo, Weilin Huang, Yu Qiao:
Improving scale invariant feature transform with local color contrastive descriptor for image classification. J. Electronic Imaging 26(1): 13015 (2017) - [j27]Sheng Guo, Weilin Huang, Limin Wang, Yu Qiao:
Locally Supervised Deep Hybrid Model for Scene Recognition. IEEE Trans. Image Process. 26(2): 808-820 (2017) - [j26]Zhe Wang, Limin Wang, Yali Wang, Bowen Zhang, Yu Qiao:
Weakly Supervised PatchNets: Describing and Aggregating Local Patches for Scene Recognition. IEEE Trans. Image Process. 26(4): 2028-2041 (2017) - [j25]Limin Wang, Sheng Guo, Weilin Huang, Yuanjun Xiong, Yu Qiao:
Knowledge Guided Disambiguation for Large-Scale Scene Classification With Multi-Resolution CNNs. IEEE Trans. Image Process. 26(4): 2055-2068 (2017) - [c98]Jiaming Liu, Yali Wang, Yu Qiao:
Sparse Deep Transfer Learning for Convolutional Neural Network. AAAI 2017: 2245-2251 - [c97]Huijuan Huang, Zhi Tian, Tong He, Weilin Huang, Yu Qiao:
Orientation-Aware Text Proposals Network for Scene Text Detection. CCBR 2017: 739-749 - [c96]Wei Zhao, Wei Xu, Min Yang, Jianbo Ye, Zhou Zhao, Yabing Feng, Yu Qiao:
Dual Learning for Cross-domain Image Captioning. CIKM 2017: 29-38 - [c95]Peiqin Zhuang, Linjie Xing, Yanlin Liu, Sheng Guo, Yu Qiao:
Marine Animal Detection and Recognition with Advanced Deep Learning Models. CLEF (Working Notes) 2017 - [c94]Radu Timofte, Eirikur Agustsson, Luc Van Gool, Ming-Hsuan Yang, Lei Zhang, Bee Lim, Sanghyun Son, Heewon Kim, Seungjun Nah, Kyoung Mu Lee, Xintao Wang, Yapeng Tian, Ke Yu, Yulun Zhang, Shixiang Wu, Chao Dong, Liang Lin, Yu Qiao, Chen Change Loy, Woong Bae, Jaejun Yoo, Yoseob Han, Jong Chul Ye, Jae-Seok Choi, Munchurl Kim, Yuchen Fan, Jiahui Yu, Wei Han, Ding Liu, Haichao Yu, Zhangyang Wang, Honghui Shi, Xinchao Wang, Thomas S. Huang, Yunjin Chen, Kai Zhang, Wangmeng Zuo, Zhimin Tang, Linkai Luo, Shaohui Li, Min Fu, Lei Cao, Wen Heng, Giang Bui, Truc Le, Ye Duan, Dacheng Tao, Ruxin Wang, Xu Lin, Jianxin Pang, Jinchang Xu, Yu Zhao, Xiangyu Xu, Jin-shan Pan, Deqing Sun, Yujin Zhang, Xibin Song, Yuchao Dai, Xueying Qin, Xuan-Phung Huynh, Tiantong Guo, Hojjat Seyed Mousavi, Tiep Huu Vu, Vishal Monga, Cristóvão Cruz, Karen O. Egiazarian, Vladimir Katkovnik, Rakesh Mehta, Arnav Kumar Jain, Abhinav Agarwalla, Ch V. Sai Praveen, Ruofan Zhou, Hongdiao Wen, Che Zhu, Zhiqiang Xia, Zhengtao Wang, Qi Guo:
NTIRE 2017 Challenge on Single Image Super-Resolution: Methods and Results. CVPR Workshops 2017: 1110-1121 - [c93]Pan He, Weilin Huang, Tong He, Qile Zhu, Yu Qiao, Xiaolin Li:
Single Shot Text Detector with Regional Attention. ICCV 2017: 3066-3074 - [c92]Kaipeng Zhang, Zhanpeng Zhang, Hao Wang, Zhifeng Li, Yu Qiao, Wei Liu:
Detecting Faces Using Inside Cascaded Contextual CNN. ICCV 2017: 3190-3198 - [c91]Wenbin Du, Yali Wang, Yu Qiao:
RPAN: An End-to-End Recurrent Pose-Attention Network for Action Recognition in Videos. ICCV 2017: 3745-3754 - [c90]Xiao Zhang, Zhiyuan Fang, Yandong Wen, Zhifeng Li, Yu Qiao:
Range Loss for Deep Face Recognition with Long-Tailed Training Data. ICCV 2017: 5419-5428 - [c89]Diping Song, Yu Qiao, Alessandro Corbetta:
Depth driven people counting using deep region proposal network. ICIA 2017: 416-421 - [c88]Lianzhi Tan, Kaipeng Zhang, Kai Wang, Xiaoxing Zeng, Xiaojiang Peng, Yu Qiao:
Group emotion recognition with individual facial emotion CNNs and global image based CNNs. ICMI 2017: 549-552 - [e1]Jie Zhou, Yunhong Wang, Zhenan Sun, Yong Xu, Linlin Shen, Jianjiang Feng, Shiguang Shan, Yu Qiao, Zhenhua Guo, Shiqi Yu:
Biometric Recognition - 12th Chinese Conference, CCBR 2017, Shenzhen, China, October 28-29, 2017, Proceedings. Lecture Notes in Computer Science 10568, Springer 2017, ISBN 978-3-319-69922-6 [contents] - [i28]Limin Wang, Yuanjun Xiong, Zhe Wang, Yu Qiao, Dahua Lin, Xiaoou Tang, Luc Van Gool:
Temporal Segment Networks for Action Recognition in Videos. CoRR abs/1705.02953 (2017) - [i27]Pan He, Weilin Huang, Tong He, Qile Zhu, Yu Qiao, Xiaolin Li:
Single Shot Text Detector with Regional Attention. CoRR abs/1709.00138 (2017) - [i26]Lei Xiang, Qian Wang, Dong Nie, Yu Qiao, Dinggang Shen:
Deep Embedding Convolutional Neural Network for Synthesizing CT Image from T1-Weighted MR Image. CoRR abs/1709.02073 (2017) - 2016
- [j24]Xiaojiang Peng, Limin Wang, Xingxing Wang, Yu Qiao:
Bag of visual words and fusion methods for action recognition: Comprehensive study and good practice. Comput. Vis. Image Underst. 150: 109-125 (2016) - [j23]Peng-peng Zhang, Yu Qiao, Sheng-Zheng Wang, Jie Yang:
Reference-omitted affine soft correspondence algorithm. IET Image Process. 10(8): 571-581 (2016) - [j22]Limin Wang, Yu Qiao, Xiaoou Tang:
MoFAP: A Multi-level Representation for Action Recognition. Int. J. Comput. Vis. 119(3): 254-271 (2016) - [j21]Yongqiang Gao, Zhifeng Li, Yu Qiao:
Adaptive Part-Level Model Knowledge Transfer for Gender Classification. IEEE Signal Process. Lett. 23(6): 888-892 (2016) - [j20]Kaipeng Zhang, Zhanpeng Zhang, Zhifeng Li, Yu Qiao:
Joint Face Detection and Alignment Using Multitask Cascaded Convolutional Networks. IEEE Signal Process. Lett. 23(10): 1499-1503 (2016) - [j19]Tong He, Weilin Huang, Yu Qiao, Jian Yao:
Text-Attentional Convolutional Neural Network for Scene Text Detection. IEEE Trans. Image Process. 25(6): 2529-2541 (2016) - [j18]Xixuan Wu, Yu Qiao, Xiaogang Wang, Xiaoou Tang:
Bridging Music and Image via Cross-Modal Ranking Analysis. IEEE Trans. Multim. 18(7): 1305-1318 (2016) - [c87]Pan He, Weilin Huang, Yu Qiao, Chen Change Loy, Xiaoou Tang:
Reading Scene Text in Deep Convolutional Sequences. AAAI 2016: 3501-3508 - [c86]Kaipeng Zhang, Lianzhi Tan, Zhifeng Li, Yu Qiao:
Gender and Smile Classification Using Deep Convolutional Neural Networks. CVPR Workshops 2016: 739-743 - [c85]Wangjiang Zhu, Jie Hu, Gang Sun, Xudong Cao, Yu Qiao:
A Key Volume Mining Deep Framework for Action Recognition. CVPR 2016: 1991-1999 - [c84]Limin Wang, Yu Qiao, Xiaoou Tang, Luc Van Gool:
Actionness Estimation Using Hybrid Fully Convolutional Networks. CVPR 2016: 2708-2717 - [c83]Bowen Zhang, Limin Wang, Zhe Wang, Yu Qiao, Hanli Wang:
Real-Time Action Recognition with Enhanced Motion Vector CNNs. CVPR 2016: 2718-2726 - [c82]Yandong Wen, Zhifeng Li, Yu Qiao:
Latent Factor Guided Convolutional Neural Networks for Age-Invariant Face Recognition. CVPR 2016: 4893-4901 - [c81]Limin Wang, Yuanjun Xiong, Zhe Wang, Yu Qiao, Dahua Lin, Xiaoou Tang, Luc Van Gool:
Temporal Segment Networks: Towards Good Practices for Deep Action Recognition. ECCV (8) 2016: 20-36 - [c80]Zhi Tian, Weilin Huang, Tong He, Pan He, Yu Qiao:
Detecting Text in Natural Image with Connectionist Text Proposal Network. ECCV (8) 2016: 56-72 - [c79]Yandong Wen, Kaipeng Zhang, Zhifeng Li, Yu Qiao:
A Discriminative Feature Learning Approach for Deep Face Recognition. ECCV (7) 2016: 499-515 - [c78]Yali Wang, Lin Li, Yu Qiao:
Human action recognition with DeepAction Kernel Gaussian Process. ICARM 2016: 165-170 - [c77]Zhe Wang, Yali Wang, Limin Wang, Yu Qiao:
Codebook enhancement of vlad representation for visual recognition. ICASSP 2016: 1258-1262 - [c76]Linjie Xing, Yu Qiao:
DeepWriter: A Multi-stream Deep CNN for Text-Independent Writer Identification. ICFHR 2016: 584-589 - [c75]Lianzhi Tan, Zhifeng Li, Yu Qiao:
Deep face attributes recognition using spatial transformer network. ICIA 2016: 1928-1932 - [c74]Du-Xin Liu, Wenbin Du, Xinyu Wu, Can Wang, Yu Qiao:
Deep rehabilitation gait learning for modeling knee joints of lower-limb exoskeleton. ROBIO 2016: 1058-1063 - [c73]Jin Ye, Linjie Xing, Xiaolong Fan, Changzhi Song, Diping Song, Cai-Zhi Zhu, Yu Qiao:
Shenzhen Institutes of Advanced Technology, CAS, China at TRECVID INS 2016. TRECVID 2016 - [i25]Sheng Guo, Weilin Huang, Yu Qiao:
Locally-Supervised Deep Hybrid Model for Scene Recognition. CoRR abs/1601.07576 (2016) - [i24]Tong He, Weilin Huang, Yu Qiao, Jian Yao:
Accurate Text Localization in Natural Image with Cascaded Convolutional Text Network. CoRR abs/1603.09423 (2016) - [i23]Kaipeng Zhang, Zhanpeng Zhang, Zhifeng Li, Yu Qiao:
Joint Face Detection and Alignment using Multi-task Cascaded Convolutional Networks. CoRR abs/1604.02878 (2016) - [i22]Limin Wang, Yu Qiao, Xiaoou Tang, Luc Van Gool:
Actionness Estimation Using Hybrid Fully Convolutional Networks. CoRR abs/1604.07279 (2016) - [i21]Bowen Zhang, Limin Wang, Zhe Wang, Yu Qiao, Hanli Wang:
Real-time Action Recognition with Enhanced Motion Vector CNNs. CoRR abs/1604.07669 (2016) - [i20]Linjie Xing, Yu Qiao:
DeepWriter: A Multi-Stream Deep CNN for Text-independent Writer Identification. CoRR abs/1606.06472 (2016) - [i19]Yuanjun Xiong, Limin Wang, Zhe Wang, Bowen Zhang, Hang Song, Wei Li, Dahua Lin, Yu Qiao, Luc Van Gool, Xiaoou Tang:
CUHK & ETHZ & SIAT Submission to ActivityNet Challenge 2016. CoRR abs/1608.00797 (2016) - [i18]Limin Wang, Yuanjun Xiong, Zhe Wang, Yu Qiao, Dahua Lin, Xiaoou Tang, Luc Van Gool:
Temporal Segment Networks: Towards Good Practices for Deep Action Recognition. CoRR abs/1608.00859 (2016) - [i17]Zhe Wang, Limin Wang, Yali Wang, Bowen Zhang, Yu Qiao:
Weakly Supervised PatchNets: Describing and Aggregating Local Patches for Scene Recognition. CoRR abs/1609.00153 (2016) - [i16]Limin Wang, Zhe Wang, Yu Qiao, Luc Van Gool:
Transferring Object-Scene Convolutional Neural Networks for Event Recognition in Still Images. CoRR abs/1609.00162 (2016) - [i15]Zhi Tian, Weilin Huang, Tong He, Pan He, Yu Qiao:
Detecting Text in Natural Image with Connectionist Text Proposal Network. CoRR abs/1609.03605 (2016) - [i14]Limin Wang, Sheng Guo, Weilin Huang, Yuanjun Xiong, Yu Qiao:
Knowledge Guided Disambiguation for Large-Scale Scene Classification with Multi-Resolution CNNs. CoRR abs/1610.01119 (2016) - [i13]Xiao Zhang, Zhiyuan Fang, Yandong Wen, Zhifeng Li, Yu Qiao:
Range Loss for Deep Face Recognition with Long-tail. CoRR abs/1611.08976 (2016) - 2015
- [j17]Lei Zhou, Yijun Li, Rocky Zhou, Yu Qiao, Jie Yang, Yonghui Gao:
On feature-specific parameter learning in conditional random field-based approach for interactive object segmentation. J. Electronic Imaging 24(2): 023012 (2015) - [j16]Yongqiang Gao, Weilin Huang, Yu Qiao:
Local Multi-Grouped Binary Descriptor With Ring-Based Pooling Configuration and Optimization. IEEE Trans. Image Process. 24(12): 4820-4833 (2015) - [c72]Zhe Wang, Limin Wang, Wenbin Du, Yu Qiao:
Exploring Fisher vector and deep networks for action spotting. CVPR Workshops 2015: 10-14 - [c71]Limin Wang, Zhe Wang, Wenbin Du, Yu Qiao:
Object-Scene Convolutional Neural Networks for event recognition in images. CVPR Workshops 2015: 30-35 - [c70]Limin Wang, Yu Qiao, Xiaoou Tang:
Action recognition with trajectory-pooled deep-convolutional descriptors. CVPR 2015: 4305-4314 - [c69]Limin Wang, Zhe Wang, Sheng Guo, Yu Qiao:
Better Exploiting OS-CNNs for Better Event Recognition in Images. ICCV Workshops 2015: 287-294 - [c68]Xixuan Wu, Yu Qiao, Xiaoou Tang:
MIL: Music Exploration and Visualization via Lyric and Image. ACM Multimedia 2015: 1011-1014 - [c67]Ximei Zhu, Ying Li, Yu Qiao:
Fast single image dehazing through Edge-Guided Interpolated Filter. MVA 2015: 443-446 - [c66]Feiyun Zhang, Xiao Xu, Yu Qiao:
Deep classification of vehicle makers and models: The effectiveness of pre-training and data enhancement. ROBIO 2015: 231-236 - [c65]Xiang Chen, Yu Qiao:
Road segmentation via iterative deep analysis. ROBIO 2015: 2640-2645 - [i12]Limin Wang, Zhe Wang, Wenbin Du, Yu Qiao:
Object-Scene Convolutional Neural Networks for Event Recognition in Images. CoRR abs/1505.00296 (2015) - [i11]Limin Wang, Yu Qiao, Xiaoou Tang:
Action Recognition with Trajectory-Pooled Deep-Convolutional Descriptors. CoRR abs/1505.04868 (2015) - [i10]Chao Dong, Ximei Zhu, Yubin Deng, Chen Change Loy, Yu Qiao:
Boosting Optical Character Recognition: A Super-Resolution Approach. CoRR abs/1506.02211 (2015) - [i9]Pan He, Weilin Huang, Yu Qiao, Chen Change Loy, Xiaoou Tang:
Reading Scene Text in Deep Convolutional Sequences. CoRR abs/1506.04395 (2015) - [i8]Limin Wang, Yuanjun Xiong, Zhe Wang, Yu Qiao:
Towards Good Practices for Very Deep Two-Stream ConvNets. CoRR abs/1507.02159 (2015) - [i7]Sheng Guo, Weilin Huang, Yu Qiao:
Local Color Contrastive Descriptor for Image Classification. CoRR abs/1508.00307 (2015) - [i6]Limin Wang, Sheng Guo, Weilin Huang, Yu Qiao:
Places205-VGGNet Models for Scene Recognition. CoRR abs/1508.01667 (2015) - [i5]Yongqiang Gao, Weilin Huang, Yu Qiao:
Local Multi-Grouped Binary Descriptor with Ring-based Pooling Configuration and Optimization. CoRR abs/1509.06557 (2015) - [i4]Tong He, Weilin Huang, Yu Qiao, Jian Yao:
Text-Attentional Convolutional Neural Networks for Scene Text Detection. CoRR abs/1510.03283 (2015) - [i3]Limin Wang, Zhe Wang, Sheng Guo, Yu Qiao:
Better Exploiting OS-CNNs for Better Event Recognition in Images. CoRR abs/1510.03979 (2015) - 2014
- [j15]Xiaojiang Peng, Yu Qiao, Qiang Peng:
Motion boundary based sampling and 3D co-occurrence descriptors for action recognition. Image Vis. Comput. 32(9): 616-628 (2014) - [j14]Xianbiao Qi, Rong Xiao, Chun-Guang Li, Yu Qiao, Jun Guo, Xiaoou Tang:
Pairwise Rotation Invariant Co-Occurrence Local Binary Pattern. IEEE Trans. Pattern Anal. Mach. Intell. 36(11): 2199-2213 (2014) - [j13]Lei Zhou, Keren Fu, Yijun Li, Yu Qiao, Xiangjian He, Jie Yang:
Bayesian salient object detection based on saliency driven clustering. Signal Process. Image Commun. 29(3): 434-447 (2014) - [j12]Xiaojiang Peng, Yu Qiao, Qiang Peng, Qiong-Hua Wang:
Large Margin Dimensionality Reduction for Action Similarity Labeling. IEEE Signal Process. Lett. 21(8): 1022-1025 (2014) - [j11]Limin Wang, Yu Qiao, Xiaoou Tang:
Latent Hierarchical Model of Temporal Structure for Complex Activity Classification. IEEE Trans. Image Process. 23(2): 810-822 (2014) - [j10]Zhifeng Li, Dihong Gong, Yu Qiao, Dacheng Tao:
Common Feature Discriminant Analysis for Matching Infrared Face Images to Optical Face Images. IEEE Trans. Image Process. 23(6): 2436-2445 (2014) - [c64]Zhuowei Cai, Limin Wang, Xiaojiang Peng, Yu Qiao:
Multi-view Super Vector for Action Recognition. CVPR 2014: 596-603 - [c63]Weilin Huang, Yu Qiao, Xiaoou Tang:
Robust Scene Text Detection with Convolution Neural Network Induced MSER Trees. ECCV (4) 2014: 497-511 - [c62]Xiaojiang Peng, Limin Wang, Zhuowei Cai, Yu Qiao:
Action and Gesture Temporal Spotting with Super Vector Representation. ECCV Workshops (1) 2014: 518-527 - [c61]Limin Wang, Yu Qiao, Xiaoou Tang:
Video Action Detection with Relational Dynamic-Poselets. ECCV (5) 2014: 565-580 - [c60]Xiaojiang Peng, Changqing Zou, Yu Qiao, Qiang Peng:
Action Recognition with Stacked Fisher Vectors. ECCV (5) 2014: 581-595 - [c59]Xiaojiang Peng, Limin Wang, Yu Qiao, Qiang Peng:
Boosting VLAD with Supervised Dictionary Learning and High-Order Statistics. ECCV (3) 2014: 660-674 - [c58]Yijun Li, Keren Fu, Lei Zhou, Yu Qiao, Jie Yang, Bai Li:
Saliency detection based on extended boundary prior with foci of attention. ICASSP 2014: 2798-2802 - [c57]Lei Zhou, Yijun Li, Yi Peng Song, Yu Qiao, Jie Yang:
Saliency driven clustering for salient object detection. ICASSP 2014: 5372-5376 - [c56]Yijun Li, Keren Fu, Lei Zhou, Yu Qiao, Jie Yang:
Saliency detection via foreground rendering and background exclusion. ICIP 2014: 3263-3267 - [c55]Xiaojiang Peng, Limin Wang, Yu Qiao, Qiang Peng:
A Joint Evaluation of Dictionary Learning and Feature Encoding for Action Recognition. ICPR 2014: 2607-2612 - [c54]Qiaozhe Li, Yu Qiao, Jie Yang:
Robust visual tracking based on local kernelized representation. ROBIO 2014: 2523-2528 - [i2]Xiaojiang Peng, Limin Wang, Xingxing Wang, Yu Qiao:
Bag of Visual Words and Fusion Methods for Action Recognition: Comprehensive Study and Good Practice. CoRR abs/1405.4506 (2014) - 2013
- [j9]Yu Qiao, Dean Luo, Nobuaki Minematsu:
Unsupervised optimal phoneme segmentation: theory and experimental evaluation. IET Signal Process. 7(7): 577-586 (2013) - [j8]Keren Fu, Chen Gong, Yu Qiao, Jie Yang, Irene Yu-Hua Gu:
One-class support vector machine-assisted robust tracking. J. Electronic Imaging 22(2): 023002 (2013) - [c53]Xiaojiang Peng, Yu Qiao, Qiang Peng, Xianbiao Qi:
Exploring Motion Boundary based Sampling and Spatial-Temporal Context Descriptors for Action Recognition. BMVC 2013 - [c52]Xianbiao Qi, Yu Qiao, Chun-Guang Li, Jun Guo:
Multi-scale Joint Encoding of Local Binary Patterns for Texture and Material Classification. BMVC 2013 - [c51]Xianbiao Qi, Yu Qiao, Chun-Guang Li, Jun Guo:
Exploring Cross-Channel Texture Correlation for Color Texture Classification. BMVC 2013 - [c50]Limin Wang, Yu Qiao, Xiaoou Tang:
Motionlets: Mid-level 3D Parts for Human Motion Recognition. CVPR 2013: 2674-2681 - [c49]Limin Wang, Yu Qiao, Xiaoou Tang:
Mining Motion Atoms and Phrases for Complex Action Recognition. ICCV 2013: 2680-2687 - [c48]Xiaojiang Peng, Xiao Wu, Qiang Peng, Xianbiao Qi, Yu Qiao, Yanhua Liu:
Exploring dense trajectory feature and encoding methods for human interaction recognition. ICIMCS 2013: 23-27 - [c47]Dihong Gong, Kai Zhu, Zhifeng Li, Yu Qiao:
A semantic model for video based face recognition. ICIA 2013: 1369-1374 - [c46]Yongqiang Gao, Yu Qiao, Zhifeng Li, Chunjing Xu:
LTD: Local Ternary Descriptor for image matching. ICIA 2013: 1375-1380 - [c45]Lei Zhou, Yu Qiao, Jie Yang, Yonghui Gao:
An active contour model based on multiple boundary measures. ICIP 2013: 524-528 - [c44]Peng-peng Zhang, Shengzheng Wang, Yu Qiao, Jie Yang, Yonghui Gao:
Affine SoftAssign with bidirectional distance for point matching. ICIP 2013: 1267-1271 - [c43]Lei Zhou, Chen Gong, Yijun Li, Yu Qiao, Jie Yang, Nikola K. Kasabov:
Salient Object Segmentation Based on Automatic Labeling. ICONIP (3) 2013: 584-591 - [c42]Dihong Gong, Zhifeng Li, Jianzhuang Liu, Yu Qiao:
Multi-feature canonical correlation analysis for face photo-sketch image retrieval. ACM Multimedia 2013: 617-620 - [i1]Xiaojiang Peng, Qiang Peng, Yu Qiao, Junzhou Chen, Mehtab Afzal:
A Study on Unsupervised Dictionary Learning and Feature Encoding for Action Classification. CoRR abs/1309.0309 (2013) - 2012
- [c41]Xingxing Wang, Limin Wang, Yu Qiao:
A Comparative Study of Encoding, Pooling and Normalization Methods for Action Recognition. ACCV (3) 2012: 572-585 - [c40]Keren Fu, Chen Gong, Yu Qiao, Jie Yang, Irene Guy:
One-Class SVM assisted accurate tracking. ICDSC 2012: 1-6 - [c39]Qiao Huang, Jie Yang, Yu Qiao:
Person re-identification across multi-camera system based on local descriptors. ICDSC 2012: 1-6 - [c38]Lei Zhou, Yu Qiao, Jie Yang, Xiangjian He:
Learning geodesic CRF model for image segmentation. ICIP 2012: 1565-1568 - [c37]Na Li, Yu Qiao:
Bayesian Mixture of Probabilistic Linear Regressions for Voice Conversion. INTERSPEECH 2012: 82-85 - [c36]Na Li, Yu Qiao:
Voice conversion using Bayesian mixture of Probabilistic Linear Regressions and dynamic kernel features. ISCSLP 2012: 69-73 - [c35]Xixuan Wu, Yu Qiao, Xiaogang Wang, Xiaoou Tang:
Cross matching of music and image. ACM Multimedia 2012: 837-840 - [c34]Xixuan Wu, Bing Xu, Yu Qiao, Xiaoou Tang:
Automatic music video generation: cross matching of music and image. ACM Multimedia 2012: 1381-1382 - 2011
- [j7]Dean Luo, Yu Qiao, Nobuaki Minematsu, Keikichi Hirose:
Regularized Maximum Likelihood Linear Regression Adaptation for Computer-Assisted Language Learning Systems. IEICE Trans. Inf. Syst. 94-D(2): 308-316 (2011) - [c33]Yu Qiao, Masayuki Suzuki, Nobuaki Minematsu, Keikichi Hirose:
Structure-constrained distribution matching using quadratic programming and its application to pronunciation evaluation. ACPR 2011: 350-354 - [c32]Nesma Houmani, Sonia Garcia-Salicetti, Bernadette Dorizzi, Jugurta Montalvão, Jânio Coutinho Canuto, Marcus V. A. Andrade, Yu Qiao, Xingxing Wang, Tobias Scheidat, Andrey Makrushin, Daigo Muramatsu, Joanna Putz-Leszczynska, Michal Kudelski, Marcos Faúndez-Zanuy, Juan Manuel Pascual-Gaspar, Valentín Cardeñoso-Payo, Carlos Vivaracho-Pascual, Enrique Argones-Rúa, José Luis Alba-Castro, Alisher Kholmatov, Berrin A. Yanikoglu:
BioSecure Signature Evaluation Campaign (ESRA'2011): evaluating systems on quality-based categories of skilled forgeries. IJCB 2011: 1-10 - [c31]Qiang Wang, Qingqing Chang, Yu Qiao, Yuyuan Zhu, Gang Huang, Jie Yang:
Knowledge-Based Segmentation of Spine and Ribs from Bone Scintigraphy. ICONIP (1) 2011: 241-248 - [c30]Yu Qiao, Jie Yang:
Adaptive Region Growing Based on Boundary Measures. ICONIP (1) 2011: 249-256 - [c29]Qingqing Chang, Qiang Wang, Yu Qiao, Yuyuan Zhu, Gang Huang, Jie Yang:
Adaptive Detection of Hotspots in Thoracic Spine from Bone Scintigraphy. ICONIP (1) 2011: 257-264 - [c28]Yu Qiao, Tong Tong, Nobuaki Minematsu:
A Study on Bag of Gaussian Model with Application to Voice Conversion. INTERSPEECH 2011: 657-660 - [c27]Aki Kunikoshi, Yu Qiao, Daisuke Saito, Nobuaki Minematsu, Keikichi Hirose:
Gesture Design of Hand-to-Speech Converter Derived from Speech-to-Hand Converter Based on Probabilistic Integration Model. INTERSPEECH 2011: 3025-3028 - 2010
- [j6]Baochang Zhang, Yu Qiao:
Face recognition based on gradient gabor feature and Efficient Kernel Fisher analysis. Neural Comput. Appl. 19(4): 617-623 (2010) - [j5]Nobuaki Minematsu, Satoshi Asakawa, Masayuki Suzuki, Yu Qiao:
Speech Structure and Its Application to Robust Speech Processing. New Gener. Comput. 28(3): 299-319 (2010) - [j4]Yu Qiao, Nobuaki Minematsu:
A study on invariance of f-divergence and its application to speech recognition. IEEE Trans. Signal Process. 58(7): 3884-3890 (2010) - [c26]Yu Qiao, Daisuke Saito, Nobuaki Minematsu:
HMM-based sequence-to-frame mapping for voice conversion. ICASSP 2010: 4830-4833 - [c25]Masayuki Suzuki, Yu Qiao, Nobuaki Minematsu, Keikichi Hirose:
Integration of multilayer regression analysis with structure-based pronunciation assessment. INTERSPEECH 2010: 586-589 - [c24]Dean Luo, Yu Qiao, Nobuaki Minematsu, Yutaka Yamauchi, Keikichi Hirose:
Regularized-MLLR speaker adaptation for computer-assisted language learning system. INTERSPEECH 2010: 594-597 - [c23]Xuebin Ma, Ruiyuan Xu, Nobuaki Minematsu, Yu Qiao, Keikichi Hirose, Aijun Li:
Dialect-based speaker classification using speaker-invariant dialect features. ISCSLP 2010: 171-176
2000 – 2009
- 2009
- [j3]Yu Qiao, Wei Wang, Nobuaki Minematsu, Jianzhuang Liu, Mitsou Takeda, Xiaoou Tang:
A Theory of Phase Singularities for Image Representation and its Applications to Object Tracking and Image Matching. IEEE Trans. Image Process. 18(10): 2153-2166 (2009) - [c22]Yu Qiao, Masayuki Suzuki, Nobuaki Minematsu:
A study on Hidden Structural Model and its application to labeling sequences. ASRU 2009: 118-123 - [c21]Kun Yang, Jingwei Ye, Zhijun Li, Yu Qiao:
Free hand sketch understanding using SVMs-chain modeling for spatial and temporal patterns. GEC Summit 2009: 1029-1032 - [c20]Yu Qiao, Nobuaki Minematsu:
Mixture of Probabilistic Linear Regressions: A unified view of GMM-based mapping techiques. ICASSP 2009: 3913-3916 - [c19]Yu Qiao, Masayuki Suzuki, Nobuaki Minematsu:
Affine invariant features and their application to speech recognition. ICASSP 2009: 4629-4632 - [c18]Aki Kunikoshi, Yu Qiao, Nobuaki Minematsu, Keikichi Hirose:
Speech generation from hand gestures based on space mapping. INTERSPEECH 2009: 308-311 - [c17]Dean Luo, Yu Qiao, Nobuaki Minematsu, Yutaka Yamauchi, Keikichi Hirose:
Analysis and utilization of MLLR speaker adaptation technique for learners' pronunciation evaluation. INTERSPEECH 2009: 608-611 - [c16]Daisuke Saito, Yu Qiao, Nobuaki Minematsu, Keikichi Hirose:
Optimal event search using a structural cost function - improvement of structure to speech conversion. INTERSPEECH 2009: 2047-2050 - [c15]Xuebin Ma, Akira Nemoto, Nobuaki Minematsu, Yu Qiao, Keikichi Hirose:
Structural analysis of dialects, sub-dialects and sub-sub-dialects of Chinese. INTERSPEECH 2009: 2219-2222 - [c14]Yu Qiao, Nobuaki Minematsu, Keikichi Hirose:
On invariant structural representation for speech recognition: theoretical validation and experimental improvement. INTERSPEECH 2009: 3055-3058 - 2008
- [c13]Yu Qiao, Wei Wang, Nobuaki Minematsu, Jianzhuang Liu, Xiaoou Tang:
Phase singularities for image representation and matching. ICASSP 2008: 885-888 - [c12]Yu Qiao, Naoya Shimomura, Nobuaki Minematsu:
Unsupervised optimal phoneme segmentation: Objectives, algorithm and comparisons. ICASSP 2008: 3989-3992 - [c11]Baochang Zhang, Yongsheng Gao, Yu Qiao:
Face recognition based on Gradient Gabor feature. ICIP 2008: 1904-1907 - [c10]Yu Qiao, Nobuaki Minematsu:
Metric learning for unsupervised phoneme segmentation. INTERSPEECH 2008: 1060-1063 - [c9]Yu Qiao, Nobuaki Minematsu:
f-divergence is a generalized invariant measure between distributions. INTERSPEECH 2008: 1349-1352 - 2007
- [j2]Yu Qiao, Makoto Yasuhara:
Optimal Euler Circuit of Maximum Contiguous Cost. IEICE Trans. Fundam. Electron. Commun. Comput. Sci. 90-A(1): 274-280 (2007) - [c8]Yu Qiao, Satoshi Asakawa, Nobuaki Minematsu:
Random discriminant structure analysis for automatic recognition of connected vowels. ASRU 2007: 576-581 - [c7]Yu Qiao, Jianzhuang Liu, Xiaoou Tang:
Offline Signature Verification Using Online Handwriting Registration. CVPR 2007 - 2006
- [j1]Yu Qiao, Mikihiko Nishiara, Makoto Yasuhara:
A Framework Toward Restoration of Writing Order from Single-Stroked Handwriting Image. IEEE Trans. Pattern Anal. Mach. Intell. 28(11): 1724-1737 (2006) - [c6]Yu Qiao, Makoto Yasuhara:
Recovering Drawing Order from Offline Handwritten Image Using Direction Context and Optimal Euler Path. ICASSP (2) 2006: 765-768 - [c5]Yu Qiao, Makoto Yasuhara:
Affine Invariant Dynamic Time Warping and its Application to Online Rotated Handwriting Recognition. ICPR (2) 2006: 905-908 - [c4]Yu Qiao, Makoto Yasuhara:
Recover Writing Trajectory from Multiple Stroked Image Using Bidirectional Dynamic Search. ICPR (2) 2006: 970-973 - 2005
- [c3]Yu Qiao, Makoto Yasuhara:
A Novel Approach to Recover Writing Order From Single Stroke Offline Handwritten Images. ICDAR 2005: 227-231 - 2004
- [c2]Yu Qiao, Makoto Yasuhara:
Recovering dynamic information from static handwritten images. IWFHR 2004: 118-123 - 2003
- [c1]Xin Zhou, Xiyue Huang, Chuanjin Liao, Yu Qiao:
Vehicle Detection on Highway Based on Direction-Fractal Dimension. WAA 2003: 986-991
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-12-17 21:54 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint