default search action
Limin Wang 0002
Person information
- affiliation: Nanjing University, State Key Laboratory for Novel Software Technology, China
- affiliation (former): ETH Zurich, Computer Vision Laboratory, Switzerland
- affiliation (former): Chinese University of Hong Kong, Department of Information Engineeing, China
- affiliation (former): Chinese Academy of Sciences, Shenzhen Institutes of Advanced Technology, China
- unicode name: 王利民
Other persons with the same name
- Limin Wang — disambiguation page
- Limin Wang 0001 — London School of Economics, UK
- Limin Wang 0003 — Hainan Normal University, School of Mathematics and Statistics, Haikou, China
- Limin Wang 0004 — Chongqing Jiaotong University, School of Economic and Management, China
- Limin Wang 0005 — Ministry of Agriculture, Key Laboratory of Agri-informatics, Beijing, China (and 1 more)
- Limin Wang 0006 — Chinese Academy of Sciences, State Key Laboratory of Multiphase Complex Systems, Beijing, China
- Limin Wang 0007 — Jilin University, Changchun, China
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j33]Fengyuan Shi, Weilin Huang, Limin Wang:
End-to-end dense video grounding via parallel regression. Comput. Vis. Image Underst. 242: 103980 (2024) - [j32]Jun Tu, Gangshan Wu, Limin Wang:
Dual Graph Networks for Pose Estimation in Crowded Scenes. Int. J. Comput. Vis. 132(3): 633-653 (2024) - [j31]Liang Zhao, Yao Teng, Limin Wang:
Logit Normalization for Long-Tail Object Detection. Int. J. Comput. Vis. 132(6): 2114-2134 (2024) - [j30]Jintao Lin, Zhaoyang Liu, Wenhai Wang, Wayne Wu, Limin Wang:
VLG: General Video Recognition with Web Textual Knowledge. Int. J. Comput. Vis. 132(10): 4792-4817 (2024) - [j29]Fengyuan Shi, Ruopeng Gao, Weilin Huang, Limin Wang:
Dynamic MDETR: A Dynamic Multimodal Transformer Decoder for Visual Grounding. IEEE Trans. Pattern Anal. Mach. Intell. 46(2): 1181-1198 (2024) - [j28]Haisong Liu, Tao Lu, Yihui Xu, Jia Liu, Limin Wang:
Learning Optical Flow and Scene Flow With Bidirectional Camera-LiDAR Fusion. IEEE Trans. Pattern Anal. Mach. Intell. 46(4): 2378-2395 (2024) - [j27]Yutao Cui, Cheng Jiang, Gangshan Wu, Limin Wang:
MixFormer: End-to-End Tracking With Iterative Mixed Attention. IEEE Trans. Pattern Anal. Mach. Intell. 46(6): 4129-4146 (2024) - [j26]Tao Wu, Mengqi Cao, Ziteng Gao, Gangshan Wu, Limin Wang:
STMixer: A One-Stage Sparse Action Detector. IEEE Trans. Pattern Anal. Mach. Intell. 46(10): 6842-6857 (2024) - [j25]Yixuan Li, Zhenzhi Wang, Zhifeng Li, Limin Wang:
Sparse Action Tube Detection. IEEE Trans. Image Process. 33: 1740-1752 (2024) - [j24]Yuer Ma, Yi Liu, Limin Wang, Wenxiong Kang, Yu Qiao, Yali Wang:
Dual Masked Modeling for Weakly-Supervised Temporal Boundary Discovery. IEEE Trans. Multim. 26: 5694-5704 (2024) - [c109]Fengyuan Shi, Jiaxi Gu, Hang Xu, Songcen Xu, Wei Zhang, Limin Wang:
BIVDiff: A Training-Free Framework for General-Purpose Video Synthesis via Bridging Image and Video Diffusion Models. CVPR 2024: 7393-7402 - [c108]Zhiyu Zhao, Bingkun Huang, Sen Xing, Gangshan Wu, Yu Qiao, Limin Wang:
Asymmetric Masked Distillation for Pre-Training Small Foundation Models. CVPR 2024: 18516-18526 - [c107]Tao Wu, Runyu He, Gangshan Wu, Limin Wang:
SportsHHI: A Dataset for Human-Human Interaction Detection in Sports Videos. CVPR 2024: 18537-18546 - [c106]Yuhan Zhu, Guozhen Zhang, Jing Tan, Gangshan Wu, Limin Wang:
Dual DETRs for Multi-Label Temporal Action Detection. CVPR 2024: 18559-18569 - [c105]Min Yang, Huan Gao, Ping Guo, Limin Wang:
Adapting Short-Term Transformers for Action Detection in Untrimmed Videos. CVPR 2024: 18570-18579 - [c104]Chunxu Liu, Guozhen Zhang, Rui Zhao, Limin Wang:
Sparse Global Matching for Video Frame Interpolation with Large Motion. CVPR 2024: 19125-19134 - [c103]Tao Lu, Mulin Yu, Linning Xu, Yuanbo Xiangli, Limin Wang, Dahua Lin, Bo Dai:
Scaffold-GS: Structured 3D Gaussians for View-Adaptive Rendering. CVPR 2024: 20654-20664 - [c102]Ziqi Huang, Yinan He, Jiashuo Yu, Fan Zhang, Chenyang Si, Yuming Jiang, Yuanhan Zhang, Tianxing Wu, Qingyang Jin, Nattapol Chanpaisit, Yaohui Wang, Xinyuan Chen, Limin Wang, Dahua Lin, Yu Qiao, Ziwei Liu:
VBench: Comprehensive Benchmark Suite for Video Generative Models. CVPR 2024: 21807-21818 - [c101]Yifei Huang, Guo Chen, Jilan Xu, Mingfang Zhang, Lijin Yang, Baoqi Pei, Hongjie Zhang, Lu Dong, Yali Wang, Limin Wang, Yu Qiao:
EgoExoLearn: A Dataset for Bridging Asynchronous Ego- and Exo-centric View of Procedural Activities in Real World. CVPR 2024: 22072-22086 - [c100]Kunchang Li, Yali Wang, Yinan He, Yizhuo Li, Yi Wang, Yi Liu, Zun Wang, Jilan Xu, Guo Chen, Ping Lou, Limin Wang, Yu Qiao:
MVBench: A Comprehensive Multi-modal Video Understanding Benchmark. CVPR 2024: 22195-22206 - [c99]Haisong Liu, Yang Chen, Haiguang Wang, Zetong Yang, Tianyu Li, Jia Zeng, Li Chen, Hongyang Li, Limin Wang:
Fully Sparse 3D Occupancy Prediction. ECCV (25) 2024: 54-71 - [c98]Kunchang Li, Xinhao Li, Yi Wang, Yinan He, Yali Wang, Limin Wang, Yu Qiao:
VideoMamba: State Space Model for Efficient Video Understanding. ECCV (26) 2024: 237-255 - [c97]Chen Xu, Tianhui Song, Weixin Feng, Xubin Li, Tiezheng Ge, Bo Zheng, Limin Wang:
Accelerating Image Generation with Sub-path Linear Approximation Model. ECCV (53) 2024: 323-339 - [c96]Yutao Cui, Xiaotong Zhao, Guozhen Zhang, Shengming Cao, Kai Ma, Limin Wang:
StableDrag: Stable Dragging for Point-Based Image Editing. ECCV (58) 2024: 340-356 - [c95]Yi Wang, Kunchang Li, Xinhao Li, Jiashuo Yu, Yinan He, Guo Chen, Baoqi Pei, Rongkun Zheng, Zun Wang, Yansong Shi, Tianxiang Jiang, Songze Li, Jilan Xu, Hongjie Zhang, Yifei Huang, Yu Qiao, Yali Wang, Limin Wang:
InternVideo2: Scaling Foundation Models for Multimodal Video Understanding. ECCV (85) 2024: 396-416 - [c94]Xinhao Li, Yuhan Zhu, Limin Wang:
ZeroI2V: Zero-Cost Adaptation of Pre-trained Transformers from Image to Video. ECCV (83) 2024: 425-443 - [c93]Ziteng Gao, Zhan Tong, Limin Wang, Mike Zheng Shou:
SparseFormer: Sparse Visual Recognition via Limited Latent Tokens. ICLR 2024 - [c92]Yi Wang, Yinan He, Yizhuo Li, Kunchang Li, Jiashuo Yu, Xin Ma, Xinhao Li, Guo Chen, Xinyuan Chen, Yaohui Wang, Ping Luo, Ziwei Liu, Yali Wang, Limin Wang, Yu Qiao:
InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation. ICLR 2024 - [i136]Chaochao Lu, Chen Qian, Guodong Zheng, Hongxing Fan, Hongzhi Gao, Jie Zhang, Jing Shao, Jingyi Deng, Jinlan Fu, Kexin Huang, Kunchang Li, Lijun Li, Limin Wang, Lu Sheng, Meiqi Chen, Ming Zhang, Qibing Ren, Sirui Chen, Tao Gui, Wanli Ouyang, Yali Wang, Yan Teng, Yaru Wang, Yi Wang, Yinan He, Yingchun Wang, Yixu Wang, Yongting Zhang, Yu Qiao, Yujiong Shen, Yurong Mou, Yuxi Chen, Zaibin Zhang, Zhelun Shi, Zhenfei Yin, Zhipin Wang:
From GPT-4 to Gemini and Beyond: Assessing the Landscape of MLLMs on Generalizability, Trustworthiness and Causality through Four Modalities. CoRR abs/2401.15071 (2024) - [i135]Yutao Cui, Xiaotong Zhao, Guozhen Zhang, Shengming Cao, Kai Ma, Limin Wang:
StableDrag: Stable Dragging for Point-based Image Editing. CoRR abs/2403.04437 (2024) - [i134]Jiange Yang, Bei Liu, Jianlong Fu, Bocheng Pan, Gangshan Wu, Limin Wang:
Spatiotemporal Predictive Pre-training for Robotic Motor Control. CoRR abs/2403.05304 (2024) - [i133]Kunchang Li, Xinhao Li, Yi Wang, Yinan He, Yali Wang, Limin Wang, Yu Qiao:
VideoMamba: State Space Model for Efficient Video Understanding. CoRR abs/2403.06977 (2024) - [i132]Guo Chen, Yifei Huang, Jilan Xu, Baoqi Pei, Zhe Chen, Zhiqi Li, Jiahao Wang, Kunchang Li, Tong Lu, Limin Wang:
Video Mamba Suite: State Space Model as a Versatile Alternative for Video Understanding. CoRR abs/2403.09626 (2024) - [i131]Yi Wang, Kunchang Li, Xinhao Li, Jiashuo Yu, Yinan He, Guo Chen, Baoqi Pei, Rongkun Zheng, Jilan Xu, Zun Wang, Yansong Shi, Tianxiang Jiang, Songze Li, Hongjie Zhang, Yifei Huang, Yu Qiao, Yali Wang, Limin Wang:
InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding. CoRR abs/2403.15377 (2024) - [i130]Yifei Huang, Guo Chen, Jilan Xu, Mingfang Zhang, Lijin Yang, Baoqi Pei, Hongjie Zhang, Lu Dong, Yali Wang, Limin Wang, Yu Qiao:
EgoExoLearn: A Dataset for Bridging Asynchronous Ego- and Exo-centric View of Procedural Activities in Real World. CoRR abs/2403.16182 (2024) - [i129]Ruopeng Gao, Yijun Zhang, Limin Wang:
Multiple Object Tracking as ID Prediction. CoRR abs/2403.16848 (2024) - [i128]Yuhan Zhu, Guozhen Zhang, Jing Tan, Gangshan Wu, Limin Wang:
Dual DETRs for Multi-Label Temporal Action Detection. CoRR abs/2404.00653 (2024) - [i127]Tao Wu, Runyu He, Gangshan Wu, Limin Wang:
SportsHHI: A Dataset for Human-Human Interaction Detection in Sports Videos. CoRR abs/2404.04565 (2024) - [i126]Tao Wu, Mengqi Cao, Ziteng Gao, Gangshan Wu, Limin Wang:
STMixer: A One-Stage Sparse Action Detector. CoRR abs/2404.09842 (2024) - [i125]Chen Xu, Tianhui Song, Weixin Feng, Xubin Li, Tiezheng Ge, Bo Zheng, Limin Wang:
Accelerating Image Generation with Sub-path Linear Approximation Model. CoRR abs/2404.13903 (2024) - [i124]Tao Wu, Shuqiu Ge, Jie Qin, Gangshan Wu, Limin Wang:
Open-Vocabulary Spatio-Temporal Action Detection. CoRR abs/2405.10832 (2024) - [i123]Qingyun Li, Zhe Chen, Weiyun Wang, Wenhai Wang, Shenglong Ye, Zhenjiang Jin, Guanzhou Chen, Yinan He, Zhangwei Gao, Erfei Cui, Jiashuo Yu, Hao Tian, Jiasheng Zhou, Chao Xu, Bin Wang, Xingjian Wei, Wei Li, Wenjian Zhang, Bo Zhang, Pinlong Cai, Licheng Wen, Xiangchao Yan, Zhenxiang Li, Pei Chu, Yi Wang, Min Dou, Changyao Tian, Xizhou Zhu, Lewei Lu, Yushi Chen, Junjun He, Zhongying Tu, Tong Lu, Yali Wang, Limin Wang, Dahua Lin, Yu Qiao, Botian Shi, Conghui He, Jifeng Dai:
OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text. CoRR abs/2406.08418 (2024) - [i122]Baoqi Pei, Guo Chen, Jilan Xu, Yuping He, Yicheng Liu, Kanghua Pan, Yifei Huang, Yali Wang, Tong Lu, Limin Wang, Yu Qiao:
EgoVideo: Exploring Egocentric Foundation Model and Downstream Adaptation. CoRR abs/2406.18070 (2024) - [i121]Guozhen Zhang, Chunxu Liu, Yutao Cui, Xiaotong Zhao, Kai Ma, Limin Wang:
VFIMamba: Video Frame Interpolation with State Space Models. CoRR abs/2407.02315 (2024) - [i120]Xinhao Li, Zhenpeng Huang, Jing Wang, Kunchang Li, Limin Wang:
VideoEval: Comprehensive Benchmark Suite for Low-Cost Evaluation of Video Foundation Model. CoRR abs/2407.06491 (2024) - [i119]Yisen Wang, Yao Teng, Limin Wang:
CycleHOI: Improving Human-Object Interaction Detection with Cycle Consistency of Detection and Generation. CoRR abs/2407.11433 (2024) - [i118]Yuhan Zhu, Guozhen Zhang, Chen Xu, Haocheng Shen, Xiaoxin Chen, Gangshan Wu, Limin Wang:
Efficient Test-Time Prompt Tuning for Vision-Language Models. CoRR abs/2408.05775 (2024) - [i117]Guozhen Zhang, Jingyu Liu, Shengming Cao, Xiaotong Zhao, Kevin Zhao, Kai Ma, Limin Wang:
Dynamic and Compressive Adaptation of Transformers From Images to Videos. CoRR abs/2408.06840 (2024) - [i116]Xiangyu Zeng, Kunchang Li, Chenting Wang, Xinhao Li, Tianxiang Jiang, Ziang Yan, Songze Li, Yansong Shi, Zhengrong Yue, Yi Wang, Yali Wang, Yu Qiao, Limin Wang:
TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning. CoRR abs/2410.19702 (2024) - [i115]Shuai Wang, Zexian Li, Tianhui Song, Xubin Li, Tiezheng Ge, Bo Zheng, Limin Wang:
FlowDCN: Exploring DCN-like Architectures for Fast Image Generation with Arbitrary Resolution. CoRR abs/2410.22655 (2024) - 2023
- [j23]Min Yang, Guo Chen, Yin-Dong Zheng, Tong Lu, Limin Wang:
BasicTAD: An astounding RGB-Only baseline for temporal action detection. Comput. Vis. Image Underst. 232: 103692 (2023) - [j22]Zuxian Huang, Gangshan Wu, Limin Wang:
Webly-supervised semantic segmentation via curriculum learning. Comput. Vis. Image Underst. 236: 103810 (2023) - [j21]Ziteng Gao, Limin Wang, Gangshan Wu:
LIP: Local Importance-Based Pooling. Int. J. Comput. Vis. 131(1): 363-384 (2023) - [j20]Jing Tan, Yuhong Wang, Gangshan Wu, Limin Wang:
Temporal Perceiver: A General Architecture for Arbitrary Boundary Detection. IEEE Trans. Pattern Anal. Mach. Intell. 45(10): 12506-12520 (2023) - [j19]Yating Tian, Hongwen Zhang, Yebin Liu, Limin Wang:
Recovering 3D Human Mesh From Monocular Images: A Survey. IEEE Trans. Pattern Anal. Mach. Intell. 45(12): 15406-15425 (2023) - [j18]Tao Lu, Chunxu Liu, Youxin Chen, Gangshan Wu, Limin Wang:
APP-Net: Auxiliary-Point-Based Push and Pull Operations for Efficient Point Cloud Recognition. IEEE Trans. Image Process. 32: 6500-6513 (2023) - [c91]Jiange Yang, Sheng Guo, Gangshan Wu, Limin Wang:
CoMAE: Single Model Hybrid Pre-training on Small-Scale RGB-D Datasets. AAAI 2023: 3145-3154 - [c90]Tao Lu, Xiang Ding, Haisong Liu, Gangshan Wu, Limin Wang:
LinK: Linear Kernel for LiDAR-based 3D Perception. CVPR 2023: 1105-1115 - [c89]Guozhen Zhang, Yuhan Zhu, Haonan Wang, Youxin Chen, Gangshan Wu, Limin Wang:
Extracting Motion and Appearance via Inter-Frame Attention for Efficient Video Frame Interpolation. CVPR 2023: 5682-5692 - [c88]Limin Wang, Bingkun Huang, Zhiyu Zhao, Zhan Tong, Yinan He, Yi Wang, Yali Wang, Yu Qiao:
VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking. CVPR 2023: 14549-14560 - [c87]Tao Wu, Mengqi Cao, Ziteng Gao, Gangshan Wu, Limin Wang:
STMixer: A One-Stage Sparse Action Detector. CVPR 2023: 14720-14729 - [c86]Hanlin Wang, Yilu Wu, Sheng Guo, Limin Wang:
PDPP: Projected Diffusion for Procedure Planning in Instructional Videos. CVPR 2023: 14836-14845 - [c85]Miao Cheng, Limin Wang:
Graph Routes From Local and Global Entrances. ICBDT 2023: 314-318 - [c84]Kunchang Li, Yali Wang, Yinan He, Yizhuo Li, Yi Wang, Limin Wang, Yu Qiao:
UniFormerV2: Unlocking the Potential of Image ViTs for Video Understanding. ICCV 2023: 1632-1643 - [c83]Shuai Wang, Yao Teng, Limin Wang:
Deep Equilibrium Object Detection. ICCV 2023: 6273-6283 - [c82]Yao Teng, Haisong Liu, Sheng Guo, Limin Wang:
StageInteractor: Query-based Object Detector with Cross-stage Interaction. ICCV 2023: 6554-6565 - [c81]Ruopeng Gao, Limin Wang:
MeMOTR: Long-Term Memory-Augmented Transformer for Multi-Object Tracking. ICCV 2023: 9867-9876 - [c80]Yutao Cui, Chenkai Zeng, Xiaoyu Zhao, Yichun Yang, Gangshan Wu, Limin Wang:
SportsMOT: A Large Multi-Object Tracking Dataset in Multiple Sports Scenes. ICCV 2023: 9887-9897 - [c79]Lei Chen, Zhan Tong, Yibing Song, Gangshan Wu, Limin Wang:
Efficient Video Action Detection with Token Dropout and Context Refinement. ICCV 2023: 10354-10365 - [c78]Bingkun Huang, Zhiyu Zhao, Guozhen Zhang, Yu Qiao, Limin Wang:
MGMAE: Motion Guided Masking for Video Masked Autoencoding. ICCV 2023: 13447-13458 - [c77]Jiahao Wang, Guo Chen, Yifei Huang, Limin Wang, Tong Lu:
Memory-and-Anticipation Transformer for Online Action Understanding. ICCV 2023: 13778-13789 - [c76]Haisong Liu, Yao Teng, Tao Lu, Haiguang Wang, Limin Wang:
SparseBEV: High-Performance Sparse 3D Object Detection from Multi-Camera Videos. ICCV 2023: 18534-18544 - [c75]Kunchang Li, Yali Wang, Yizhuo Li, Yi Wang, Yinan He, Limin Wang, Yu Qiao:
Unmasked Teacher: Towards Training-Efficient Video Foundation Models. ICCV 2023: 19891-19903 - [c74]Matej Kristan, Jirí Matas, Martin Danelljan, Michael Felsberg, Hyung Jin Chang, Luka Cehovin Zajc, Alan Lukezic, Ondrej Drbohlav, Zhongqun Zhang, Khanh-Tung Tran, Xuan-Son Vu, Johanna Björklund, Christoph Mayer, Yushan Zhang, Lei Ke, Jie Zhao, Gustavo Fernández, Noor Al-Shakarji, Dong An, Michael Arens, Stefan Becker, Goutam Bhat, Sebastian Bullinger, Antoni B. Chan, Shijie Chang, Hanyuan Chen, Xin Chen, Yan Chen, Zhenyu Chen, Yangming Cheng, Yutao Cui, Chunyuan Deng, Jiahua Dong, Matteo Dunnhofer, Wei Feng, Jianlong Fu, Jie Gao, Ruize Han, Zeqi Hao, Jun-Yan He, Keji He, Zhenyu He, Xiantao Hu, Kaer Huang, Yuqing Huang, Yi Jiang, Ben Kang, Jin-Peng Lan, Hyungjun Lee, Chenyang Li, Jiahao Li, Ning Li, Wangkai Li, Xiaodi Li, Xin Li, Pengyu Liu, Yue Liu, Huchuan Lu, Bin Luo, Ping Luo, Yinchao Ma, Deshui Miao, Christian Micheloni, Kannappan Palaniappan, Hancheol Park, Matthieu Paul, Houwen Peng, Zekun Qian, Gani Rahmon, Norbert Scherer-Negenborn, Pengcheng Shao, Wooksu Shin, Elham Soltani Kazemi, Tianhui Song, Rainer Stiefelhagen, Rui Sun, Chuanming Tang, Zhangyong Tang, Imad Eddine Toubal, Jack Valmadre, Joost van de Weijer, Luc Van Gool, Jash Vira, Stéphane Vujasinovic, Cheng Wan, Jia Wan, Dong Wang, Fei Wang, Feifan Wang, He Wang, Limin Wang, Song Wang, Yaowei Wang, Zhepeng Wang, Gangshan Wu, Jiannan Wu, Qiangqiang Wu, Xiaojun Wu, Anqi Xiao, Jinxia Xie, Chenlong Xu, Min Xu, Tianyang Xu, Yuanyou Xu, Bin Yan, Dawei Yang, Ming-Hsuan Yang, Tianyu Yang, Yi Yang, Zongxin Yang, Xuanwu Yin, Fisher Yu, Hongyuan Yu, Qianjin Yu, Weichen Yu, Yongsheng Yuan, Zehuan Yuan, Jianlin Zhang, Lu Zhang, Tianzhu Zhang, Guodongfang Zhao, Shaochuan Zhao, Yaozong Zheng, Bineng Zhong, Jiawen Zhu, Xuefeng Zhu, Yueting Zhuang, ChengAo Zong, Kunlong Zuo:
The First Visual Object Tracking Segmentation VOTS2023 Challenge Results. ICCV (Workshops) 2023: 1788-1810 - [c73]Haoyue Cheng, Zhaoyang Liu, Wayne Wu, Limin Wang:
Filter-Recovery Network for Multi-Speaker Audio-Visual Speech Separation. ICLR 2023 - [c72]Yue Feng, Zhengye Zhang, Rong Quan, Limin Wang, Jie Qin:
RefineTAD: Learning Proposal-free Refinement for Temporal Action Detection. ACM Multimedia 2023: 135-143 - [c71]Hongjie Zhang, Yi Liu, Yali Wang, Limin Wang, Yu Qiao:
Learning Discriminative Feature Representation for Open Set Action Recognition. ACM Multimedia 2023: 7696-7705 - [c70]Yutao Cui, Tianhui Song, Gangshan Wu, Limin Wang:
MixFormerV2: Efficient Fully Transformer Tracking. NeurIPS 2023 - [c69]Keqiang Sun, Junting Pan, Yuying Ge, Hao Li, Haodong Duan, Xiaoshi Wu, Renrui Zhang, Aojun Zhou, Zipeng Qin, Yi Wang, Jifeng Dai, Yu Qiao, Limin Wang, Hongsheng Li:
JourneyDB: A Benchmark for Generative Image Understanding. NeurIPS 2023 - [i114]Yutao Cui, Cheng Jiang, Gangshan Wu, Limin Wang:
MixFormer: End-to-End Tracking with Iterative Mixed Attention. CoRR abs/2302.02814 (2023) - [i113]Jiange Yang, Sheng Guo, Gangshan Wu, Limin Wang:
CoMAE: Single Model Hybrid Pre-training on Small-Scale RGB-D Datasets. CoRR abs/2302.06148 (2023) - [i112]Guozhen Zhang, Yuhan Zhu, Haonan Wang, Youxin Chen, Gangshan Wu, Limin Wang:
Extracting Motion and Appearance via Inter-Frame Attention for Efficient Video Frame Interpolation. CoRR abs/2303.00440 (2023) - [i111]Haisong Liu, Tao Lu, Yihui Xu, Jia Liu, Limin Wang:
Learning Optical Flow and Scene Flow with Bidirectional Camera-LiDAR Fusion. CoRR abs/2303.12017 (2023) - [i110]Hanlin Wang, Yilu Wu, Sheng Guo, Limin Wang:
PDPP: Projected Diffusion for Procedure Planning in Instructional Videos. CoRR abs/2303.14676 (2023) - [i109]Tao Wu, Mengqi Cao, Ziteng Gao, Gangshan Wu, Limin Wang:
STMixer: A One-Stage Sparse Action Detector. CoRR abs/2303.15879 (2023) - [i108]Kunchang Li, Yali Wang, Yizhuo Li, Yi Wang, Yinan He, Limin Wang, Yu Qiao:
Unmasked Teacher: Towards Training-Efficient Video Foundation Models. CoRR abs/2303.16058 (2023) - [i107]Tao Lu, Xiang Ding, Haisong Liu, Gangshan Wu, Limin Wang:
LinK: Linear Kernel for LiDAR-based 3D Perception. CoRR abs/2303.16094 (2023) - [i106]Lei Chen, Zhan Tong, Yibing Song, Gangshan Wu, Limin Wang:
CycleACR: Cycle Modeling of Actor-Context Relations for Video Action Detection. CoRR abs/2303.16118 (2023) - [i105]Limin Wang, Bingkun Huang, Zhiyu Zhao, Zhan Tong, Yinan He, Yi Wang, Yali Wang, Yu Qiao:
VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking. CoRR abs/2303.16727 (2023) - [i104]Ziteng Gao, Zhan Tong, Limin Wang, Mike Zheng Shou:
SparseFormer: Sparse Visual Recognition via Limited Latent Tokens. CoRR abs/2304.03768 (2023) - [i103]Yao Teng, Haisong Liu, Sheng Guo, Limin Wang:
StageInteractor: Query-based Object Detector with Cross-stage Interaction. CoRR abs/2304.04978 (2023) - [i102]Yutao Cui, Chenkai Zeng, Xiaoyu Zhao, Yichun Yang, Gangshan Wu, Limin Wang:
SportsMOT: A Large Multi-Object Tracking Dataset in Multiple Sports Scenes. CoRR abs/2304.05170 (2023) - [i101]Chen Xu, Haocheng Shen, Fengyuan Shi, Boheng Chen, Yixuan Liao, Xiaoxin Chen, Limin Wang:
Progressive Visual Prompt Learning with Contrastive Feature Re-formation. CoRR abs/2304.08386 (2023) - [i100]Lei Chen, Zhan Tong, Yibing Song, Gangshan Wu, Limin Wang:
Efficient Video Action Detection with Token Dropout and Context Refinement. CoRR abs/2304.08451 (2023) - [i99]Zhaoyang Liu, Yinan He, Wenhai Wang, Weiyun Wang, Yi Wang, Shoufa Chen, Qinglong Zhang, Zeqiang Lai, Yang Yang, Qingyun Li, Jiashuo Yu, Kunchang Li, Zhe Chen, Xue Yang, Xizhou Zhu, Yali Wang, Limin Wang, Ping Luo, Jifeng Dai, Yu Qiao:
InternGPT: Solving Vision-Centric Tasks by Interacting with Chatbots Beyond Language. CoRR abs/2305.05662 (2023) - [i98]Kunchang Li, Yinan He, Yi Wang, Yizhuo Li, Wenhai Wang, Ping Luo, Yali Wang, Limin Wang, Yu Qiao:
VideoChat: Chat-Centric Video Understanding. CoRR abs/2305.06355 (2023) - [i97]Guo Chen, Yin-Dong Zheng, Jiahao Wang, Jilan Xu, Yifei Huang, Junting Pan, Yi Wang, Yali Wang, Yu Qiao, Tong Lu, Limin Wang:
VideoLLM: Modeling Video Sequence with Large Language Models. CoRR abs/2305.13292 (2023) - [i96]Yutao Cui, Tianhui Song, Gangshan Wu, Limin Wang:
MixFormerV2: Efficient Fully Transformer Tracking. CoRR abs/2305.15896 (2023) - [i95]Chuhao Jin, Wenhui Tan, Jiange Yang, Bei Liu, Ruihua Song, Limin Wang, Jianlong Fu:
AlphaBlock: Embodied Finetuning for Vision-Language Reasoning in Robot Manipulation. CoRR abs/2305.18898 (2023) - [i94]Jiange Yang, Wenhui Tan, Chuhao Jin, Bei Liu, Jianlong Fu, Ruihua Song, Limin Wang:
Pave the Way to Grasp Anything: Transferring Foundation Models for Universal Pick-Place Robots. CoRR abs/2306.05716 (2023) - [i93]Yi Wang, Yinan He, Yizhuo Li, Kunchang Li, Jiashuo Yu, Xin Ma, Xinyuan Chen, Yaohui Wang, Ping Luo, Ziwei Liu, Yali Wang, Limin Wang, Yu Qiao:
InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation. CoRR abs/2307.06942 (2023) - [i92]Ruopeng Gao, Limin Wang:
MeMOTR: Long-Term Memory-Augmented Transformer for Multi-Object Tracking. CoRR abs/2307.15700 (2023) - [i91]Jiahao Wang, Guo Chen, Yifei Huang, Limin Wang, Tong Lu:
Memory-and-Anticipation Transformer for Online Action Understanding. CoRR abs/2308.07893 (2023) - [i90]Haisong Liu, Yao Teng, Tao Lu, Haiguang Wang, Limin Wang:
SparseBEV: High-Performance Sparse 3D Object Detection from Multi-Camera Videos. CoRR abs/2308.09244 (2023) - [i89]Shuai Wang, Yao Teng, Limin Wang:
Deep Equilibrium Object Detection. CoRR abs/2308.09564 (2023) - [i88]Chen Xu, Yuhan Zhu, Guozhen Zhang, Haocheng Shen, Yixuan Liao, Xiaoxin Chen, Gangshan Wu, Limin Wang:
DPL: Decoupled Prompt Learning for Vision-Language Models. CoRR abs/2308.10061 (2023) - [i87]Bingkun Huang, Zhiyu Zhao, Guozhen Zhang, Yu Qiao, Limin Wang:
MGMAE: Motion Guided Masking for Video Masked Autoencoding. CoRR abs/2308.10794 (2023) - [i86]Jiaming Zhang, Yutao Cui, Gangshan Wu, Limin Wang:
Joint Modeling of Feature, Correspondence, and a Compressed Memory for Video Object Segmentation. CoRR abs/2308.13505 (2023) - [i85]Fengyuan Shi, Limin Wang:
Bridging The Gaps Between Token Pruning and Full Pre-training via Masked Fine-tuning. CoRR abs/2310.17177 (2023) - [i84]Yizhuo Li, Kunchang Li, Yinan He, Yi Wang, Yali Wang, Limin Wang, Yu Qiao, Ping Luo:
Harvest Video Foundation Models via Efficient Post-Pretraining. CoRR abs/2310.19554 (2023) - [i83]Zhiyu Zhao, Bingkun Huang, Sen Xing, Gangshan Wu, Yu Qiao, Limin Wang:
Asymmetric Masked Distillation for Pre-Training Small Foundation Models. CoRR abs/2311.03149 (2023) - [i82]Kunchang Li, Yali Wang, Yinan He, Yizhuo Li, Yi Wang, Yi Liu, Zun Wang, Jilan Xu, Guo Chen, Ping Luo, Limin Wang, Yu Qiao:
MVBench: A Comprehensive Multi-modal Video Understanding Benchmark. CoRR abs/2311.17005 (2023) - [i81]Ziqi Huang, Yinan He, Jiashuo Yu, Fan Zhang, Chenyang Si, Yuming Jiang, Yuanhan Zhang, Tianxing Wu, Qingyang Jin, Nattapol Chanpaisit, Yaohui Wang, Xinyuan Chen, Limin Wang, Dahua Lin, Yu Qiao, Ziwei Liu:
VBench: Comprehensive Benchmark Suite for Video Generative Models. CoRR abs/2311.17982 (2023) - [i80]Tao Lu, Mulin Yu, Linning Xu, Yuanbo Xiangli, Limin Wang, Dahua Lin, Bo Dai:
Scaffold-GS: Structured 3D Gaussians for View-Adaptive Rendering. CoRR abs/2312.00109 (2023) - [i79]Min Yang, Huan Gao, Ping Guo, Limin Wang:
Adapting Short-Term Transformers for Action Detection in Untrimmed Videos. CoRR abs/2312.01897 (2023) - [i78]Fengyuan Shi, Jiaxi Gu, Hang Xu, Songcen Xu, Wei Zhang, Limin Wang:
BIVDiff: A Training-Free Framework for General-Purpose Video Synthesis via Bridging Image and Video Diffusion Models. CoRR abs/2312.02813 (2023) - [i77]Hongjie Zhang, Yi Liu, Lu Dong, Yifei Huang, Zhen-Hua Ling, Yali Wang, Limin Wang, Yu Qiao:
MoVQA: A Benchmark of Versatile Question-Answering for Long-Form Movie Understanding. CoRR abs/2312.04817 (2023) - 2022
- [j17]Yutao Cui, Cheng Jiang, Limin Wang, Gangshan Wu:
Fully convolutional online tracking. Comput. Vis. Image Underst. 224: 103547 (2022) - [j16]Dapeng Du, Jiawei Chen, Yuexiang Li, Kai Ma, Gangshan Wu, Yefeng Zheng, Limin Wang:
Cross-Domain Gated Learning for Domain Generalization. Int. J. Comput. Vis. 130(11): 2842-2857 (2022) - [j15]Yi Liu, Limin Wang, Yali Wang, Xiao Ma, Yu Qiao:
FineAction: A Fine-Grained Video Dataset for Temporal Action Localization. IEEE Trans. Image Process. 31: 6937-6950 (2022) - [c68]Guo Chen, Yin-Dong Zheng, Limin Wang, Tong Lu:
DCAN: Improving Temporal Action Detection via Dual Context Aggregation. AAAI 2022: 248-257 - [c67]Zhenzhi Wang, Limin Wang, Tao Wu, Tianhao Li, Gangshan Wu:
Negative Sample Matters: A Renaissance of Metric Learning for Temporal Grounding. AAAI 2022: 2613-2623 - [c66]Jiaqi Tang, Zhaoyang Liu, Chen Qian, Wayne Wu, Limin Wang:
Progressive Attention on Multi-Level Dense Difference Maps for Generic Event Boundary Detection. CVPR 2022: 3345-3354 - [c65]Ziteng Gao, Limin Wang, Bing Han, Sheng Guo:
AdaMixer: A Fast-Converging Query-Based Object Detector. CVPR 2022: 5354-5363 - [c64]Yutao Cui, Cheng Jiang, Limin Wang, Gangshan Wu:
MixFormer: End-to-End Tracking with Iterative Mixed Attention. CVPR 2022: 13598-13608 - [c63]Jintao Lin, Haodong Duan, Kai Chen, Dahua Lin, Limin Wang:
OCSampler: Compressing Videos to One Clip with Single-step Sampling. CVPR 2022: 13884-13893 - [c62]Liang Zhao, Limin Wang:
Task-specific Inconsistency Alignment for Domain Adaptive Object Detection. CVPR 2022: 14197-14206 - [c61]Sheng Guo, Zihua Xiong, Yujie Zhong, Limin Wang, Xiaobo Guo, Bing Han, Weilin Huang:
Cross-Architecture Self-supervised Video Representation Learning. CVPR 2022: 19248-19257 - [c60]Yao Teng, Limin Wang:
Structured Sparse R-CNN for Direct Scene Graph Generation. CVPR 2022: 19415-19424 - [c59]Haoyue Cheng, Zhaoyang Liu, Hang Zhou, Chen Qian, Wayne Wu, Limin Wang:
Joint-Modal Label Denoising for Weakly-Supervised Audio-Visual Video Parsing. ECCV (34) 2022: 431-448 - [c58]Matej Kristan, Ales Leonardis, Jirí Matas, Michael Felsberg, Roman P. Pflugfelder, Joni-Kristian Kämäräinen, Hyung Jin Chang, Martin Danelljan, Luka Cehovin Zajc, Alan Lukezic, Ondrej Drbohlav, Johanna Björklund, Yushan Zhang, Zhongqun Zhang, Song Yan, Wenyan Yang, Dingding Cai, Christoph Mayer, Gustavo Fernández, Kang Ben, Goutam Bhat, Hong Chang, Guangqi Chen, Jiaye Chen, Shengyong Chen, Xilin Chen, Xin Chen, Xiuyi Chen, Yiwei Chen, Yu-Hsi Chen, Zhixing Chen, Yangming Cheng, Angelo Ciaramella, Yutao Cui, Benjamin Dzubur, Mohana Murali Dasari, Qili Deng, Debajyoti Dhar, Shangzhe Di, Emanuel Di Nardo, Daniel K. Du, Matteo Dunnhofer, Heng Fan, Zhenhua Feng, Zhihong Fu, Shang Gao, Rama Krishna Gorthi, Eric Granger, Q. H. Gu, Himanshu Gupta, Jianfeng He, Keji He, Yan Huang, Deepak Jangid, Rongrong Ji, Cheng Jiang, Yingjie Jiang, Felix Järemo Lawin, Ze Kang, Madhu Kiran, Josef Kittler, Simiao Lai, Xiangyuan Lan, Dongwook Lee, Hyunjeong Lee, Seohyung Lee, Hui Li, Ming Li, Wangkai Li, Xi Li, Xianxian Li, Xiao Li, Zhe Li, Liting Lin, Haibin Ling, Bo Liu, Chang Liu, Si Liu, Huchuan Lu, Rafael M. O. Cruz, Bingpeng Ma, Chao Ma, Jie Ma, Yinchao Ma, Niki Martinel, Alireza Memarmoghadam, Christian Micheloni, Payman Moallem, Le Thanh Nguyen-Meidine, Siyang Pan, ChangBeom Park, Danda Pani Paudel, Matthieu Paul, Houwen Peng, Andreas Robinson, Litu Rout, Shiguang Shan, Kristian Simonato, Tianhui Song, Xiaoning Song, Chao Sun, Jingna Sun, Zhangyong Tang, Radu Timofte, Chi-Yi Tsai, Luc Van Gool, Om Prakash Verma, Dong Wang, Fei Wang, Liang Wang, Liangliang Wang, Lijun Wang, Limin Wang, Qiang Wang, Gangshan Wu, Jinlin Wu, Xiaojun Wu, Fei Xie, Tianyang Xu, Wei Xu, Yong Xu, Yuanyou Xu, Wanli Xue, Zizheng Xun, Bin Yan, Dawei Yang, Jinyu Yang, Wankou Yang, Xiaoyun Yang, Yi Yang, Yichun Yang, Zongxin Yang, Botao Ye, Fisher Yu, Hongyuan Yu, Jiaqian Yu, Qianjin Yu, Weichen Yu, Kang Ze, Jiang Zhai, Chengwei Zhang, Chunhu Zhang, Kaihua Zhang, Tianzhu Zhang, Wenkang Zhang, Zhibin Zhang, Zhipeng Zhang, Jie Zhao, Shao-Chuan Zhao, Feng Zheng, Haixia Zheng, Min Zheng, Bineng Zhong, Jiawen Zhu, Xuefeng Zhu, Yueting Zhuang:
The Tenth Visual Object Tracking VOT2022 Challenge Results. ECCV Workshops (8) 2022: 431-460 - [c57]Mengqi Cao, Min Yang, Guozhen Zhang, Xiaotian Li, Yilu Wu, Gangshan Wu, Limin Wang:
SpotFormer: A Transformer-based Framework for Precise Soccer Action Spotting. MMSP 2022: 1-6 - [c56]Jing Tan, Xiaotong Zhao, Xintian Shi, Bin Kang, Limin Wang:
PointTAD: Multi-Label Temporal Action Detection with Learnable Query Points. NeurIPS 2022 - [c55]Zhan Tong, Yibing Song, Jue Wang, Limin Wang:
VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training. NeurIPS 2022 - [i76]Jintao Lin, Haodong Duan, Kai Chen, Dahua Lin, Limin Wang:
OCSampler: Compressing Videos to One Clip with Single-step Sampling. CoRR abs/2201.04388 (2022) - [i75]Jing Tan, Yuhong Wang, Gangshan Wu, Limin Wang:
Temporal Perceiver: A General Architecture for Arbitrary Boundary Detection. CoRR abs/2203.00307 (2022) - [i74]Yating Tian, Hongwen Zhang, Yebin Liu, Limin Wang:
Recovering 3D Human Mesh from Monocular Images: A Survey. CoRR abs/2203.01923 (2022) - [i73]Yutao Cui, Cheng Jiang, Limin Wang, Gangshan Wu:
MixFormer: End-to-End Tracking with Iterative Mixed Attention. CoRR abs/2203.11082 (2022) - [i72]Zhan Tong, Yibing Song, Jue Wang, Limin Wang:
VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training. CoRR abs/2203.12602 (2022) - [i71]Liang Zhao, Limin Wang:
Task-specific Inconsistency Alignment for Domain Adaptive Object Detection. CoRR abs/2203.15345 (2022) - [i70]Ziteng Gao, Limin Wang, Bing Han, Sheng Guo:
AdaMixer: A Fast-Converging Query-Based Object Detector. CoRR abs/2203.16507 (2022) - [i69]Liang Zhao, Yao Teng, Limin Wang:
Logit Normalization for Long-tail Object Detection. CoRR abs/2203.17020 (2022) - [i68]Haoyue Cheng, Zhaoyang Liu, Hang Zhou, Chen Qian, Wayne Wu, Limin Wang:
Joint-Modal Label Denoising for Weakly-Supervised Audio-Visual Video Parsing. CoRR abs/2204.11573 (2022) - [i67]Tao Lu, Chunxu Liu, Youxin Chen, Gangshan Wu, Limin Wang:
APP-Net: Auxiliary-point-based Push and Pull Operations for Efficient Point Cloud Classification. CoRR abs/2205.00847 (2022) - [i66]Min Yang, Guo Chen, Yin-Dong Zheng, Tong Lu, Limin Wang:
BasicTAD: an Astounding RGB-Only Baseline for Temporal Action Detection. CoRR abs/2205.02717 (2022) - [i65]Sheng Guo, Zihua Xiong, Yujie Zhong, Limin Wang, Xiaobo Guo, Bing Han, Weilin Huang:
Cross-Architecture Self-supervised Video Representation Learning. CoRR abs/2205.13313 (2022) - [i64]Jiaqi Tang, Zhaoyang Liu, Jing Tan, Chen Qian, Wayne Wu, Limin Wang:
Submission to Generic Event Boundary Detection Challenge@CVPR 2022: Local Context Modeling and Global Boundary Decoding Approach. CoRR abs/2206.15268 (2022) - [i63]Fengyuan Shi, Ruopeng Gao, Weilin Huang, Limin Wang:
Dynamic MDETR: A Dynamic Multimodal Transformer Decoder for Visual Grounding. CoRR abs/2209.13959 (2022) - [i62]Jing Tan, Xiaotong Zhao, Xintian Shi, Bin Kang, Limin Wang:
PointTAD: Multi-Label Temporal Action Detection with Learnable Query Points. CoRR abs/2210.11035 (2022) - [i61]Yin-Dong Zheng, Guo Chen, Jiahao Wang, Tong Lu, Limin Wang:
Exploring State Change Capture of Heterogeneous Backbones @ Ego4D Hands and Objects Challenge 2022. CoRR abs/2211.08728 (2022) - [i60]Guo Chen, Sen Xing, Zhe Chen, Yi Wang, Kunchang Li, Yizhuo Li, Yi Liu, Jiahao Wang, Yin-Dong Zheng, Bingkun Huang, Zhiyu Zhao, Junting Pan, Yifei Huang, Zun Wang, Jiashuo Yu, Yinan He, Hongjie Zhang, Tong Lu, Yali Wang, Limin Wang, Yu Qiao:
InternVideo-Ego4D: A Pack of Champion Solutions to Ego4D Challenges. CoRR abs/2211.09529 (2022) - [i59]Kunchang Li, Yali Wang, Yinan He, Yizhuo Li, Yi Wang, Limin Wang, Yu Qiao:
UniFormerV2: Spatiotemporal Learning by Arming Image ViTs with Video UniFormer. CoRR abs/2211.09552 (2022) - [i58]Jintao Lin, Zhaoyang Liu, Wenhai Wang, Wayne Wu, Limin Wang:
VLG: General Video Recognition with Web Textual Knowledge. CoRR abs/2212.01638 (2022) - [i57]Yi Wang, Kunchang Li, Yizhuo Li, Yinan He, Bingkun Huang, Zhiyu Zhao, Hongjie Zhang, Jilan Xu, Yi Liu, Zun Wang, Sen Xing, Guo Chen, Junting Pan, Jiashuo Yu, Yali Wang, Limin Wang, Yu Qiao:
InternVideo: General Video Foundation Models via Generative and Discriminative Learning. CoRR abs/2212.03191 (2022) - 2021
- [j14]Dapeng Du, Limin Wang, Zhaoyang Li, Gangshan Wu:
Cross-Modal Pyramid Translation for RGB-D Scene Recognition. Int. J. Comput. Vis. 129(8): 2309-2327 (2021) - [j13]Zeyu Ruan, Changqing Zou, Longhai Wu, Gangshan Wu, Limin Wang:
SADRNet: Self-Aligned Dual Face Regression Networks for Robust 3D Dense Face Alignment and Reconstruction. IEEE Trans. Image Process. 30: 5793-5806 (2021) - [c54]Zhenxi Zhu, Limin Wang, Sheng Guo, Gangshan Wu:
A Closer Look at Few-Shot Video Classification: A New Baseline and Benchmark. BMVC 2021: 237 - [c53]Limin Wang, Zhan Tong, Bin Ji, Gangshan Wu:
TDN: Temporal Difference Networks for Efficient Action Recognition. CVPR 2021: 1895-1904 - [c52]Tao Lu, Limin Wang, Gangshan Wu:
CGA-Net: Category Guided Aggregation for Point Cloud Semantic Segmentation. CVPR 2021: 11693-11702 - [c51]Tianhao Li, Limin Wang, Gangshan Wu:
Self Supervision to Distillation for Long-Tailed Visual Recognition. ICCV 2021: 610-619 - [c50]Yuan Zhi, Zhan Tong, Limin Wang, Gangshan Wu:
MGSampler: An Explainable Sampling Strategy for Video Action Recognition. ICCV 2021: 1493-1502 - [c49]Ziteng Gao, Limin Wang, Gangshan Wu:
Mutual Supervision for Dense Object Detection. ICCV 2021: 3621-3630 - [c48]Hongwen Zhang, Yating Tian, Xinchi Zhou, Wanli Ouyang, Yebin Liu, Limin Wang, Zhenan Sun:
PyMAF: 3D Human Pose and Shape Regression with Pyramidal Mesh Alignment Feedback Loop. ICCV 2021: 11426-11436 - [c47]Jing Tan, Jiaqi Tang, Limin Wang, Gangshan Wu:
Relaxed Transformer Decoders for Direct Action Proposal Generation. ICCV 2021: 13506-13515 - [c46]Yixuan Li, Lei Chen, Runyu He, Zhenzhi Wang, Gangshan Wu, Limin Wang:
MultiSports: A Multi-Person Video Dataset of Spatio-Temporally Localized Sports Actions. ICCV 2021: 13516-13525 - [c45]Yao Teng, Limin Wang, Zhifeng Li, Gangshan Wu:
Target Adaptive Context Aggregation for Video Scene Graph Generation. ICCV 2021: 13668-13677 - [c44]Zhaoyang Liu, Limin Wang, Wayne Wu, Chen Qian, Tong Lu:
TAM: Temporal Adaptive Module for Video Recognition. ICCV 2021: 13688-13698 - [c43]Matej Kristan, Jirí Matas, Ales Leonardis, Michael Felsberg, Roman P. Pflugfelder, Joni-Kristian Kämäräinen, Hyung Jin Chang, Martin Danelljan, Luka Cehovin Zajc, Alan Lukezic, Ondrej Drbohlav, Jani Käpylä, Gustav Häger, Song Yan, Jinyu Yang, Zhongqun Zhang, Gustavo Fernández, Mohamed H. Abdelpakey, Goutam Bhat, Llukman Cerkezi, Hakan Cevikalp, Shengyong Chen, Xin Chen, Miao Cheng, Ziyi Cheng, Yu-Chen Chiu, Ozgun Cirakman, Yutao Cui, Kenan Dai, Mohana Murali Dasari, Qili Deng, Xingping Dong, Daniel K. Du, Matteo Dunnhofer, Zhenhua Feng, Zhiyong Feng, Zhihong Fu, Shiming Ge, Rama Krishna Gorthi, Yuzhang Gu, Bilge Günsel, Qing Guo, Filiz Gurkan, Wencheng Han, Yanyan Huang, Felix Järemo Lawin, Shang-Jhih Jhang, Rongrong Ji, Cheng Jiang, Yingjie Jiang, Felix Juefei-Xu, J. Yin, Xiao Ke, Fahad Shahbaz Khan, Byeong Hak Kim, Josef Kittler, Xiangyuan Lan, Jun Ha Lee, Bastian Leibe, Hui Li, Jianhua Li, Xianxian Li, Yuezhou Li, Bo Liu, Chang Liu, Jingen Liu, Li Liu, Qingjie Liu, Huchuan Lu, Wei Lu, Jonathon Luiten, Jie Ma, Ziang Ma, Niki Martinel, Christoph Mayer, Alireza Memarmoghadam, Christian Micheloni, Yuzhen Niu, Danda Pani Paudel, Houwen Peng, Shoumeng Qiu, Aravindh Rajiv, Muhammad Rana, Andreas Robinson, Hasan Saribas, Ling Shao, Mohamed S. Shehata, Furao Shen, Jianbing Shen, Kristian Simonato, Xiaoning Song, Zhangyong Tang, Radu Timofte, Philip H. S. Torr, Chi-Yi Tsai, Bedirhan Uzun, Luc Van Gool, Paul Voigtlaender, Dong Wang, Guangting Wang, Liangliang Wang, Lijun Wang, Limin Wang, Linyuan Wang, Yong Wang, Yunhong Wang, Chenyan Wu, Gangshan Wu, Xiaojun Wu, Fei Xie, Tianyang Xu, Xiang Xu, Wanli Xue, Bin Yan, Wankou Yang, Xiaoyun Yang, Yu Ye, Jun Yin, Chengwei Zhang, Chunhui Zhang, Haitao Zhang, Kaihua Zhang, Kangkai Zhang, Xiaohan Zhang, Xiaolin Zhang, Xinyu Zhang, Zhibin Zhang, Shao-Chuan Zhao, Ming Zhen, Bineng Zhong, Jiawen Zhu, Xuefeng Zhu:
The Ninth Visual Object Tracking VOT2021 Challenge Results. ICCVW 2021: 2711-2738 - [c42]Limin Wang:
Cross-modal Pretraining and Matching for Video Understanding. MMPT@ICMR 2021: 1-2 - [c41]Liwei Jin, Haoyue Cheng, Su Xu, Wayne Wu, Limin Wang:
NJU MCG - Sensetime Team Submission to Pre-training for Video Understanding Challenge Track II. ACM Multimedia 2021: 4799-4802 - [i56]Jing Tan, Jiaqi Tang, Limin Wang, Gangshan Wu:
Relaxed Transformer Decoders for Direct Action Proposal Generation. CoRR abs/2102.01894 (2021) - [i55]Hongwen Zhang, Yating Tian, Xinchi Zhou, Wanli Ouyang, Yebin Liu, Limin Wang, Zhenan Sun:
3D Human Pose and Shape Regression with Pyramidal Mesh Alignment Feedback Loop. CoRR abs/2103.16507 (2021) - [i54]Yutao Cui, Cheng Jiang, Limin Wang, Gangshan Wu:
Target Transformed Regression for Accurate Tracking. CoRR abs/2104.00403 (2021) - [i53]Yuan Zhi, Zhan Tong, Limin Wang, Gangshan Wu:
MGSampler: An Explainable Sampling Strategy for Video Action Recognition. CoRR abs/2104.09952 (2021) - [i52]Yixuan Li, Lei Chen, Runyu He, Zhenzhi Wang, Gangshan Wu, Limin Wang:
MultiSports: A Multi-Person Video Dataset of Spatio-Temporally Localized Sports Actions. CoRR abs/2105.07404 (2021) - [i51]Yi Liu, Limin Wang, Xiao Ma, Yali Wang, Yu Qiao:
FineAction: A Fined Video Dataset for Temporal Action Localization. CoRR abs/2105.11107 (2021) - [i50]Zeyu Ruan, Changqing Zou, Longhai Wu, Gangshan Wu, Limin Wang:
SADRNet: Self-Aligned Dual Face Regression Networks for Robust 3D Dense Face Alignment and Reconstruction. CoRR abs/2106.03021 (2021) - [i49]Yao Teng, Limin Wang:
Structured Sparse R-CNN for Direct Scene Graph Generation. CoRR abs/2106.10815 (2021) - [i48]Yao Teng, Limin Wang, Zhifeng Li, Gangshan Wu:
Target Adaptive Context Aggregation for Video Scene Graph Generation. CoRR abs/2108.08121 (2021) - [i47]Tianhao Li, Limin Wang, Gangshan Wu:
Self Supervision to Distillation for Long-Tailed Visual Recognition. CoRR abs/2109.04075 (2021) - [i46]Zhenzhi Wang, Limin Wang, Tao Wu, Tianhao Li, Gangshan Wu:
Negative Sample Matters: A Renaissance of Metric Learning for Temporal Grounding. CoRR abs/2109.04872 (2021) - [i45]Ziteng Gao, Limin Wang, Gangshan Wu:
Mutual Supervision for Dense Object Detection. CoRR abs/2109.05986 (2021) - [i44]Fengyuan Shi, Limin Wang, Weilin Huang:
End-to-End Dense Video Grounding via Parallel Regression. CoRR abs/2109.11265 (2021) - [i43]Zhenxi Zhu, Limin Wang, Sheng Guo, Gangshan Wu:
A Closer Look at Few-Shot Video Classification: A New Baseline and Benchmark. CoRR abs/2110.12358 (2021) - [i42]Guo Chen, Yin-Dong Zheng, Limin Wang, Tong Lu:
DCAN: Improving Temporal Action Detection via Dual Context Aggregation. CoRR abs/2112.03612 (2021) - [i41]Jiaqi Tang, Zhaoyang Liu, Chen Qian, Wayne Wu, Limin Wang:
Progressive Attention on Multi-Level Dense Difference Maps for Generic Event Boundary Detection. CoRR abs/2112.04771 (2021) - 2020
- [j12]Yue Zhao, Yuanjun Xiong, Limin Wang, Zhirong Wu, Xiaoou Tang, Dahua Lin:
Temporal Action Detection with Structured Segment Networks. Int. J. Comput. Vis. 128(1): 74-95 (2020) - [j11]Yin-Dong Zheng, Zhaoyang Liu, Tong Lu, Limin Wang:
Dynamic Sampling Networks for Efficient Action Recognition in Videos. IEEE Trans. Image Process. 29: 7970-7983 (2020) - [c40]Yuxi Li, Weiyao Lin, Tao Wang, John See, Rui Qian, Ning Xu, Limin Wang, Shugong Xu:
Finding Action Tubes with a Sparse-to-Dense Framework. AAAI 2020: 11466-11473 - [c39]Zhaoyang Liu, Donghao Luo, Yabiao Wang, Limin Wang, Ying Tai, Chengjie Wang, Jilin Li, Feiyue Huang, Tong Lu:
TEINet: Towards an Efficient Architecture for Video Recognition. AAAI 2020: 11669-11676 - [c38]Shiwen Zhang, Sheng Guo, Limin Wang, Weilin Huang, Matthew R. Scott:
Knowledge Integration Networks for Action Recognition. AAAI 2020: 12862-12869 - [c37]Yan Li, Bin Ji, Xintian Shi, Jianguo Zhang, Bin Kang, Limin Wang:
TEA: Temporal Excitation and Aggregation for Action Recognition. CVPR 2020: 906-915 - [c36]Chengying Gao, Qi Liu, Qi Xu, Limin Wang, Jianzhuang Liu, Changqing Zou:
SketchyCOCO: Image Generation From Freehand Scene Sketches. CVPR 2020: 5173-5182 - [c35]Zhenzhi Wang, Ziteng Gao, Limin Wang, Zhifeng Li, Gangshan Wu:
Boundary-Aware Cascade Networks for Temporal Action Segmentation. ECCV (25) 2020: 34-51 - [c34]Yixuan Li, Zixu Wang, Limin Wang, Gangshan Wu:
Actions as Moving Points. ECCV (16) 2020: 68-84 - [c33]Jianchao Wu, Zhanghui Kuang, Limin Wang, Wayne Zhang, Gangshan Wu:
Context-Aware RCNN: A Baseline for Action Detection in Videos. ECCV (25) 2020: 440-456 - [c32]Shiwen Zhang, Sheng Guo, Weilin Huang, Matthew R. Scott, Limin Wang:
V4D: 4D Convolutional Neural Networks for Video-level Representation Learning. ICLR 2020 - [i40]Yixuan Li, Zixu Wang, Limin Wang, Gangshan Wu:
Actions as Moving Points. CoRR abs/2001.04608 (2020) - [i39]Tianhao Li, Limin Wang:
Learning Spatiotemporal Features via Video and Text Pair Discrimination. CoRR abs/2001.05691 (2020) - [i38]Shiwen Zhang, Sheng Guo, Weilin Huang, Matthew R. Scott, Limin Wang:
V4D: 4D Convolutional Neural Networks for Video-level Representation Learning. CoRR abs/2002.07442 (2020) - [i37]Shiwen Zhang, Sheng Guo, Limin Wang, Weilin Huang, Matthew R. Scott:
Knowledge Integration Networks for Action Recognition. CoRR abs/2002.07471 (2020) - [i36]Chengying Gao, Qi Liu, Qi Xu, Jianzhuang Liu, Limin Wang, Changqing Zou:
SketchyCOCO: Image Generation from Freehand Scene Sketches. CoRR abs/2003.02683 (2020) - [i35]Yan Li, Bin Ji, Xintian Shi, Jianguo Zhang, Bin Kang, Limin Wang:
TEA: Temporal Excitation and Aggregation for Action Recognition. CoRR abs/2004.01398 (2020) - [i34]Yutao Cui, Cheng Jiang, Limin Wang, Gangshan Wu:
Fully Convolutional Online Tracking. CoRR abs/2004.07109 (2020) - [i33]Zhaoyang Liu, Limin Wang, Wayne Wu, Chen Qian, Tong Lu:
TAM: Temporal Adaptive Module for Video Recognition. CoRR abs/2005.06803 (2020) - [i32]Yin-Dong Zheng, Zhaoyang Liu, Tong Lu, Limin Wang:
Dynamic Sampling Networks for Efficient Action Recognition in Videos. CoRR abs/2006.15560 (2020) - [i31]Jianchao Wu, Zhanghui Kuang, Limin Wang, Wayne Zhang, Gangshan Wu:
Context-Aware RCNN: A Baseline for Action Detection in Videos. CoRR abs/2007.09861 (2020) - [i30]Yuxi Li, Weiyao Lin, Tao Wang, John See, Rui Qian, Ning Xu, Limin Wang, Shugong Xu:
Finding Action Tubes with a Sparse-to-Dense Framework. CoRR abs/2008.13196 (2020) - [i29]Limin Wang, Zhan Tong, Bin Ji, Gangshan Wu:
TDN: Temporal Difference Networks for Efficient Action Recognition. CoRR abs/2012.10071 (2020)
2010 – 2019
- 2019
- [j10]Limin Wang, Yuanjun Xiong, Zhe Wang, Yu Qiao, Dahua Lin, Xiaoou Tang, Luc Van Gool:
Temporal Segment Networks for Action Recognition in Videos. IEEE Trans. Pattern Anal. Mach. Intell. 41(11): 2740-2755 (2019) - [c31]Dongliang He, Zhichao Zhou, Chuang Gan, Fu Li, Xiao Liu, Yandong Li, Limin Wang, Shilei Wen:
StNet: Local and Global Spatial-Temporal Modeling for Action Recognition. AAAI 2019: 8401-8408 - [c30]Bowen Pan, Jiankai Sun, Wuwei Lin, Limin Wang, Weiyao Lin:
Cross-Stream Selective Networks for Action Recognition. CVPR Workshops 2019: 454-460 - [c29]Jianchao Wu, Limin Wang, Li Wang, Jie Guo, Gangshan Wu:
Learning Actor Relation Graphs for Group Activity Recognition. CVPR 2019: 9964-9974 - [c28]Dapeng Du, Limin Wang, Huiling Wang, Kai Zhao, Gangshan Wu:
Translate-to-Recognize Networks for RGB-D Scene Recognition. CVPR 2019: 11836-11845 - [c27]Ziteng Gao, Limin Wang, Gangshan Wu:
LIP: Local Importance-Based Pooling. ICCV 2019: 3354-3363 - [c26]Yazhou Yao, Zeren Sun, Fumin Shen, Li Liu, Limin Wang, Fan Zhu, Lizhong Ding, Gangshan Wu, Ling Shao:
Dynamically Visual Disambiguation of Keyword-based Image Search. IJCAI 2019: 996-1002 - [i28]Jianchao Wu, Limin Wang, Li Wang, Jie Guo, Gangshan Wu:
Learning Actor Relation Graphs for Group Activity Recognition. CoRR abs/1904.10117 (2019) - [i27]Dapeng Du, Limin Wang, Huiling Wang, Kai Zhao, Gangshan Wu:
Translate-to-Recognize Networks for RGB-D Scene Recognition. CoRR abs/1904.12254 (2019) - [i26]Yazhou Yao, Zeren Sun, Fumin Shen, Li Liu, Limin Wang, Fan Zhu, Lizhong Ding, Gangshan Wu, Ling Shao:
Dynamically Visual Disambiguation of Keyword-based Image Search. CoRR abs/1905.10955 (2019) - [i25]Ziteng Gao, Limin Wang, Gangshan Wu:
LIP: Local Importance-based Pooling. CoRR abs/1908.04156 (2019) - [i24]Zhaoyang Liu, Donghao Luo, Yabiao Wang, Limin Wang, Ying Tai, Chengjie Wang, Jilin Li, Feiyue Huang, Tong Lu:
TEINet: Towards an Efficient Architecture for Video Recognition. CoRR abs/1911.09435 (2019) - 2018
- [j9]Limin Wang, Zhe Wang, Yu Qiao, Luc Van Gool:
Transferring Deep Object and Scene Representations for Event Recognition in Still Images. Int. J. Comput. Vis. 126(2-4): 390-409 (2018) - [j8]Bowen Zhang, Limin Wang, Zhe Wang, Yu Qiao, Hanli Wang:
Real-Time Action Recognition With Deeply Transferred Motion Vector CNNs. IEEE Trans. Image Process. 27(5): 2326-2339 (2018) - [c25]Limin Wang, Wei Li, Wen Li, Luc Van Gool:
Appearance-and-Relation Networks for Video Classification. CVPR 2018: 1430-1439 - [c24]Jie Guo, Zuojian Zhou, Limin Wang:
Single Image Highlight Removal with a Sparse and Low-Rank Reflection Model. ECCV (4) 2018: 282-298 - [c23]Zhe Wang, Xiaoyi Liu, Limin Wang, Yu Qiao, Xiaohui Xie, Charless C. Fowlkes:
Structured Triplet Learning with POS-Tag Guided Attention for Visual Question Answering. WACV 2018: 1888-1896 - [i23]Zhe Wang, Xiaoyi Liu, Liangjian Chen, Limin Wang, Yu Qiao, Xiaohui Xie, Charless C. Fowlkes:
Structured Triplet Learning with POS-tag Guided Attention for Visual Question Answering. CoRR abs/1801.07853 (2018) - [i22]Dongliang He, Zhichao Zhou, Chuang Gan, Fu Li, Xiao Liu, Yandong Li, Limin Wang, Shilei Wen:
StNet: Local and Global Spatial-Temporal Modeling for Action Recognition. CoRR abs/1811.01549 (2018) - 2017
- [j7]Sheng Guo, Weilin Huang, Limin Wang, Yu Qiao:
Locally Supervised Deep Hybrid Model for Scene Recognition. IEEE Trans. Image Process. 26(2): 808-820 (2017) - [j6]Zhe Wang, Limin Wang, Yali Wang, Bowen Zhang, Yu Qiao:
Weakly Supervised PatchNets: Describing and Aggregating Local Patches for Scene Recognition. IEEE Trans. Image Process. 26(4): 2028-2041 (2017) - [j5]Limin Wang, Sheng Guo, Weilin Huang, Yuanjun Xiong, Yu Qiao:
Knowledge Guided Disambiguation for Large-Scale Scene Classification With Multi-Resolution CNNs. IEEE Trans. Image Process. 26(4): 2055-2068 (2017) - [c22]Jie Song, Limin Wang, Luc Van Gool, Otmar Hilliges:
Thin-Slicing Network: A Deep Structured Model for Pose Estimation in Videos. CVPR 2017: 5563-5572 - [c21]Limin Wang, Yuanjun Xiong, Dahua Lin, Luc Van Gool:
UntrimmedNets for Weakly Supervised Action Recognition and Detection. CVPR 2017: 6402-6411 - [c20]Yue Zhao, Yuanjun Xiong, Limin Wang, Zhirong Wu, Xiaoou Tang, Dahua Lin:
Temporal Action Detection with Structured Segment Networks. ICCV 2017: 2933-2942 - [i21]Yuanjun Xiong, Yue Zhao, Limin Wang, Dahua Lin, Xiaoou Tang:
A Pursuit of Temporal Accuracy in General Activity Detection. CoRR abs/1703.02716 (2017) - [i20]Limin Wang, Yuanjun Xiong, Dahua Lin, Luc Van Gool:
UntrimmedNets for Weakly Supervised Action Recognition and Detection. CoRR abs/1703.03329 (2017) - [i19]Jie Song, Limin Wang, Luc Van Gool, Otmar Hilliges:
Thin-Slicing Network: A Deep Structured Model for Pose Estimation in Videos. CoRR abs/1703.10898 (2017) - [i18]Yue Zhao, Yuanjun Xiong, Limin Wang, Zhirong Wu, Dahua Lin, Xiaoou Tang:
Temporal Action Detection with Structured Segment Networks. CoRR abs/1704.06228 (2017) - [i17]Limin Wang, Yuanjun Xiong, Zhe Wang, Yu Qiao, Dahua Lin, Xiaoou Tang, Luc Van Gool:
Temporal Segment Networks for Action Recognition in Videos. CoRR abs/1705.02953 (2017) - [i16]Wen Li, Limin Wang, Wei Li, Eirikur Agustsson, Jesse Berent, Abhinav Gupta, Rahul Sukthankar, Luc Van Gool:
WebVision Challenge: Visual Learning and Understanding With Web Data. CoRR abs/1705.05640 (2017) - [i15]Wen Li, Limin Wang, Wei Li, Eirikur Agustsson, Luc Van Gool:
WebVision Database: Visual Learning and Understanding from Web Data. CoRR abs/1708.02862 (2017) - [i14]Limin Wang, Wei Li, Wen Li, Luc Van Gool:
Appearance-and-Relation Networks for Video Classification. CoRR abs/1711.09125 (2017) - 2016
- [j4]Xiaojiang Peng, Limin Wang, Xingxing Wang, Yu Qiao:
Bag of visual words and fusion methods for action recognition: Comprehensive study and good practice. Comput. Vis. Image Underst. 150: 109-125 (2016) - [j3]Ze-Huan Yuan, Hao Wang, Limin Wang, Tong Lu, Shivakumara Palaiahnakote, Chew Lim Tan:
Modeling spatial layout for scene image understanding via a novel multiscale sum-product network. Expert Syst. Appl. 63: 231-240 (2016) - [j2]Limin Wang, Yu Qiao, Xiaoou Tang:
MoFAP: A Multi-level Representation for Action Recognition. Int. J. Comput. Vis. 119(3): 254-271 (2016) - [c19]Yifan Wang, Jie Song, Limin Wang, Luc Van Gool, Otmar Hilliges:
Two-Stream SR-CNNs for Action Recognition in Videos. BMVC 2016 - [c18]Limin Wang, Yu Qiao, Xiaoou Tang, Luc Van Gool:
Actionness Estimation Using Hybrid Fully Convolutional Networks. CVPR 2016: 2708-2717 - [c17]Bowen Zhang, Limin Wang, Zhe Wang, Yu Qiao, Hanli Wang:
Real-Time Action Recognition with Enhanced Motion Vector CNNs. CVPR 2016: 2718-2726 - [c16]Limin Wang, Yuanjun Xiong, Zhe Wang, Yu Qiao, Dahua Lin, Xiaoou Tang, Luc Van Gool:
Temporal Segment Networks: Towards Good Practices for Deep Action Recognition. ECCV (8) 2016: 20-36 - [c15]Zhe Wang, Yali Wang, Limin Wang, Yu Qiao:
Codebook enhancement of vlad representation for visual recognition. ICASSP 2016: 1258-1262 - [i13]Limin Wang, Yu Qiao, Xiaoou Tang, Luc Van Gool:
Actionness Estimation Using Hybrid Fully Convolutional Networks. CoRR abs/1604.07279 (2016) - [i12]Bowen Zhang, Limin Wang, Zhe Wang, Yu Qiao, Hanli Wang:
Real-time Action Recognition with Enhanced Motion Vector CNNs. CoRR abs/1604.07669 (2016) - [i11]Yuanjun Xiong, Limin Wang, Zhe Wang, Bowen Zhang, Hang Song, Wei Li, Dahua Lin, Yu Qiao, Luc Van Gool, Xiaoou Tang:
CUHK & ETHZ & SIAT Submission to ActivityNet Challenge 2016. CoRR abs/1608.00797 (2016) - [i10]Limin Wang, Yuanjun Xiong, Zhe Wang, Yu Qiao, Dahua Lin, Xiaoou Tang, Luc Van Gool:
Temporal Segment Networks: Towards Good Practices for Deep Action Recognition. CoRR abs/1608.00859 (2016) - [i9]Zhe Wang, Limin Wang, Yali Wang, Bowen Zhang, Yu Qiao:
Weakly Supervised PatchNets: Describing and Aggregating Local Patches for Scene Recognition. CoRR abs/1609.00153 (2016) - [i8]Limin Wang, Zhe Wang, Yu Qiao, Luc Van Gool:
Transferring Object-Scene Convolutional Neural Networks for Event Recognition in Still Images. CoRR abs/1609.00162 (2016) - [i7]Limin Wang, Sheng Guo, Weilin Huang, Yuanjun Xiong, Yu Qiao:
Knowledge Guided Disambiguation for Large-Scale Scene Classification with Multi-Resolution CNNs. CoRR abs/1610.01119 (2016) - 2015
- [c14]Zhe Wang, Limin Wang, Wenbin Du, Yu Qiao:
Exploring Fisher vector and deep networks for action spotting. CVPR Workshops 2015: 10-14 - [c13]Limin Wang, Zhe Wang, Wenbin Du, Yu Qiao:
Object-Scene Convolutional Neural Networks for event recognition in images. CVPR Workshops 2015: 30-35 - [c12]Limin Wang, Yu Qiao, Xiaoou Tang:
Action recognition with trajectory-pooled deep-convolutional descriptors. CVPR 2015: 4305-4314 - [c11]Limin Wang, Zhe Wang, Sheng Guo, Yu Qiao:
Better Exploiting OS-CNNs for Better Event Recognition in Images. ICCV Workshops 2015: 287-294 - [i6]Limin Wang, Zhe Wang, Wenbin Du, Yu Qiao:
Object-Scene Convolutional Neural Networks for Event Recognition in Images. CoRR abs/1505.00296 (2015) - [i5]Limin Wang, Yu Qiao, Xiaoou Tang:
Action Recognition with Trajectory-Pooled Deep-Convolutional Descriptors. CoRR abs/1505.04868 (2015) - [i4]Limin Wang, Yuanjun Xiong, Zhe Wang, Yu Qiao:
Towards Good Practices for Very Deep Two-Stream ConvNets. CoRR abs/1507.02159 (2015) - [i3]Limin Wang, Sheng Guo, Weilin Huang, Yu Qiao:
Places205-VGGNet Models for Scene Recognition. CoRR abs/1508.01667 (2015) - [i2]Limin Wang, Zhe Wang, Sheng Guo, Yu Qiao:
Better Exploiting OS-CNNs for Better Event Recognition in Images. CoRR abs/1510.03979 (2015) - 2014
- [j1]Limin Wang, Yu Qiao, Xiaoou Tang:
Latent Hierarchical Model of Temporal Structure for Complex Activity Classification. IEEE Trans. Image Process. 23(2): 810-822 (2014) - [c10]Zhuowei Cai, Limin Wang, Xiaojiang Peng, Yu Qiao:
Multi-view Super Vector for Action Recognition. CVPR 2014: 596-603 - [c9]Xiaojiang Peng, Limin Wang, Zhuowei Cai, Yu Qiao:
Action and Gesture Temporal Spotting with Super Vector Representation. ECCV Workshops (1) 2014: 518-527 - [c8]Limin Wang, Yu Qiao, Xiaoou Tang:
Video Action Detection with Relational Dynamic-Poselets. ECCV (5) 2014: 565-580 - [c7]Xiaojiang Peng, Limin Wang, Yu Qiao, Qiang Peng:
Boosting VLAD with Supervised Dictionary Learning and High-Order Statistics. ECCV (3) 2014: 660-674 - [c6]Xiaojiang Peng, Limin Wang, Yu Qiao, Qiang Peng:
A Joint Evaluation of Dictionary Learning and Feature Encoding for Action Recognition. ICPR 2014: 2607-2612 - [i1]Xiaojiang Peng, Limin Wang, Xingxing Wang, Yu Qiao:
Bag of Visual Words and Fusion Methods for Action Recognition: Comprehensive Study and Good Practice. CoRR abs/1405.4506 (2014) - 2013
- [c5]Limin Wang, Yu Qiao, Xiaoou Tang:
Motionlets: Mid-level 3D Parts for Human Motion Recognition. CVPR 2013: 2674-2681 - [c4]Limin Wang, Yu Qiao, Xiaoou Tang:
Mining Motion Atoms and Phrases for Complex Action Recognition. ICCV 2013: 2680-2687 - 2012
- [c3]Xingxing Wang, Limin Wang, Yu Qiao:
A Comparative Study of Encoding, Pooling and Normalization Methods for Action Recognition. ACCV (3) 2012: 572-585 - 2011
- [c2]Limin Wang, Yirui Wu, Tong Lu, Kang Chen:
Multiclass object detection by combining local appearances and context. ACM Multimedia 2011: 1161-1164 - 2010
- [c1]Limin Wang, Yirui Wu, Zhiyuan Tian, Zailiang Sun, Tong Lu:
A Novel Approach for Robust Surveillance Video Content Abstraction. PCM (2) 2010: 660-671
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-12-19 23:11 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint