default search action
Visual Intelligence, Volume 2
Volume 2, Number 1, 2024
- Bin Fan, Yuchao Dai, Yongduek Seo, Mingyi He:
A revisit of the normalized eight-point algorithm and a self-supervised deep solution. - Wenqing Zhao, Lijiao Xu:
Weakly supervised target detection based on spatial attention. - Gang Li, Xiang Li, Shanshan Zhang, Jian Yang:
Towards more reliable evaluation in pedestrian detection by rethinking "ignore regions". - Zihao Jia, Shengkun Sun, Guangcan Liu, Bo Liu:
MSSD: multi-scale self-distillation for object detection. - Rui Qian, Weiyao Lin, John See, Dian Li:
Controllable augmentations for video representation learning. - Yaonan Wang:
In Memoriam: Professor Edwin R. Hancock. - Lichun Tang, Zhaoxia Yin, Hang Su, Wanli Lyu, Bin Luo:
WFSS: weighted fusion of spectral transformer and spatial self-attention for robust hyperspectral image classification against adversarial attacks. - Li Fang, Qian Wang, Long Ye:
GLGNet: light field angular superresolution with arbitrary interpolation rates. - Jia-Mu Sun, Tong Wu, Lin Gao:
Recent advances in implicit representation-based 3D shape generation. - Huaizhou Lin, Dan Cai, Zengmin Xu, Jinsong Wu, Lixian Sun, Haibin Jia:
Fabric4show: real-time vision system for fabric defect detection and post-processing. 13 - Yuliang Sun, Xudong Zhang, Yongwei Miao:
A review of point cloud segmentation for understanding 3D indoor scenes. 14 - Zhiqiang Yan, Yupeng Zheng, Deng-Ping Fan, Xiang Li, Jun Li, Jian Yang:
Learnable differencing center for nighttime depth perception. 15 - Chang Liu, Xudong Jiang, Henghui Ding:
PrimitiveNet: decomposing the global constraints for referring segmentation. 16 - Yao Jiang, Xinyu Yan, Ge-Peng Ji, Keren Fu, Meijun Sun, Huan Xiong, Deng-Ping Fan, Fahad Shahbaz Khan:
Effectiveness assessment of recent large vision-language models. 17 - Zelong Zeng, Fan Yang, Hong Liu, Shin'ichi Satoh:
Improving deep metric learning via self-distillation and online batch diffusion process. 18 - Qingjie Zeng, Yutong Xie, Zilin Lu, Yong Xia:
A human-in-the-loop method for pulmonary nodule detection in CT scans. 19 - Lei Cao, Zirui Shen, Sheng Xu:
Efficient forest fire detection based on an improved YOLO model. 20 - Siran Peng, Xiangyu Zhu, Dong Yi, Chen Qian, Zhen Lei:
Formulating facial mesh tracking as a differentiable optimization problem: a backpropagation-based solution. 21 - Fan Yu, Yaqun Fang, Zhixiang Zhao, Jia Bei, Tongwei Ren, Gangshan Wu:
CAGNet: a context-aware graph neural network for detecting social relationships in videos. 22 - Junpei Liao, Liang Yi, Wenxin Shi, Wenyuan Yang, Yanmei Fang, Xin Yang:
Imperceptible backdoor watermarks for speech recognition model copyright protection. 23 - Yichao Yan, Zanwei Zhou, Zi Wang, Jingnan Gao, Xiaokang Yang:
DialogueNeRF: towards realistic avatar face-to-face conversation video generation. 24 - Kaiwen Guo, Chaoyang Zhao, Jinqiao Wang:
A fast mask synthesis method for face recognition. 25 - Zonglin Li, Xiaoqian Lv, Wei Yu, Qinglin Liu, Jingbo Lin, Shengping Zhang:
Face shape transfer via semantic warping. 26 - Megani Rajendran, Chek Tien Tan, Indriyati Atmosukarto, Aik Beng Ng, Simon See:
Review on synergizing the Metaverse and AI-driven synthetic data: enhancing virtual realms and activity recognition in computer vision. 27 - Fangyi Liu, Mang Ye, Bo Du:
Learning a generalizable re-identification model from unlabelled data with domain-agnostic expert. 28 - Yong Li, Menglin Liu, Lingjie Lao, Yuanzhi Wang, Zhen Cui:
Counterfactual discriminative micro-expression recognition. 29 - Xiyao Liu, Jiaxin Hu, Qingying Yang, Ming Jiang, Jianbiao He, Hui Fang:
A divide-and-conquer reconstruction method for defending against adversarial example attacks. 30 - Yuehao Song, Xinggang Wang, Jingfeng Yao, Wenyu Liu, Jinglin Zhang, Xiangmin Xu:
ViTGaze: gaze following with interaction features in vision transformers. 31 - Zhangwei Gao, Zhe Chen, Erfei Cui, Yiming Ren, Weiyun Wang, Jinguo Zhu, Hao Tian, Shenglong Ye, Junjun He, Xizhou Zhu, Lewei Lu, Tong Lu, Yu Qiao, Jifeng Dai, Wenhai Wang:
Mini-InternVL: a flexible-transfer pocket multi-modal model with 5% parameters and 90% performance. 32 - Dehong Kong, Siyuan Liang, Xiaopeng Zhu, Yuansheng Zhong, Wenqi Ren:
Patch is enough: naturalistic adversarial patch against vision-language pre-training models. 33 - Xiaoguang Tu, Zhi He, Yi Huang, Zhi-Hao Zhang, Ming Yang, Jian Zhao:
An overview of large AI models and their applications. 34 - Chang Liu, Yongsheng Yuan, Xin Chen, Huchuan Lu, Dong Wang:
Spatial-temporal initialization dilemma: towards realistic visual tracking. 35 - Wei Huang, Xingyu Zheng, Xudong Ma, Haotong Qin, Chengtao Lv, Hong Chen, Jie Luo, Xiaojuan Qi, Xianglong Liu, Michele Magno:
An empirical study of LLaMA3 quantization: from LLMs to MLLMs. 36
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.