


default search action
IEEE Transactions on Multimedia, Volume 27
Volume 27, 2025
- Yuan Yuan
, Hongjie He
, Yaolin Yang
, Hadi Amirpour
, Christian Timmerer
, Fan Chen
:
JPEG Image Encryption With DC Rotation and Undivided RSV-Based AC Group Permutation. 1-15 - Dizhan Xue
, Shengsheng Qian
, Quan Fang
, Changsheng Xu
:
LININ: Logic Integrated Neural Inference Network for Explanatory Visual Question Answering. 16-27 - Pingping Zhang, Shiqi Wang
, Meng Wang
, Peilin Chen
, Wenhui Wu
, Xu Wang
, Sam Kwong
:
HNR-ISC: Hybrid Neural Representation for Image Set Compression. 28-40 - Qingxin Sheng, Chong Fu
, Zhaonan Lin, Junxin Chen
, Xingwei Wang
, Chiu-Wing Sham
:
Content-Aware Tunable Selective Encryption for HEVC Using Sine-Modular Chaotification Model. 41-55 - Qiguang Miao, Wentian Xin
, Ruyi Liu, Yi Liu, Mengyao Wu, Cheng Shi
, Chi-Man Pun
:
Adaptive Pitfall: Exploring the Effectiveness of Adaptation in Skeleton-Based Action Recognition. 56-71 - Shizhou Zhang
, Dexuan Kong
, Yinghui Xing
, Yue Lu
, Lingyan Ran
, Guoqiang Liang
, Hexu Wang, Yanning Zhang
:
Frequency-Guided Spatial Adaptation for Camouflaged Object Detection. 72-83 - Yu Wang
, Shengjie Zhao
, Shiwei Chen
:
SQL-Net: Semantic Query Learning for Point-Supervised Temporal Action Localization. 84-94 - Kefan Tang
, Lihuo He
, Nannan Wang
, Xinbo Gao
:
Dual Semantic Reconstruction Network for Weakly Supervised Temporal Sentence Grounding. 95-107 - Yiting Liu
, Liang Li
, Yunbin Tu
, Beichen Zhang
, Zheng-Jun Zha
, Qingming Huang
:
Dynamic Strategy Prompt Reasoning for Emotional Support Conversation. 108-119 - Yunlong Tang, Yuxuan Wan, Lei Qi
, Xin Geng
:
DPStyler: Dynamic PromptStyler for Source-Free Domain Generalization. 120-132 - Zhenyu Shu
, Shiyang Li
, Shiqing Xin
, Ligang Liu
:
3D Shape Segmentation With Potential Consistency Mining and Enhancement. 133-144 - Min Dang
, Gang Liu
, Hao Li
, Di Wang
, Rong Pan
, Quan Wang
:
PRA-Det: Anchor-Free Oriented Object Detection With Polar Radius Representation. 145-157 - Yizhen Jia
, Rong Quan
, Haiyan Chen
, Jiamei Liu, Yichao Yan
, Song Bai
, Jie Qin
:
Disaggregation Distillation for Person Search. 158-170 - Shiqi Gao
, Huiyu Duan
, Xinyue Li
, Kang Fu
, Yicong Peng, Qihang Xu, Yuanyuan Chang, Jia Wang
, Xiongkuo Min
, Guangtao Zhai
:
Quality-Guided Skin Tone Enhancement for Portrait Photography. 171-185 - Yue Dai
, Shihui Ying
, Yue Gao
:
Exploring Local and Global Consistent Correlation on Hypergraph for Rotation Invariant Point Cloud Analysis. 186-197 - Hao Tan
, Zichang Tan
, Dunfang Weng
, Ajian Liu
, Jun Wan
, Zhen Lei
, Stan Z. Li
:
Vision Transformer With Relation Exploration for Pedestrian Attribute Recognition. 198-208 - Zhaofeng Shi
, Qingbo Wu
, Fanman Meng
, Linfeng Xu
, Hongliang Li
:
Cross-Modal Cognitive Consensus Guided Audio-Visual Segmentation. 209-223 - Ge Li
, Jiale Cao
, Hanqing Sun
, Rao Muhammad Anwer
, Jin Xie
, Fahad Khan
, Yanwei Pang
:
Video Instance Segmentation Without Using Mask and Identity Supervision. 224-235 - Guanghui Yue
, Shangjie Wu
, Tianwei Zhou
, Gang Li
, Jie Du
, Yu Luo
, Qiuping Jiang
:
Progressive Region-to-Boundary Exploration Network for Camouflaged Object Detection. 236-248 - Yumo Zhang
, Zhanchuan Cai
:
DNP-AUT: Image Compression Using Double-Layer Non-Uniform Partition and Adaptive U Transform. 249-262 - Sijia Wen
, Yinqiang Zheng
, Feng Lu
:
Polarization State Attention Dehazing Network With a Simulated Polar-Haze Dataset. 263-274 - Jiapeng Li
, Ruonan Zhang
, Ge Li
, Thomas H. Li:
SDE2D: Semantic-Guided Discriminability Enhancement Feature Detector and Descriptor. 275-286 - Xu Han
, Junyu Gao
, Chuang Yang
, Yuan Yuan, Qi Wang
:
Focus Entirety and Perceive Environment for Arbitrary-Shaped Text Detection. 287-299 - Kai Hu
, Xiaobo Chen
, Zhineng Chen
, Yuan Zhang
, Xieping Gao
:
Multi-Perspective Pseudo-Label Generation and Confidence-Weighted Training for Semi-Supervised Semantic Segmentation. 300-311 - Xinru Guo
, Huaxiang Zhang
, Li Liu
, Dongmei Liu
, Xu Lu
, Hui Meng
:
Primary Code Guided Targeted Attack against Cross-modal Hashing Retrieval. 312-326 - Shichao Zhang
, Yibo Ding
, Tianxiang Huo
, Shukai Duan
, Lidan Wang
:
PointAttention: Rethinking Feature Representation and Propagation in Point Cloud. 327-339 - Mengzan Qi
, Sixian Chan
, Chen Hang
, Guixu Zhang
, Tieyong Zeng
, Zhi Li
:
Auxiliary Representation Guided Network for Visible-Infrared Person Re-Identification. 340-355 - Li Huang
, Yaping Huang
, Qingji Guan
:
Improving Image Inpainting via Adversarial Collaborative Training. 356-370 - Lin Jiang
, Jigang Wu
, Shuping Zhao
, Jiaxing Li
:
Cross-Scatter Sparse Dictionary Pair Learning for Cross-Domain Classification. 371-384 - Yusra Alkendi
, Rana Azzam
, Sajid Javed
, Lakmal D. Seneviratne
, Yahya H. Zweiri
:
Neuromorphic Vision-Based Motion Segmentation With Graph Transformer Neural Network. 385-400 - Guangzhao Dai
, Xiangbo Shu
, Wenhao Wu, Rui Yan
, Jiachao Zhang
:
GPT4Ego: Unleashing the Potential of Pre-Trained Models for Zero-Shot Egocentric Action Recognition. 401-413 - Nan Wang
, Shaohui Mei
, Yi Wang
, Yifan Zhang
, Duo Zhan
:
WHANet:Wavelet-Based Hybrid Asymmetric Network for Spectral Super-Resolution From RGB Inputs. 414-428 - Haojin Deng
, Yimin Yang
:
Context-Enriched Contrastive Loss: Enhancing Presentation of Inherent Sample Connections in Contrastive Learning Framework. 429-441 - Jingyi Xu, Xin Deng
, Yibing Fu, Mai Xu
, Shengxi Li
:
MDSC-Net: Multi-Modal Discriminative Sparse Coding Driven RGB-D Classification Network. 442-454 - Chen Guo
, Weiling Chen
, Aiping Huang
, Tiesong Zhao
:
Prototype Alignment With Dedicated Experts for Test-Agnostic Long-Tailed Recognition. 455-465 - Hefeng Wang
, Jiale Cao
, Jin Xie
, Aiping Yang, Yanwei Pang
:
Implicit and Explicit Language Guidance for Diffusion-Based Visual Perception. 466-476 - Meijing Zhang, Mengxue Chen, Qi Li
, Yanchen Chen, Rui Lin, Xiaolian Li, Shengfeng He
, Wenxi Liu
:
Category-Contrastive Fine-Grained Crowd Counting and Beyond. 477-488 - Kaiwei Zhang
, Dandan Zhu
, Xiongkuo Min
, Huiyu Duan
, Guangtao Zhai
:
Explain Vision Focus: Blending Human Saliency Into Synthetic Face Images. 489-502 - Shaowei Weng
, Jianhao Zhang, Tanguo Zhu, Lifang Yu
, Chunyu Zhang
:
DCM-Net: A Diffusion Model-Based Detection Network Integrating the Characteristics of Copy-Move Forgery. 503-514 - Meng Yang
, Jun Chen
, Xin Tian
, Longsheng Wei
, Jiayi Ma
:
VRTNet: Vector Rectifier Transformer for Two-View Correspondence Learning. 515-530 - Kai Ye, Zepeng Huang
, Yilei Xiong, Yu Gao, Jinheng Xie, Linlin Shen
:
Progressive Pseudo Labeling for Multi-Dataset Detection Over Unified Label Space. 531-543 - Yuxiu Lin
, Hui Liu
, Ren Wang, Qiang Guo
, Caiming Zhang
:
Multiview Feature Decoupling for Deep Subspace Clustering. 544-556 - Lili Huang
, Yiming Cao, Pengcheng Jia, Chenglong Li
, Jin Tang
, Chuanfu Li:
Knowledge-Guided Cross-Modal Alignment and Progressive Fusion for Chest X-Ray Report Generation. 557-567 - Min Liu
, Zhu Zhang
, Yuan Bian
, Xueping Wang
, Yeqing Sun
, Baida Zhang, Yaonan Wang
:
Cross-Modality Semantic Consistency Learning for Visible-Infrared Person Re-Identification. 568-580 - Ben Fei
, Liwen Liu
, Tianyue Luo
, Weidong Yang
, Lipeng Ma
, Zhijun Li
, Wenming Chen
:
Point Patches Contrastive Learning for Enhanced Point Cloud Completion. 581-596 - Shunjie Yuan
, Xinghua Li
, Yinbin Miao
, Haiyan Zhang
, Ximeng Liu
, Robert H. Deng
:
Combating Noisy Labels by Alleviating the Memorization of DNNs to Noisy Labels. 597-609 - Jiaping Yu, Muli Yang
, Aming Wu
, Cheng Deng
:
Memory-Enhanced Confidence Calibration for Class-Incremental Unsupervised Domain Adaptation. 610-621 - Yi Jin
, Xiaoxiao Ma
, Rui Zhang
, Huaian Chen
, Yuxuan Gu
, Pengyang Ling
, Enhong Chen
:
Masked Video Pretraining Advances Real-World Video Denoising. 622-636 - Kun Dai
, Zhiqiang Jiang
, Tao Xie
, Ke Wang
, Dedong Liu
, Zhendong Fan
, Ruifeng Li
, Lijun Zhao
, Mohamed Omar
:
SOFW: A Synergistic Optimization Framework for Indoor 3D Object Detection. 637-651 - Abdullah Aman Khan
, Jie Shao
, Yunbo Rao
, Lei She, Heng Tao Shen
:
LRDNet: Lightweight LiDAR Aided Cascaded Feature Pools for Free Road Space Detection. 652-664 - Shuhua Wang
, Ke Lv
, Jian Xue
, Yang Zhao
:
DA-Net: Density-Aware 3D Object Detection Network for Point Clouds. 665-678 - Congcong Wen
, Xiang Li, Hao Huang, Yu-Shen Liu
, Yi Fang
:
3D Shape Contrastive Representation Learning With Adversarial Examples. 679-692 - Dong Liang, Dong Zhang, Qiong Wang, Zongqi Wei, Liyan Zhang:
CrossNet: Cross-Scene Background Subtraction Network via 3D Optical Flow. 693-706 - Zhanwen Liu
, Juanru Cheng
, Jin Fan
, Shan Lin
, Yang Wang
, Xiangmo Zhao
:
Multi-Modal Fusion Based on Depth Adaptive Mechanism for 3D Object Detection. 707-717 - Hui Tian
, Zheng Qin, Renjiao Yi, Chenyang Zhu, Kai Xu
:
Tensorformer: Normalized Matrix Attention Transformer for High-Quality Point Cloud Reconstruction. 718-730 - Mingtao Feng
, Haoran Hou
, Liang Zhang
, Yulan Guo
, Hongshan Yu
, Yaonan Wang
, Ajmal Mian
:
Exploring Hierarchical Spatial Layout Cues for 3D Point Cloud Based Scene Graph Prediction. 731-743 - Qiaoyun Wu
, Jun Wang
, Yi Zhang, Hua Dong, Cheng Yi
:
Accelerating Point Cloud Registration With Low Overlap Using Graphs and Sparse Convolutions. 744-753 - Qijian Zhang
, Junhui Hou
, Yue Qian:
PointMCD: Boosting Deep Point Cloud Encoders via Multi-View Cross-Modal Distillation for 3D Shape Recognition. 754-767 - Shuaihang Yuan
, Congcong Wen
, Yu-Shen Liu
, Yi Fang
:
Retrieval-Specific View Learning for Sketch-to-Shape Retrieval. 768-779 - Jing-Yu Yang
, Wenqiang Xu
, Yusen Hou, Xinchen Ye
, Pascal Frossard
, Kun Li
:
High-Quality Reconstruction of Depth Maps From Graph-Based Non-Uniform Sampling. 780-791 - Shaojie Zhuang
, Guangshun Wei
, Zhiming Cui, Yuanfeng Zhou
:
Robust Hybrid Learning for Automatic Teeth Segmentation and Labeling on 3D Dental Models. 792-803 - Jiawen Zhao
, Qing Zhu
, Yaonan Wang
, Weixing Peng
, Hui Zhang, Jianxu Mao
:
Registration of Multiview Point Clouds With Unknown Overlap. 804-819 - Jincen Jiang
, Xuequan Lu
, Lizhi Zhao
, Richard Dazeley
, Meili Wang
:
Masked Autoencoders in 3D Point Cloud Representation Learning. 820-831 - Xu Wang
, Yi Jin
, Yigang Cen
, Tao Wang
, Bowen Tang
, Yidong Li
:
LighTN: Light-Weight Transformer Network for Performance-Overhead Tradeoff in Point Cloud Downsampling. 832-847 - Shuangzhi Li
, Zhijie Wang
, Felix Juefei-Xu
, Qing Guo
, Xingyu Li
, Lei Ma
:
Common Corruption Robustness of Point Cloud Detectors: Benchmark and Enhancement. 848-859 - Shanshan Li
, Pan Gao
, Xiaoyang Tan
, Wei Xiang
:
RLGrid: Reinforcement Learning Controlled Grid Deformation for Coarse-to-Fine Point Cloud Completion. 860-874 - Xianglin Guo
, Yifan Wang
, Heng Liu
, Haoran Xie
, Gary Cheng
, Fu Lee Wang
:
Steerable Graph Neural Network on Point Clouds via Second-Order Random Walks. 875-888 - Junteng Zhang
, Jianqiang Wang
, Dandan Ding
, Zhan Ma
:
Scalable Point Cloud Attribute Compression. 889-899 - Wenting Cui
, Shaoyi Du
, Runzhao Yao
, Canhui Tang
, Aixue Ye
, Feng Wen
, Zhiqiang Tian
:
RDD: Learning Reinforced 3D Detectors and Descriptors Based on Policy Gradient. 900-913 - André F. R. Guarda
, Manuel Ruivo
, Luís Coelho
, Abdelrahman Seleem
, Nuno M. M. Rodrigues
, Fernando Pereira
:
Deep Learning-Based Point Cloud Coding and Super-Resolution: A Joint Geometry and Color Approach. 914-926 - Zicheng Zhang
, Wei Sun
, Yucheng Zhu
, Xiongkuo Min
, Wei Wu
, Ying Chen
, Guangtao Zhai
:
Evaluating Point Cloud From Moving Camera Videos: A No-Reference Metric. 927-939 - Lintai Wu
, Qijian Zhang
, Junhui Hou
, Yong Xu
:
Leveraging Single-View Images for Unsupervised 3D Point Cloud Completion. 940-953 - Xin Kang
, Chaoqun Wang
, Xuejin Chen
:
Region-Enhanced Feature Learning for Scene Semantic Segmentation. 954-964 - Weiquan Liu
, Minghao Liu, Shijun Zheng
, Siqi Shen
, Xuesheng Bian
, Yu Zang, Ping Zhong
, Cheng Wang
:
Interpreting Hidden Semantics in the Intermediate Layers of 3D Point Cloud Classification Neural Network. 965-977 - Elena Camuffo
, Umberto Michieli
, Simone Milani
:
Learning From Mistakes: Self-Regularizing Hierarchical Representations in Point Cloud Semantic Segmentation. 978-989 - Xiantong Zhao
, Yinan Han
, Shengjing Tian
, Jian Liu
, Xiuping Liu
:
OST: Efficient One-Stream Network for 3D Single Object Tracking in Point Clouds. 990-1002 - Shuai Guo
, Lei Shi
, Xiaoheng Jiang
, Pei Lv
, Qidong Liu
, Yazhou Hu
, Rongrong Ji
, Mingliang Xu
:
An Efficient Ungrouped Mask Method With two Learnable Parameters for 3D Object Detection. 1003-1017 - Yuan Liang
, Zitian Zhang
, Chuhua Xian
, Shengfeng He
:
Delving Into Multi-Illumination Monocular Depth Estimation: A New Dataset and Method. 1018-1032 - Yanyang Xiao
, Tieyi Zhang
, Juan Cao
, Zhonggui Chen
:
Accelerated Lloyd's Method for Resampling 3D Point Clouds. 1033-1046 - Qing Guo
, Zhijie Wang
, Lubo Wang, Haotian Dong, Felix Juefei-Xu
, Di Lin
, Lei Ma
, Wei Feng
, Yang Liu
:
CarveNet: Carving Point-Block for Complex 3D Shape Completion. 1047-1058 - Jingtao Sun
, Yaonan Wang
, Mingtao Feng
, Xiaofeng Guo
, Huimin Lu
, Xieyuanli Chen
:
Category-Level Multi-Object 9D State Tracking Using Object-Centric Multi-Scale Transformer in Point Cloud Stream. 1072-1085 - Xingyu Gao
, Zhenyu Chen
, Jianze Wei
, Rubo Wang, Zhijun Zhao:
Deep Mutual Distillation for Unsupervised Domain Adaptation Person Re-Identification. 1059-1071 - Yuanpeng Zeng, Ru Zhang, Hao Zhang
, Shaojie Qiao
, Faliang Huang
, Qing Tian
, Yuzhong Peng
:
GCCNet: A Novel Network Leveraging Gated Cross-Correlation for Multi-View Classification. 1086-1099 - Liangchen Liu
, Nannan Wang
, Dawei Zhou
, Decheng Liu
, Xi Yang
, Xinbo Gao
, Tongliang Liu
:
Generalizable Prompt Learning via Gradient Constrained Sharpness-Aware Minimization. 1100-1113 - Liangwei Chen
, Xiren Zhou
, Qiuju Chen, Fang Xiong
, Huanhuan Chen
:
Investigating the Effective Dynamic Information of Spectral Shapes for Audio Classification. 1114-1126 - Abdullah Aman Khan
, Jie Shao
, Sidra Shafiq, Shuyuan Zhu
, Heng Tao Shen
:
Enhancing Few-Shot 3D Point Cloud Classification With Soft Interaction and Self-Attention. 1127-1141 - Guanglin Zhou
, Zhongyi Han
, Shiming Chen
, Biwei Huang, Liming Zhu
, Tongliang Liu
, Lina Yao
, Kun Zhang
:
HCVP: Leveraging Hierarchical Contrastive Visual Prompt for Domain Generalization. 1142-1152 - Cairong Zhao
, Rui Shu
, Shuyang Feng
, Liang Zhu
, Xuekuan Wang:
Scene Text Image Super-Resolution Via Semantic Distillation and Text Perceptual Loss. 1153-1164 - Yuhui Quan
, Xi Wan, Tianxiang Zheng, Yan Huang
, Hui Ji
:
Dual-Path Deep Unsupervised Learning for Multi-Focus Image Fusion. 1165-1176 - Zihan Gao
, Lingling Li
, Xu Liu
, Licheng Jiao
, Fang Liu
, Shuyuan Yang
:
Uncertainty Guided Progressive Few-Shot Learning Perception for Aerial View Synthesis. 1177-1192 - Lingtong Min
, Ziman Fan, Shunzhou Wang
, Feiyang Dou, Xin Li
, Binglu Wang
:
Adaptive Fusion Learning for Compositional Zero-Shot Recognition. 1193-1204 - Jian Yang
, Jun Li
, Yunong Cai, Guoming Wu
, Zhi-Ping Shi
, Chaodong Tan, Xianglong Liu
:
Hard-Sample Style Guided Patch Attack With RL-Enhanced Motion Pattern for Video Recognition. 1205-1215 - Gaosheng Liu
, Huanjing Yue
, Bihan Wen
, Jing-Yu Yang
:
Learned Focused Plenoptic Image Compression With Local-Global Correlation Learning. 1216-1227 - Jingyun Tian
, Jinjing Gu
, Yuanyuan Pu
, Zhengpeng Zhao
:
Leveraging Enriched Skeleton Representation With Multi-Relational Metrics for Few-Shot Action Recognition. 1228-1241 - Shaocan Liu
, Xingtao Wang
, Ruiqin Xiong
, Xiaopeng Fan
:
GCN-Based Multi-Modality Fusion Network for Action Recognition. 1242-1253 - Deng Xu
, Chao Zhang
, Zechao Li
, Chunlin Chen
, Huaxiong Li
:
Fast Disentangled Slim Tensor Learning for Multi-View Clustering. 1254-1265 - Tae-Young Kim, Jufeng Yang
, Eunil Park
:
MSDLF-K: A Multimodal Feature Learning Approach for Sentiment Analysis in Korean Incorporating Text and Speech. 1266-1276 - Lei Zhao
, Bo Li
, Jixiang Jiang, Xingxing Wei
:
Classification Committee for Active Deep Object Detection. 1277-1288 - Lingzhi Zhao
, Ying Cui
, Yuhang Jia, Yunfei Zhang
, Klara Nahrstedt
:
Enhancing Neural Adaptive Wireless Video Streaming via Cross-Layer Information Exposure and Online Tuning. 1289-1304 - Wenyang Liu
, Kejun Wu
, Tianyi Liu
, Yi Wang
, Kim-Hui Yap
, Lap-Pui Chau
:
ByteNet: Rethinking Multimedia File Fragment Classification Through Visual Perspectives. 1305-1319 - Weikang Wang
, Yuting Su, Jing Liu
, Wei Sun
, Guangtao Zhai
:
Weakly Supervised Referring Video Object Segmentation With Object-Centric Pseudo-Guidance. 1320-1333 - Zeke Zexi Hu
, Xiaoming Chen
, Vera Yuk Ying Chung
, Yiran Shen
:
Beyond Subspace Isolation: Many-to-Many Transformer for Light Field Image Super-Resolution. 1334-1348 - Yu Jiang
, Yuehang Wang
, Siqi Li
, Yongji Zhang
, Qianren Guo
, Qi Chu
, Yue Gao
:
EvCSLR: Event-Guided Continuous Sign Language Recognition and Benchmark. 1349-1361 - Bingzheng Liu
, Jianjun Lei
, Bo Peng
, Zhe Zhang
, Jie Zhu
, Qingming Huang
:
Advancing Generalizable Occlusion Modeling for Neural Human Radiance Field. 1362-1373 - Rui Tian
, Zuxuan Wu
, Qi Dai
, Micah Goldblum, Han Hu
, Yu-Gang Jiang
:
The Role of ViT Design and Training in Robustness to Common Corruptions. 1374-1385 - Yalan Qin
, Nan Pu
, Hanzhou Wu
, Nicu Sebe
:
Discriminative Anchor Learning for Efficient Multi-View Clustering. 1386-1396 - Jingyao Wang
, Luntian Mou
, Changwen Zheng
, Wen Gao:
Image-Based Freeform Handwriting Authentication With Energy-Oriented Self-Supervised Learning. 1397-1409 - Dong Chen
, Kaihang Pan, Guangyu Dai, Guoming Wang
, Yueting Zhuang
, Siliang Tang
, Mingliang Xu
:
Improving Vision Anomaly Detection With the Guidance of Language Modality. 1410-1419 - Li Wang
, Yunzhou Zhang
, Fawei Ge
, Wenjing Bai
, Yifan Wang
:
Learning Local Features by Reinforcing Spatial Structure Information. 1420-1431 - Feiwei Qin
, Gaoyang Zhan, Meie Fang
, C. L. Philip Chen
, Ping Li
:
VGNet: Multimodal Feature Extraction and Fusion Network for 3D CAD Model Retrieval. 1432-1447 - Chuanming Wang
, Huiyuan Fu
, Peiye Liu, Huadong Ma
:
Part-Level Relationship Learning for Fine-Grained Few-Shot Image Classification. 1448-1460 - Jinpu Zhang
, Ziwen Li
, Ruonan Wei
, Yuehuan Wang
:
Augment One With Others: Generalizing to Unforeseen Variations for Visual Tracking. 1461-1474 - Chunlei Peng
, Boyu Wang, Decheng Liu
, Nannan Wang
, Ruimin Hu, Xinbo Gao
:
Masked Attribute Description Embedding for Cloth-Changing Person Re-Identification. 1475-1485 - Xingfeng Li
, Yuangang Pan
, Yuan Sun
, Quansen Sun
, Yinghui Sun
, Ivor W. Tsang
, Zhenwen Ren
:
Incomplete Multi-View Clustering With Paired and Balanced Dynamic Anchor Learning. 1486-1497 - Guanghui Wu
, Lili Chen, Zengping Chen
:
Uni-DPM: Unifying Self-Supervised Monocular Depth, Pose, and Object Motion Estimation With a Shared Representation. 1498-1511 - Yi Liu
, Qiuping Jiang
, Xinyi Wang, Ting Luo
, Jingchun Zhou
:
Underwater Image Enhancement With Cascaded Contrastive Learning. 1512-1525 - Md. Moniruzzaman
, Zhaozheng Yin
:
Progressive Knowledge Distillation From Different Levels of Teachers for Online Action Detection. 1526-1537 - Mingze Yao
, Huibing Wang
, Yawei Chen
, Xianping Fu
:
Between/Within View Information Completing for Tensorial Incomplete Multi-View Clustering. 1538-1550 - Dongqing Wu
, Huihui Li
, Cang Gu
, Lei Guo, Hang Liu
:
Dual Stream Relation Learning Network for Image-Text Retrieval. 1551-1565 - Hailong Ma
, Sibo Feng
, Xi Xiao
, Chenyu Dong, Xingyue Cheng:
Image Shooting Parameter-Guided Cascade Image Retouching Network: Think Like an Artist. 1566-1573 - Song Chang
, Youfang Lin
, Shuo Zhang
:
Structure-Aware Pre-Selected Neural Rendering for Light Field Reconstruction. 1574-1587 - Jianxin Shi
, Miao Zhang
, Linfeng Shen
, Jiangchuan Liu
, Lingjun Pu
, Jingdong Xu
:
Towards Neural Codec-Empowered 360$^\circ$ Video Streaming: A Saliency-Aided Synergistic Approach. 1588-1600 - Xiating Jin
, Jiajun Bu
, Zhi Yu
, Hui Zhang
, Yaonan Wang
:
Federated Hallucination Translation and Source-Free Regularization Adaptation in Decentralized Domain Adaptation for Foggy Scene Understanding. 1601-1616 - Mehwish Ghafoor, Arif Mahmood
, Muhammad Bilal
:
Enhancing 3D Human Pose Estimation Amidst Severe Occlusion With Dual Transformer Fusion. 1617-1624 - Wei Gao, Jintian Feng, Mengqi Wei, Rui Zou, Jianwen Sun:
Towards a Multi-Granulated Statistical Framework for Human-Machine Collaboration in Image Classification. 1625-1636 - Shishun Tian, Tiantian Zeng, Zhengyu Zhang, Wenbin Zou, Xia Li:
Dual Residual-Guided Interactive Learning for the Quality Assessment of Enhanced Images. 1637-1651 - Weida Chen, Jie Jiang, Linfei Wang, Huafeng Li, Yibing Zhan, Dapeng Tao:
Cps-STS: Bridging the Gap Between Content and Position for Coarse-Point-Supervised Scene Text Spotter. 1652-1664 - Zhongwei Shen, Xiaojun Wu, Hui Li, Tianyang Xu, Cong Wu:
I Know How You Move: Explicit Motion Estimation for Human Action Recognition. 1665-1676 - Hai Liu, Cheng Zhang, Yongjian Deng, Bochen Xie, Tingting Liu, Youfu Li:
TransIFC: Invariant Cues-Aware Feature Concentration Learning for Efficient Fine-Grained Bird Image Classification. 1677-1690 - Quanquan Xiao, Haiyan Jin, Haonan Su, Yuanlin Zhang, Zhaolin Xiao, Bin Wang:
SPDFusion:A Semantic Prior Knowledge-Driven Method for Infrared and Visible Image Fusion. 1691-1705 - Renjie Zhang, Di Lin, Xin Wang, George Baciu, C. L. Philip Chen, Ping Li:
Accurate-PGNet: Learning to Assemble Perceptual Body Parts for Accurate Human Skeleton Establishment. 1706-1721 - Ke Liang, Lingyuan Meng, Hao Li, Meng Liu, Siwei Wang, Sihang Zhou, Xinwang Liu, Kunlun He:
MGKsite: Multi-Modal Knowledge-Driven Site Selection via Intra and Inter-Modal Graph Fusion. 1722-1735 - Haoran Li, Yulan Guo, Jiali You, Xiaojian You, Zhenwen Ren:
Graph Proxy Fusion: Consensus Graph Intermediated Multi-View Local Information Fusion Clustering. 1736-1747 - Mina Han, Kailong Yu, Weiran Li, Qiannan Guo, Zhenbo Li:
Colliding Depths and Fusion: Leveraging Adaptive Feature Maps and Restorable Depth Recharge for Infrared and Visible Scene Fusion. 1748-1759 - Yijun Chen, Xianwei Zheng, Zhulun Yang, Xutao Li, Jiantao Zhou, Yuanman Li:
DuPMAM: An Efficient Dual Perception Framework Equipped With a Sharp Testing Strategy for Point Cloud Analysis. 1760-1771 - Guozhang Li, Xinpeng Ding, De Cheng, Jie Li, Nannan Wang, Xinbo Gao:
ETC: Temporal Boundary Expand Then Clarify for Weakly Supervised Video Grounding With Multimodal Large Language Model. 1772-1782 - Yi Xiao, Qiangqiang Yuan, Kui Jiang, Yuzeng Chen, Qiang Zhang, Chia-Wen Lin:
Frequency-Assisted Mamba for Remote Sensing Image Super-Resolution. 1783-1796 - Shuo Wang, Xinyu Zhang, Meng Wang, Xiangnan He:
Symmetric Hallucination With Knowledge Transfer for Few-Shot Learning. 1797-1807 - Yu Luo, Xuanrong Chen, Jie Ling, Chao Huang, Wei Zhou, Guanghui Yue:
Unsupervised Low-Light Image Enhancement With Self-Paced Learning. 1808-1820 - Xiaoyang Hao, Han Li, Jing Sun, Lei Wang, Jianping Fan:
A Twist Representation and Shape Refinement Method for Human Mesh Recovery. 1821-1834 - Yidi Li, Hong Liu, Bing Yang:
STNet: Deep Audio-Visual Fusion Network for Robust Speaker Tracking. 1835-1847 - Hua Yu, Yaqing Hou, Wenbin Pei, Yew-Soon Ong, Qiang Zhang:
DivDiff: A Conditional Diffusion Model for Diverse Human Motion Prediction. 1848-1859 - Leiyu Xie, Yuxing Yang, Zeyu Fu, Syed Mohsen Naqvi:
Position and Orientation Aware One-Shot Learning for Medical Action Recognition From Signal Data. 1860-1873 - Yanan Zhu, Jiaqiu Ai, Le Wu, Dan Guo, Wei Jia, Richang Hong:
An Active Multi-Target Domain Adaptation Strategy: Progressive Class Prototype Rectification. 1874-1886 - Jie Zhang, Kangneng Zhou, Yan Luximon, Tong-Yee Lee, Ping Li:
3DCMM: 3D Comprehensive Morphable Models With UV-UNet for Accurate Head Creation. 1887-1900 - Sheng Zheng, Chaoning Zhang, Xinhong Hao:
Black-Box Targeted Adversarial Attack on Segment Anything (SAM). 1901-1913 - Hao Feng, Wendi Wang, Shaokai Liu, Jiajun Deng, Wengang Zhou, Houqiang Li:
DeepEraser: Deep Iterative Context Mining for Generic Text Eraser. 1914-1925 - Bo Ding, Libao Zhang, Hongbo Sun, Yongjun He, Jian Qin:
Semantic-Enhanced ULIP for Zero-Shot 3D Shape Recognition. 1926-1936 - Xu Han, Junyu Gao, Chuang Yang, Yuan Yuan, Qi Wang:
Spotlight Text Detector: Spotlight on Candidate Regions Like a Camera. 1937-1949 - Jingcheng Ke, Dele Wang, Jun-Cheng Chen, I-Hong Jhuo, Chia-Wen Lin, Yen-Yu Lin:
Make Graph-Based Referring Expression Comprehension Great Again Through Expression-Guided Dynamic Gating and Regression. 1950-1961 - Zhiqiang Fu, Yao Zhao, Dongxia Chang, Yiming Wang, Jie Wen:
Reordered $k$-Means: A New Baseline for View-Unaligned Multi-View Clustering. 1962-1972 - Huafeng Li, Shedan Yang, Yafei Zhang, Dapeng Tao, Zhengtao Yu:
Progressive Feature Mining and External Knowledge-Assisted Text-Pedestrian Image Retrieval. 1973-1987 - Zhengyi Liu, Sheng Deng, Xinrui Wang, Linbo Wang, Xianyong Fang, Bin Tang:
SSFam: Scribble Supervised Salient Object Detection Family. 1988-2000 - Hao Luo, Baoliang Chen, Lingyu Zhu, Peilin Chen, Shiqi Wang:
RCNet: Deep Recurrent Collaborative Network for Multi-View Low-Light Image Enhancement. 2001-2014 - Xu Cheng, Hao Yu, Kevin Ho Man Cheng, Zitong Yu, Guoying Zhao:
MDANet: Modality-Aware Domain Alignment Network for Visible-Infrared Person Re-Identification. 2015-2027 - Ying Fu, Xinyu Zhu, Xiaojie Li, Xin Wang, Xi Wu, Shu Hu, Yi Wu, Siwei Lyu, Wei Liu:
VB-KGN: Variational Bayesian Kernel Generation Networks for Motion Image Deblurring. 2028-2042 - Yiting Lu, Xin Li, Jianzhao Liu, Zhibo Chen:
StyleAM: Perception-Oriented Unsupervised Domain Adaption for No-Reference Image Quality Assessment. 2043-2058 - Wenhao Xu, Changwei Wang, Rongtao Xu, Shibiao Xu, Weiliang Meng, Man Zhang, Xiaopeng Zhang:
Token Masking Transformer for Weakly Supervised Object Localization. 2059-2069 - Rongqun Lin, Wenhan Yang, Baoliang Chen, Pingping Zhang, Yue Liu, Shiqi Wang, Sam Kwong:
HFGlobalFormer: When High-Frequency Recovery Meets Global Context Modeling for Compressed Image Deraindrop. 2070-2082 - Zhen Lan, Zixing Li, Chao Yan, Xiaojia Xiang, Dengqing Tang, Han Zhou, Jun Lai:
Adaptive Knowledge Distillation With Attention-Based Multi-Modal Fusion for Robust Dim Object Detection. 2083-2096 - Kai Zhang, Ludan Sun, Jun Yan, Wenbo Wan, Jiande Sun, Shuyuan Yang, Huaxiang Zhang:
Texture-Content Dual Guided Network for Visible and Infrared Image Fusion. 2097-2111 - Gang Hu, Yafei Lv, Jianting Zhang, Qian Wu, Zaidao Wen:
CLIP-Based Modality Compensation for Visible-Infrared Image Re-Identification. 2112-2126 - Bowen Shi, Xiaopeng Zhang, Yaoming Wang, Wenrui Dai, Junni Zou, Hongkai Xiong:
MENSA: Multi-Dataset Harmonized Pretraining for Semantic Segmentation. 2127-2140 - Shaowei Wang, Lingling Zhang, Wenjun Wu, Tao Qin, Xinyu Zhang, Jun Liu:
Alignment-Guided Self-Supervised Learning for Diagram Question Answering. 2141-2154 - Fan Nie, Jiangqun Ni, Jian Zhang, Bin Zhang, Weizhe Zhang:
DIP: Diffusion Learning of Inconsistency Pattern for General DeepFake Detection. 2155-2167 - Xingjian He, Sihan Chen, Fan Ma, Zhicheng Huang, Xiaojie Jin, Zikang Liu, Dongmei Fu, Yi Yang, Jing Liu, Jiashi Feng:
VLAB: Enhancing Video Language Pretraining by Feature Adapting and Blending. 2168-2180

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.