


IEEE Transactions on Pattern Analysis and Machine Intelligence, Volume 47
Volume 47, Number 1, January 2025
- Longguang Wang
, Yulan Guo
, Yingqian Wang
, Xiaoyu Dong
, Qingyu Xu
, Jun-Gang Yang
, Wei An
:
Unsupervised Degradation Representation Learning for Unpaired Restoration of Images and Point Clouds. 1-18 - Devanshu Arya
, Deepak K. Gupta, Stevan Rudinac
, Marcel Worring
:
Adaptive Neural Message Passing for Inductive Learning on Hypergraphs. 19-31 - Yunhao Zou
, Ying Fu
, Tsuyoshi Takatani
, Yinqiang Zheng
:
EventHDR: From Event to High-Speed HDR Videos and Beyond. 32-50 - Man Liu
, Huihui Bai
, Feng Li
, Chunjie Zhang
, Yunchao Wei
, Meng Wang, Tat-Seng Chua
, Yao Zhao
:
PSVMA+: Exploring Multi-Granularity Semantic-Visual Adaption for Generalized Zero-Shot Learning. 51-66 - Hao Zhang
, Chenglin Li
, Wenrui Dai
, Ziyang Zheng
, Junni Zou
, Hongkai Xiong
:
Stabilizing and Accelerating Federated Learning on Heterogeneous Data With Partial Client Participation. 67-83 - Wencheng Han, Runzhou Tao, Haibin Ling
, Jianbing Shen
:
Weakly Supervised Monocular 3D Object Detection by Spatial-Temporal View Consistency. 84-98 - Hanjae Kim
, Jiyoung Lee, Kwanghoon Sohn
:
Prototype-Guided Attention Distillation for Discriminative Person Search. 99-115 - Zongsheng Yue
, Jianyi Wang
, Chen Change Loy
:
Efficient Diffusion Model for Image Restoration by Residual Shifting. 116-130 - Hanbo Bi
, Yingchao Feng
, Wenhui Diao
, Peijin Wang
, Yongqiang Mao
, Kun Fu, Hongqi Wang, Xian Sun
:
Prompt-and-Transfer: Dynamic Class-Aware Enhancement for Few-Shot Segmentation. 131-148 - Mushir Akhtar
, M. Tanveer
, Mohd. Arshad
:
RoBoSS: A Robust, Bounded, Sparse, and Smooth Loss Function for Supervised Learning. 149-160 - Chao Li
, Tingsong Jiang
, Handing Wang
, Wen Yao
, Donghua Wang
:
Optimizing Latent Variables in Integrating Transfer and Query Based Attack Framework. 161-171 - Weizhi Nie
, Ruidong Chen
, Weijie Wang
, Bruno Lepri
, Nicu Sebe
:
T2TD: Text-3D Generation Model Based on Prior Knowledge Guidance. 172-189 - Matthew Kowal
, Mennatullah Siam
, Md. Amirul Islam
, Neil D. B. Bruce
, Richard P. Wildes
, Konstantinos G. Derpanis:
Quantifying and Learning Static vs. Dynamic Information in Deep Spatiotemporal Networks. 190-205 - Mengyue Geng
, Lizhi Wang
, Lin Zhu
, Wei Zhang
, Ruiqin Xiong
, Yonghong Tian
:
Event-Enhanced Snapshot Mosaic Hyperspectral Frame Deblurring. 206-223 - Congqi Cao
, Hanwen Zhang
, Yue Lu
, Peng Wang
, Yanning Zhang
:
Scene-Dependent Prediction in Latent Space for Video Anomaly Detection and Anticipation. 224-239 - Bing Han
, Feifei Zhao
, Yi Zeng
, Guobin Shen
:
Developmental Plasticity-Inspired Adaptive Pruning for Deep Spiking and Artificial Neural Networks. 240-251 - Jianqiang Wang
, Ruixiang Xue
, Jiaxin Li
, Dandan Ding
, Yi Lin
, Zhan Ma
:
A Versatile Point Cloud Compressor Using Universal Multiscale Conditional Coding - Part II: Attribute. 252-268 - Jianqiang Wang
, Ruixiang Xue
, Jiaxin Li
, Dandan Ding
, Yi Lin
, Zhan Ma
:
A Versatile Point Cloud Compressor Using Universal Multiscale Conditional Coding - Part I: Geometry. 269-287 - Lingting Zhu
, Yizheng Chen
, Lianli Liu
, Lei Xing
, Lequan Yu
:
Multi-Sensor Learning Enables Information Transfer Across Different Sensory Data and Augments Multi-Modality Imaging. 288-304 - Wenshui Luo
, Shuo Chen
, Tongliang Liu
, Bo Han
, Gang Niu
, Masashi Sugiyama
, Dacheng Tao
, Chen Gong
:
Estimating Per-Class Statistics for Label Noise Learning. 305-322 - Tao Yan
, Jiahui Gao, Ke Xu
, Xiangjie Zhu, Hao Huang
, Helong Li, Benjamin W. Wah
, Rynson W. H. Lau
:
GhostingNet: A Novel Approach for Glass Surface Detection With Ghosting Cues. 323-337 - Zhenyu Huang
, Mouxing Yang
, Xinyan Xiao, Peng Hu
, Xi Peng
:
Noise-Robust Vision-Language Pre-Training With Positive-Negative Learning. 338-350 - Wei Feng
, Fei Wang, Ruize Han
, Yiyang Gan
, Zekun Qian, Junhui Hou
, Song Wang
:
Unveiling the Power of Self-Supervision for Multi-View Multi-Human Association and Tracking. 351-368 - Qing Xiao
, Guiying Liu
, Qianjin Feng, Yu Zhang
, Zhenyuan Ning
:
Tensor Coupled Learning of Incomplete Longitudinal Features and Labels for Clinical Score Regression. 369-386 - Bin Zhang
, Yue Zhang, Junyu Li
, Jiazhou Chen
, Tatsuya Akutsu
, Yiu-Ming Cheung
, Hongmin Cai
:
Unsupervised Dual Deep Hashing With Semantic-Index and Content-Code for Cross-Modal Retrieval. 387-399 - Wenqing Zheng
, S. P. Sharan
, Zhiwen Fan
, Kevin Wang
, Yihan Xi
, Zhangyang Wang
:
Symbolic Visual Reinforcement Learning: A Scalable Framework With Object-Level Abstraction and Differentiable Expression Search. 400-412 - Jaehyeon Son
, Soochan Lee
, Gunhee Kim
:
When Meta-Learning Meets Online and Continual Learning: A Survey. 413-432 - Yawei Zhao
, Qinghe Liu, Pan Liu, Xinwang Liu
, Kunlun He
:
Medical Federated Model With Mixture of Personalized and Shared Components. 433-449 - Yisi Luo
, Xile Zhao
, Deyu Meng
:
Revisiting Nonlocal Self-Similarity from Continuous Representation. 450-468 - Yake Wei
, Di Hu
, Henghui Du, Ji-Rong Wen
:
On-the-Fly Modulation for Balanced Multimodal Learning. 469-485 - Xuelin Qian
, Wenxuan Wang
, Yu-Gang Jiang
, Xiangyang Xue
, Yanwei Fu
:
Dynamic Routing and Knowledge Re-Learning for Data-Free Black-Box Attack. 486-501 - Mohammad Mohammadi, Mohammad Babai
, Michael H. F. Wilkinson
:
Generalized Relevance Learning Grassmann Quantization. 502-513 - Seung-geun Chi
, Hyung-Gun Chi
, Qixing Huang
, Karthik Ramani
:
InfoGCN++: Learning Representation by Predicting the Future for Online Skeleton-Based Action Recognition. 514-528 - Hanyu Zhou
, Yi Chang
, Zhiwei Shi
, Wending Yan
, Gang Chen
, Yonghong Tian
, Luxin Yan
:
Adverse Weather Optical Flow: Cumulative Homogeneous-Heterogeneous Adaptation. 529-548 - Miaoyu Li, Ying Fu
, Tao Zhang
, Ji Liu
, Dejing Dou
, Chenggang Yan
, Yulun Zhang
:
Latent Diffusion Enhanced Rectangle Transformer for Hyperspectral Image Restoration. 549-564 - Chao Chen
, Yu-Shen Liu
, Zhizhong Han
:
NeuralTPS: Learning Signed Distance Functions Without Priors From Single Sparse Point Clouds. 565-582 - Yiran Wang
, Min Shi
, Jiaqi Li
, Chaoyi Hong
, Zihao Huang
, Juewen Peng
, Zhiguo Cao
, Jianming Zhang
, Ke Xian
, Guosheng Lin
:
NVDS+: Towards Efficient and Versatile Neural Stabilizer for Video Depth Estimation. 583-600 - Guangrong Zhao
, Yiran Shen
, Chenlong Zhang
, Zhaoxin Shen, Yuanfeng Zhou
, Hongkai Wen
:
RGBE-Gaze: A Large-Scale Event-Based Multimodal Dataset for High Frequency Remote Gaze Tracking. 601-615 - Yurong Guo
, Ruoyi Du
, Aneeshan Sain
, Kongming Liang
, Yuan Dong
, Yi-Zhe Song
, Zhanyu Ma
:
Understanding Episode Hardness in Few-Shot Learning. 616-633 - Bo Li, Fengguang Peng, Tianrui Hui, Xiaoming Wei, Xiaolin Wei, Lijun Zhang, Hang Shi, Si Liu:
RGB-T Tracking With Template-Bridged Search Interaction and Target-Preserved Template Updating. 634-649 - Runsheng Xu, Chia-Ju Chen, Zhengzhong Tu
, Ming-Hsuan Yang
:
V2X-ViTv2: Improved Vision Transformers for Vehicle-to-Everything Cooperative Perception. 650-662
Volume 47, Number 2, February 2025
- Xudong Pan
, Mi Zhang
, Yifan Yan
, Shengyao Zhang
, Min Yang
:
Matryoshka: Exploiting the Over-Parametrization of Deep Learning Models for Covert Data Transmission. 663-678 - Kun-Yu Lin
, Jiaming Zhou, Wei-Shi Zheng
:
Human-Centric Transformer for Domain Adaptive Action Recognition. 679-696 - Dylan R. Ashley
, Vincent Herrmann
, Zachary Friggstad
, Jürgen Schmidhuber
:
On the Distillation of Stories for Transferring Narrative Arcs in Collections of Independent Media. 697-707 - Jing Liu
, Sihan Chen
, Xingjian He
, Longteng Guo
, Xinxin Zhu
, Weining Wang
, Jinhui Tang
:
VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset. 708-724 - Zhuo Zheng
, Stefano Ermon
, Dongjun Kim, Liangpei Zhang
, Yanfei Zhong
:
Changen2: Multi-Temporal Remote Sensing Generative Change Foundation Model. 725-741 - Md Kaykobad Reza
, Ashley Prater-Bennette
, M. Salman Asif
:
Robust Multimodal Learning With Missing Modalities via Parameter-Efficient Adaptation. 742-754 - Martin Cífka
, Georgy Ponimatkin
, Yann Labbé
, Bryan C. Russell
, Mathieu Aubry
, Vladimír Petrík
, Josef Sivic
:
FocalPose++: Focal Length and Object Pose Estimation via Render and Compare. 755-772 - Olga Veksler
, Yuri Boykov:
Sparse Non-Local CRF With Applications. 773-788 - Chong Shen
, Yicheng Wu
, Guanyu Qian
, Xindong Wu
, Huiliang Cao
, Chenguang Wang
, Jun Tang
, Jun Liu
:
Intelligent Bionic Polarization Orientation Method Using Biological Neuron Model for Harsh Conditions. 789-806 - Lin Zhu
, Xianzhang Chen
, Lizhi Wang
, Xiao Wang
, Yonghong Tian
, Hua Huang
:
Continuous-Time Object Segmentation Using High Temporal Resolution Event Camera. 807-824 - Shaheer U. Saeed
, Shiqi Huang, João Ramalhinho
, Iani J. M. B. Gayo, Nina Montaña Brown
, Ester Bonmati
, Stephen P. Pereira
, Brian R. Davidson
, Dean C. Barratt
, Matthew J. Clarkson
, Yipeng Hu
:
Competing for Pixels: A Self-Play Algorithm for Weakly-Supervised Semantic Segmentation. 825-839 - Man Zhou
, Naishan Zheng
, Xuanhua He
, Danfeng Hong
, Jocelyn Chanussot
:
Probing Synergistic High-Order Interaction for Multi-Modal Image Fusion. 840-857 - Zhenyu Wu
, Wei Wang
, Lin Wang, Yacong Li, Fengmao Lv
, Qing Xia
, Chenglizhao Chen
, Aimin Hao
, Shuo Li
:
Pixel is All You Need: Adversarial Spatio-Temporal Ensemble Active Learning for Salient Object Detection. 858-877 - Wenguan Wang
, Yi Yang, Fei Wu
:
Towards Data- and Knowledge-Driven AI: A Survey on Neuro-Symbolic Computing. 878-899 - Yan Lu
, Xinzhu Ma
, Lei Yang
, Tianzhu Zhang
, Yating Liu
, Qi Chu
, Tong He
, Yonghui Li
, Wanli Ouyang
:
GUPNet++: Geometry Uncertainty Propagation Network for Monocular 3D Object Detection. 900-915 - Zhehui Wang
, Tao Luo
, Cheng Liu
, Weichen Liu
, Rick Siow Mong Goh
, Weng-Fai Wong
:
Enabling Energy-Efficient Deployment of Large Language Models on Memristor Crossbar: A Synergy of Large and Small. 916-933 - Bram Vanherle
, Vittorio Pippi, Silvia Cascianelli
, Nick Michiels
, Frank Van Reeth
, Rita Cucchiara
:
VATr++: Choose Your Words Wisely for Handwritten Text Generation. 934-948 - Wei Zhang
, Jiaming Li, Meng Xia
, Xu Gao, Xiao Tan, Yifeng Shi, Zhenhua Huang
, Guanbin Li
:
OffsetNet: Towards Efficient Multiple Object Tracking, Detection, and Segmentation. 949-960 - Jianqi Chen
, Hao Chen
, Keyan Chen
, Yilan Zhang
, Zhengxia Zou
, Zhenwei Shi
:
Diffusion Models for Imperceptible and Transferable Adversarial Attack. 961-977 - Tianyi Zhang
, Chunyun Chen
, Yun Liu
, Xue Geng
, Mohamed M. Sabry Aly
, Jie Lin
:
PSRR-MaxpoolNMS++: Fast Non-Maximum Suppression With Discretization and Pooling. 978-993 - Mingwu Zheng
, Haiyu Zhang
, Hongyu Yang
, Liming Chen
, Di Huang
:
ImFace++: A Sophisticated Nonlinear 3D Morphable Face Model With Implicit Neural Representations. 994-1012 - Zhanzhou Feng
, Shiliang Zhang
:
Evolved Hierarchical Masking for Self-Supervised Learning. 1013-1027 - Steven A. Grosz
, Anil K. Jain
:
Universal Fingerprint Generation: Controllable Diffusion Model With Multimodal Conditions. 1028-1041 - Minsu Kim
, Hyung-Il Kim
, Yong Man Ro
:
Prompt Tuning of Deep Neural Networks for Speaker-Adaptive Visual Speech Recognition. 1042-1055 - Hao Chen
, François Brémond
, Nicu Sebe
, Shiliang Zhang
:
Anti-Forgetting Adaptation for Unsupervised Person Re-Identification. 1056-1072 - Zhao Zhang
, Suiyi Zhao
, Xiaojie Jin
, Mingliang Xu
, Yi Yang, Shuicheng Yan
, Meng Wang
:
Noise Self-Regression: A New Learning Paradigm to Enhance Low-Light Images Without Task-Related Data. 1073-1088 - Yifan Zhao
, Jia Li
, Zeyin Song, Yonghong Tian
:
Language-Inspired Relation Transfer for Few-Shot Class-Incremental Learning. 1089-1102 - Guojie Li
, Zhiwen Yu
, Kaixiang Yang
, C. L. Philip Chen
, Xuelong Li
:
Ensemble-Enhanced Semi-Supervised Learning With Optimized Graph Construction for High-Dimensional Data. 1103-1119 - Woo-Jeoung Nam
, Seong-Whan Lee
:
Illuminating Salient Contributions in Neuron Activation With Attribution Equilibrium. 1120-1131 - Tian Zhang
, Kongming Liang
, Ruoyi Du
, Wei Chen, Zhanyu Ma
:
Disentangling Before Composing: Learning Invariant Disentangled Features for Compositional Zero-Shot Learning. 1132-1147 - Ioannis Sarridis, Christos Koutlis, Symeon Papadopoulos, Christos Diou:
FLAC: Fairness-Aware Representation Learning by Suppressing Attribute-Class Associations. 1148-1160 - Eduardo Fernandes Montesuma
, Fred Maurice Ngolè Mboula, Antoine Souloumiac:
Recent Advances in Optimal Transport for Machine Learning. 1161-1180 - Sherenaz W. Al-Haj Baddar
, Alessandro Languasco
, Mauro Migliardi
:
Efficient Analysis of Overdispersed Data Using an Accurate Computation of the Dirichlet Multinomial Distribution. 1181-1189 - Xu Zheng
, Peng Yuan Zhou
, Athanasios V. Vasilakos
, Lin Wang
:
360SFUDA++: Towards Source-Free UDA for Panoramic Segmentation by Learning Reliable Category Prototypes. 1190-1204 - Yipo Huang
, Leida Li
, Pengfei Chen
, Haoning Wu
, Weisi Lin
, Guangming Shi
:
Multi-Modality Multi-Attribute Contrastive Pre-Training for Image Aesthetics Computing. 1205-1218 - Shilin Gu
, Chao Xu
, Dewen Hu
, Chenping Hou
:
Adaptive Learning for Dynamic Features and Noisy Labels. 1219-1237 - Shuaicheng Liu
, Zhuofan Zhang
, Zhen Liu
, Ping Tan
, Bing Zeng
:
Minimum Latency Deep Online Video Stabilization and Its Extensions. 1238-1249 - Rui Zhang
, Xingbo Du
, Junchi Yan
, Shihua Zhang
:
The Decoupling Concept Bottleneck Model. 1250-1265 - Bo Zhang
, Jinli Suo
, Qionghai Dai
:
Event-Enhanced Snapshot Compressive Videography at 10K FPS. 1266-1278 - Lue Fan
, Feng Wang
, Naiyan Wang
, Zhaoxiang Zhang
:
FSD V2: Improving Fully Sparse 3D Object Detection With Virtual Voxels. 1279-1292 - Valero Laparra
, Juan Emmanuel Johnson
, Gustau Camps-Valls
, Raúl Santos-Rodríguez
, Jesús Malo
:
Estimating Information Theoretic Measures via Multidimensional Gaussianization. 1293-1308 - Alessandra Carbone
, Aurélien Decelle
, Lorenzo Rosset
, Beatriz Seoane
:
Fast and Functional Structured Data Generators Rooted in Out-of-Equilibrium Physics. 1309-1316 - Min Gan
, Xiang-Xiang Su, Guang-Yong Chen
, Jing Chen
, C. L. Philip Chen
:
Online Learning Under a Separable Stochastic Approximation Framework. 1317-1330
Volume 47, Number 3, March 2025
- Zhou Zhai, Bin Gu
, Cheng Deng
, Heng Huang
:
Global Model Selection via Solution Paths for Robust Support Vector Machine. 1331-1347 - Zhilu Zhang
, Ruohao Wang
, Hongzhi Zhang
, Wangmeng Zuo
:
Self-Supervised Learning for Real-World Super-Resolution From Dual and Multiple Zoomed Observations. 1348-1361 - Wei-Shi Zheng
, Junkai Yan
, Yi-Xing Peng
:
A Versatile Framework for Multi-Scene Person Re-Identification. 1362-1380 - Huachen Fang
, Jinjian Wu
, Qibin Hou
, Weisheng Dong
, Guangming Shi
:
Fast Window-Based Event Denoising With Spatiotemporal Correlation Enhancement. 1381-1394 - Chenyi Jiang
, Shidong Wang
, Yang Long
, Zechao Li
, Haofeng Zhang
, Ling Shao
:
Imaginary-Connected Embedding in Complex Space for Unseen Attribute-Object Discrimination. 1395-1413 - Xin Wei
, Xiang Gu
, Jian Sun
:
Multi-Scale Part-Based Feature Representation for 3D Domain Generalization and Adaptation. 1414-1430 - Maoyuan Ye
, Jing Zhang
, Juhua Liu
, Chenyu Liu
, Baocai Yin
, Cong Liu
, Bo Du
, Dacheng Tao
:
Hi-SAM: Marrying Segment Anything Model for Hierarchical Text Segmentation. 1431-1447 - SuBeen Lee
, WonJun Moon
, Hyun Seok Seong
, Jae-Pil Heo
:
Task-Oriented Channel Attention for Fine-Grained Few-Shot Classification. 1448-1463 - Zhenyi Wang
, Enneng Yang
, Li Shen
, Heng Huang:
A Comprehensive Survey of Forgetting in Deep Learning Beyond Continual Learning. 1464-1483 - Jiechao Yang
, Yong Liu
, Wei Wang, Haoran Wu, Zhiyuan Chen, Xibo Ma
:
PATNAS: A Path-Based Training-Free Neural Architecture Search. 1484-1500 - Ziming Zhang
, Yuping Shao, Yiqing Zhang, Fangzhou Lin
, Haichong K. Zhang
, Elke A. Rundensteiner
:
Deep Loss Convexification for Learning Iterative Models. 1501-1513 - Seongwon Lee
, Hongje Seong
, Suhyeon Lee
, Euntai Kim
:
Correlation Verification for Image Retrieval and Its Memory Footprint Optimization. 1514-1529 - Juewen Peng
, Zhiguo Cao
, Xianrui Luo
, Ke Xian
, Wenfeng Tang, Jianming Zhang
, Guosheng Lin
:
BokehMe++: Harmonious Fusion of Classical and Neural Rendering for Versatile Bokeh Creation. 1530-1547 - Hao Wang
, Minghui Liao
, Zhouyi Xie
, Wenyu Liu
, Xiang Bai
:
Partial Scene Text Retrieval. 1548-1563 - Tianyi Zhang
, Matthew Dutson
, Vivek Boominathan
, Mohit Gupta
, Ashok Veeraraghavan
:
Streaming Quanta Sensors for Online, High-Performance Imaging and Vision. 1564-1577 - Bin Xia
, Yulun Zhang
, Shiyin Wang
, Yitong Wang
, Xinglong Wu
, Yapeng Tian
, Wenming Yang
, Radu Timofte, Luc Van Gool
:
DiffI2I: Efficient Diffusion Model for Image-to-Image Translation. 1578-1593 - Lingfeng Yang
, Xiang Li
, Yueze Wang, Xinlong Wang
, Jian Yang
:
Fine-Grained Visual Text Prompting. 1594-1609 - Bin Chen
, Jian Zhang
:
Practical Compact Deep Compressed Sensing. 1610-1626 - Ruiwen Yuan
, Yongqiang Tang
, Yanghao Xiao, Wensheng Zhang
:
IBCS: Learning Information Bottleneck-Constrained Denoised Causal Subgraph for Graph Classification. 1627-1643 - Daochang Liu
, Qiyue Li
, Anh-Dung Dinh, Tingting Jiang
, Mubarak Shah
, Chang Xu
:
DiffAct++: Diffusion Action Segmentation. 1644-1659 - Shaofei Huang
, Zhenwei Shen
, Zehao Huang
, Yue Liao
, Jizhong Han
, Naiyan Wang
, Si Liu
:
Anchor3DLane++: 3D Lane Detection via Sample-Adaptive Sparse 3D Anchor Regression. 1660-1673 - Yi Yu
, Yufei Wang, Wenhan Yang
, Lanqing Guo, Shijian Lu
, Ling-Yu Duan
, Yap-Peng Tan, Alex C. Kot
:
Robust and Transferable Backdoor Attacks Against Deep Image Compression With Selective Frequency Prior. 1674-1693 - Guotao Wang
, Chenglizhao Chen
, Aimin Hao
, Hong Qin
, Deng-Ping Fan
:
WinDB: HMD-Free and Distortion-Free Panoptic Video Fixation Learning. 1694-1713 - Dewei Zhou
, You Li, Fan Ma
, Zongxin Yang
, Yi Yang:
MIGC++: Advanced Multi-Instance Generation Controller for Image Synthesis. 1714-1728 - Peng Zhou
, Rongwen Li, Zhaolong Ling
, Liang Du
, Xinwang Liu
:
Fair Clustering Ensemble With Equal Cluster Capacity. 1729-1746 - Zefeng Zheng
, Shaohua Teng
, Luyao Teng
, Wei Zhang
, NaiQi Wu
:
Adaptive Graph Learning With Semantic Promotability for Domain Adaptation. 1747-1763 - Zhiping Yu
, Chenyang Liu
, Liqin Liu
, Zhenwei Shi
, Zhengxia Zou
:
MetaEarth: A Generative Foundation Model for Global-Scale Remote Sensing Image Generation. 1764-1781 - Yulin Wang
, Haoji Zhang, Yang Yue
, Shiji Song
, Chao Deng, Junlan Feng, Gao Huang
:
Uni-AdaFocus: Spatial-Temporal Dynamic Computation for Video Recognition. 1782-1799 - Yi Tang
, Min Liu
, Baopu Li
, Yaonan Wang
, Wanli Ouyang
:
NAS-PED: Neural Architecture Search for Pedestrian Detection. 1800-1817 - Xingming Long
, Jie Zhang
, Shiguang Shan
:
Generalized Face Liveness Detection via De-Fake Face Generator. 1818-1831 - Yansheng Li
, Linlin Wang
, Tingzhu Wang
, Xue Yang
, Junwei Luo
, Qi Wang
, Youming Deng, Wenbin Wang, Xian Sun
, Haifeng Li
, Bo Dang
, Yongjun Zhang
, Yi Yu
, Junchi Yan
:
STAR: A First-Ever Dataset and a Large-Scale Benchmark for Scene Graph Generation in Large-Size Satellite Imagery. 1832-1849 - Wenjun Zhang
, Liangxiao Jiang
, Chaoqun Li
:
ELDP: Enhanced Label Distribution Propagation for Crowdsourcing. 1850-1862 - Shuaicheng Liu
, Mingbo Hong, Yuhang Lu, Nianjin Ye
, Chunyu Lin
, Bing Zeng
:
Unsupervised Global and Local Homography Estimation With Coplanarity-Aware GAN. 1863-1876 - Peng Xu
, Wenqi Shao
, Kaipeng Zhang, Peng Gao
, Shuo Liu, Meng Lei, Fanqing Meng, Siyuan Huang, Yu Qiao
, Ping Luo
:
LVLM-EHub: A Comprehensive Evaluation Benchmark for Large Vision-Language Models. 1877-1893 - Zihao Wang
, Shaofei Cai
, Anji Liu, Yonggang Jin, Jinbing Hou, Bowei Zhang
, Haowei Lin, Zhaofeng He
, Zilong Zheng, Yaodong Yang
, Xiaojian Ma
, Yitao Liang
:
JARVIS-1: Open-World Multi-Task Agents With Memory-Augmented Multimodal Language Models. 1894-1907 - Haotian Wang
, Meng Yang
, Xinhu Zheng
, Gang Hua
:
Scale Propagation Network for Generalizable Depth Completion. 1908-1922 - Zhi Wang
, Dong Hu, Zhuo Liu
, Chao Gao
, Zhen Wang:
Iteratively Capped Reweighting Norm Minimization With Global Convergence Guarantee for Low-Rank Matrix Learning. 1923-1940 - Tharindu Fernando
, Harshala Gammulle
, Sridha Sridharan
, Simon Denman
, Clinton Fookes
:
Remembering What is Important: A Factorised Multi-Head Retrieval and Auxiliary Memory Stabilisation Scheme for Human Motion Prediction. 1941-1957 - Hao Tang
, Zechao Li
, Dong Zhang
, Shengfeng He
, Jinhui Tang
:
Divide-and-Conquer: Confluent Triple-Flow Network for RGB-T Salient Object Detection. 1958-1974 - Zhile Yang
, Shangqi Guo
, Ying Fang, Zhaofei Yu
, Jian K. Liu
:
Spiking Variational Policy Gradient for Brain Inspired Reinforcement Learning. 1975-1990 - Hua Li
, Wenya Luo
, Zhidong Bai, Huanchao Zhou, Zhangni Pu
:
Spectrally-Corrected and Regularized LDA for Spiked Model. 1991-1999 - Raphael Sulzer
, Renaud Marlet
, Bruno Vallet, Loïc Landrieu
:
A Survey and Benchmark of Automatic Surface Reconstruction From Point Clouds. 2000-2019 - Zhiqi Li
, Wenhai Wang
, Hongyang Li
, Enze Xie
, Chonghao Sima
, Tong Lu
, Yu Qiao, Jifeng Dai
:
BEVFormer: Learning Bird's-Eye-View Representation From LiDAR-Camera via Spatiotemporal Transformers. 2020-2036 - Chenglizhao Chen
, Guangxiao Ma
, Wenfeng Song
, Shuai Li
, Aimin Hao
, Hong Qin
:
Saliency-Free and Aesthetic-Aware Panoramic Video Navigation. 2037-2054 - Feiping Nie
, Yitao Song
, Wei Chang
, Rong Wang
, Xuelong Li
:
Fast Semi-Supervised Learning on Large Graphs: An Improved Green-Function Method. 2055-2070 - Xiao Wu
, Zihan Cao
, Ting-Zhu Huang
, Liang-Jian Deng
, Jocelyn Chanussot
, Gemine Vivone
:
Fully-Connected Transformer for Multi-Source Image Fusion. 2071-2088 - Tianxin Xie
, Hu Han
, Shiguang Shan
, Xilin Chen
:
Natural Adversarial Mask for Face Identity Protection in Physical World. 2089-2106 - Hanlin Wang
, Yilu Wu, Sheng Guo
, Limin Wang
:
PDPP: Projected Diffusion for Procedure Planning in Instructional Videos. 2107-2124 - Peng Jin
, Hao Li
, Li Yuan
, Shuicheng Yan
, Jie Chen
:
Hierarchical Banzhaf Interaction for General Video-Language Representation Learning. 2125-2139 - Haeyong Kang
, Jaehong Yoon, Sung Ju Hwang, Chang D. Yoo
:
Continual Learning: Forget-Free Winning Subnetworks for Video Representations. 2140-2156 - Tomasz Lukaszewicz
, Dariusz Kania
:
Trajectory of Fifths Based on Chroma Subbands Extraction - A New Approach to Music Representation, Analysis, and Classification. 2157-2169 - Chen Qiu
, Marius Kloft
, Stephan Mandt
, Maja Rudolph
:
Self-Supervised Anomaly Detection With Neural Transformations. 2170-2185 - Ziyu Guan
, Wanqing Zhao
, Hongmin Liu
, Yuta Nakashima
, Noboru Babaguchi
, Xiaofei He
:
Cross-Modal Guided Visual Representation Learning for Social Image Retrieval. 2186-2198 - Daojun Liang
, Haixia Zhang
, Dongfeng Yuan
, Minggao Zhang:
Multi-Head Encoding for Extreme Label Classification. 2199-2211 - Fengxiang Bie
, Yibo Yang
, Zhongzhu Zhou, Adam Ghanem
, Minjia Zhang, Zhewei Yao, Xiaoxia Wu, Connor Holmes, Pareesa Ameneh Golnari, David A. Clifton
, Yuxiong He, Dacheng Tao
, Shuaiwen Leon Song:
RenAIssance: A Survey Into AI Text-to-Image Generation in the Era of Large Model. 2212-2231 - Ajay Kumar
:
Insights on 'Complex-Valued Iris Recognition Network'. 2232-2236 - Yanbiao Ma
, Licheng Jiao
, Fang Liu
, Lingling Li
, Wenping Ma
, Shuyuan Yang
, Xu Liu
, Puhua Chen
:
Unveiling and Mitigating Generalized Biases of DNNs Through the Intrinsic Dimensions of Perceptual Manifolds. 2237-2244
Volume 47, Number 4, April 2025
- Muhammad Awais, Muzammal Naseer, Salman Khan, Rao Muhammad Anwer, Hisham Cholakkal, Mubarak Shah, Ming-Hsuan Yang, Fahad Shahbaz Khan:
Foundation Models Defining a New Era in Vision: A Survey and Outlook. 2245-2264 - Lei Sun, Daniel Gehrig, Christos Sakaridis, Mathias Gehrig, Jingyun Liang, Peng Sun, Zhijie Xu, Kaiwei Wang, Luc Van Gool, Davide Scaramuzza:
A Unified Framework for Event-Based Frame Interpolation With Ad-Hoc Deblurring in the Wild. 2265-2279 - Shuangming Yang, Bernabé Linares-Barranco, Yuzhu Wu, Badong Chen:
Self-Supervised High-Order Information Bottleneck Learning of Spiking Neural Network for Robust Event-Based Optical Flow Estimation. 2280-2297 - Bo Sun, Hao Kang, Li Guan, Haoxiang Li, Philippos Mordohai, Gang Hua:
Glissando-Net: Deep Single View Category Level Pose Estimation and 3D Reconstruction. 2298-2312 - Chunxiao Fan, Dan Guo, Ziqi Wang, Meng Wang:
Multi-Objective Convex Quantization for Efficient Model Compression. 2313-2329 - Chinthaka Dinesh, Gene Cheung, Saghar Bagheri, Ivan V. Bajic:
Efficient Signed Graph Sampling via Balancing & Gershgorin Disc Perfect Alignment. 2330-2348 - Jinyuan Liu, Guanyao Wu, Zhu Liu, Di Wang, Zhiying Jiang, Long Ma, Wei Zhong, Xin Fan, Risheng Liu:
Infrared and Visible Image Fusion: From Data Compatibility to Task Adaption. 2349-2369 - Yue Dai, Yifan Feng, Nan Ma, Xibin Zhao, Yue Gao:
Cross-Modal 3D Shape Retrieval via Heterogeneous Dynamic Graph Representation. 2370-2387 - Yifan Feng, Jiangang Huang, Shaoyi Du, Shihui Ying, Jun-Hai Yong, Yipeng Li, Guiguang Ding, Rongrong Ji, Yue Gao:
Hyper-YOLO: When Visual Object Detection Meets Hypergraph Computation. 2388-2401 - Hang Lin, Yifan Peng, Yubo Zhang, Lin Bie, Xibin Zhao, Yue Gao:
Filter Pruning by High-Order Spectral Clustering. 2402-2415 - Xiaowei Hu, Min Shi, Weiyun Wang, Sitong Wu, Linjie Xing, Wenhai Wang, Xizhou Zhou, Lewei Lu, Jie Zhou, Xiaogang Wang, Yu Qiao, Jifeng Dai:
Demystify Transformers & Convolutions in Modern Image Deep Networks. 2416-2428 - Xuxiang Sun, Gong Cheng, Hongda Li, Chunbo Lang, Junwei Han:
STDatav2: Accessing Efficient Black-Box Stealing for Adversarial Attacks. 2429-2445 - Miin-Shen Yang, Kristina P. Sinaga:
Federated Multi-View K-Means Clustering. 2446-2459 - Tianwei Cao, Qianqian Xu, Zhiyong Yang, Zhanyu Ma, Qingming Huang:
Practically Unbiased Pairwise Loss for Recommendation With Implicit Feedback. 2460-2474 - Jiayi Ji, Haowei Wang, Changli Wu, Yiwei Ma, Xiaoshuai Sun, Rongrong Ji:
JM3D & JM3D-LLM: Elevating 3D Representation With Joint Multi-Modal Cues. 2475-2492 - Yanan Li, Zhimin Wang, Ruipeng Xing, Changheng Shao, Shangshang Shi, Jiaxin Li, Guoqiang Zhong, Yongjian Gu:
Quantum Gated Recurrent Neural Networks. 2493-2504 - Yue Song, Wei Wang, Nicu Sebe:
RankFeat&RankWeight: Rank-1 Feature/Weight Removal for Out-of-Distribution Detection. 2505-2519 - Chuanxing Geng, Aiyang Han, Songcan Chen:
Explicit View-Labels Matter: A Multifacet Complementarity Study of Multi-View Clustering. 2520-2532 - Kaiwen Jiang, Shu-Yu Chen, Feng-Lin Liu, Hongbo Fu, Lin Gao:
Towards High-Quality and Disentangled Face Editing in a 3D GAN. 2533-2544 - Wenhui Wu, Jian Weng, Pingping Zhang, Xu Wang, Wenhan Yang, Jianmin Jiang:
Interpretable Optimization-Inspired Unfolding Network for Low-Light Image Enhancement. 2545-2562 - Shipeng Wang, Xiaorong Li, Jian Sun, Zongben Xu:
Training Networks in Null Space of Feature Covariance With Self-Supervision for Incremental Learning. 2563-2580 - Jin Liu, Zhongyuan Lu, Yaorong Cen, Hui Hu, Zhenfeng Shao, Yong Hong, Ming Jiang, Miaozhong Xu:
Enhancing Object Detection With Fourier Series. 2581-2596 - Haochen Liu, Zhiyu Huang, Wenhui Huang, Haohan Yang, Xiaoyu Mo, Chen Lv:
Hybrid-Prediction Integrated Planning for Autonomous Driving. 2597-2614 - Zhiying Lu, Chuanbin Liu, Xiaojun Chang, Yongdong Zhang, Hongtao Xie:
DHVT: Dynamic Hybrid Vision Transformer for Small Dataset Recognition. 2615-2631 - Xin Deng, Chenxiao Zhang, Lai Jiang, Jingyuan Xia, Mai Xu:
DeepSN-Net: Deep Semi-Smooth Newton Driven Network for Blind Image Restoration. 2632-2646 - Lingling Zhang, Yujie Zhong, Qinghua Zheng, Jun Liu, Qianying Wang, Jiaxin Wang, Xiaojun Chang:
TDGI: Translation-Guided Double-Graph Inference for Document-Level Relation Extraction. 2647-2659 - Jintian Ji, Songhe Feng:
Anchors Crash Tensor: Efficient and Scalable Tensorial Multi-View Subspace Clustering. 2660-2675 - Yunshan Zhong, You Huang, Jiawei Hu, Yuxin Zhang, Rongrong Ji:
Towards Accurate Post-Training Quantization of Vision Transformers via Error Reduction. 2676-2692 - Xiuwen Fang, Mang Ye, Bo Du:
Robust Asymmetric Heterogeneous Federated Learning With Corrupted Clients. 2693-2705 - Yibo Zhou, Bo Li, Hai-Miao Hu, Xiaokang Zhang, Dongping Zhang, Hanzi Wang:
Heterogeneous Feature Re-Sampling for Balanced Pedestrian Attribute Recognition. 2706-2722 - Yongkun Du, Zhineng Chen, Yuchen Su, Caiyan Jia, Yu-Gang Jiang:
Instruction-Guided Scene Text Recognition. 2723-2738 - Bo Pang, Zhenyu Wei, Jingli Lin, Cewu Lu:
Auto-Pairing Positives Through Implicit Relation Circulation for Discriminative Self-Learning. 2739-2753 - Jiefeng Li, Siyuan Bian, Chao Xu, Zhicun Chen, Lixin Yang, Cewu Lu:
HybrIK-X: Hybrid Analytical-Neural Inverse Kinematics for Whole-Body Mesh Recovery. 2754-2769 - Mingyuan Lin, Jian Liu, Chi Zhang, Zibo Zhao, Chu He, Lei Yu:
Non-Uniform Exposure Imaging via Neuromorphic Shutter Control. 2770-2784 - Dong Zhang, Kwang-Ting Cheng:
Generalized Task-Driven Medical Image Quality Enhancement With Gradient Promotion. 2785-2798 - Xingran Liao, Xuekai Wei, Mingliang Zhou, Hau-San Wong, Sam Kwong:
Image Quality Assessment: Exploring Joint Degradation Effect of Deep Network Features via Kernel Representation Similarity Analysis. 2799-2815 - Wen Fei, Wenrui Dai, Liang Zhang, Luoming Zhang, Chenglin Li, Junni Zou, Hongkai Xiong:
Latent Weight Quantization for Integerized Training of Deep Neural Networks. 2816-2832 - Ke Sun, Zhongxi Chen, Xianming Lin, Xiaoshuai Sun, Hong Liu, Rongrong Ji:
Conditional Diffusion Models for Camouflaged and Salient Object Detection. 2833-2848 - Xingxing Wei, Shouwei Ruan, Yinpeng Dong, Hang Su, Xiaochun Cao:
Distributionally Location-Aware Transferable Adversarial Patches for Facial Images. 2849-2864 - Yong Du, Jiahui Zhan, Xinzhe Li, Junyu Dong, Sheng Chen, Ming-Hsuan Yang, Shengfeng He:
One-for-All: Towards Universal Domain Translation With a Single StyleGAN. 2865-2881 - Jiajun Zhou, Shengbo Gong, Xuanze Chen, Chenxuan Xie, Shanqing Yu, Qi Xuan, Xiaoniu Yang:
Clarify Confused Nodes via Separated Learning. 2882-2896 - Junhong Zhang, Zhihui Lai, Heng Kong, Jian Yang:
Learning the Optimal Discriminant SVM With Feature Extraction. 2897-2911 - Xiao Wang, Jianlong Wu, Zijia Lin, Fuzheng Zhang, Di Zhang, Liqiang Nie:
Video DataFlywheel: Resolving the Impossible Data Trinity in Video-Language Understanding. 2912-2923 - Jinghua Zhang, Li Liu, Olli Silvén, Matti Pietikäinen, Dewen Hu:
Few-Shot Class-Incremental Learning for Classification and Object Detection: A Survey. 2924-2945 - Cong Shen, Xiang Liu, Jiawei Luo, Kelin Xia:
Torsion Graph Neural Networks. 2946-2956 - Yuliang Liu, Mingxin Huang, Hao Yan, Linger Deng, Weijia Wu, Hao Lu, Chunhua Shen, Lianwen Jin, Xiang Bai:
VimTS: A Unified Video and Image Text Spotter for Enhancing the Cross-Domain Generalization. 2957-2972 - Man Yao, Xuerui Qiu, Tianxiang Hu, Jiakui Hu, Yuhong Chou, Keyu Tian, Jianxing Liao, Luziwei Leng, Bo Xu, Guoqi Li:
Scaling Spike-Driven Transformer With Efficient Spike Firing Approximation Training. 2973-2990 - Zongbo Bao, Penghui Yao:
On Testing and Learning Quantum Junta Channels. 2991-3002 - Jianan Li, Jie Wang, Junjie Chen, Tingfa Xu:
Towards Robust Point Cloud Recognition With Sample-Adaptive Auto-Augmentation. 3003-3017 - Ziang Cao, Fangzhou Hong, Tong Wu, Liang Pan, Ziwei Liu:
DiffTF++: 3D-Aware Diffusion Transformer for Large-Vocabulary 3D Generation. 3018-3030 - Lihe Yang, Zhen Zhao, Hengshuang Zhao:
UniMatch V2: Pushing the Limit of Semi-Supervised Semantic Segmentation. 3031-3048 - Peirong Zhang, Yuliang Liu, Songxuan Lai, Hongliang Li, Lianwen Jin:
Privacy-Preserving Biometric Verification With Handwritten Random Digit String. 3049-3066 - Zhuang Yang:
Adaptive Biased Stochastic Optimization. 3067-3078 - Yunsong Zhou, Quan Liu, Hongzi Zhu, Yunzhe Li, Shan Chang, Minyi Guo:
Exploiting Ground Depth Estimation for Mobile Monocular 3D Object Detection. 3079-3093 - Jie Wang, Mingxuan Ye, Yufei Kuang, Rui Yang, Wengang Zhou, Houqiang Li, Feng Wu:
Long-Term Feature Extraction via Frequency Prediction for Efficient Reinforcement Learning. 3094-3110 - Christos Sakaridis, David Brüggemann, Fisher Yu, Luc Van Gool:
Condition-Invariant Semantic Segmentation. 3111-3125 - Chengyue Wang, Haicheng Liao, Zhenning Li, Chengzhong Xu:
WAKE: Towards Robust and Physically Feasible Trajectory Prediction for Autonomous Vehicles With WAvelet and KinEmatics Synergy. 3126-3140 - Ao Li, Le Zhang, Yun Liu, Ce Zhu:
Exploring Frequency-Inspired Optimization in Transformer for Efficient Single Image Super-Resolution. 3141-3158 - Junke Wang, Zuxuan Wu, Dongdong Chen, Chong Luo, Xiyang Dai, Lu Yuan, Yu-Gang Jiang:
OmniTracker: Unifying Visual Object Tracking by Tracking-With-Detection. 3159-3174 - Quanxue Gao, Fangfang Li, Qianqian Wang, Xinbo Gao, Dacheng Tao:
Manifold Based Multi-View K-Means. 3175-3182
