default search action
ICME 2024: Niagara Falls, ON, Canada
- IEEE International Conference on Multimedia and Expo, ICME 2024, Niagara Falls, ON, Canada, July 15-19, 2024. IEEE 2024, ISBN 979-8-3503-9015-5
- Xinyue Chen, Miaojing Shi:
Memory-guided Network with Uncertainty-based Feature Augmentation for Few-shot Semantic Segmentation. 1-6 - Ziran Zhu, Tongda Xu, Ling Li, Yan Wang:
Noise Dimension of GAN: An Image Compression Perspective. 1-6 - Weijun Yuan, Zhan Li, Xiaohan Li, Liangda Fang, Qingfeng Zhang, Zhixiang Qiu:
Crowd Counting and Localization in Haze and Rain. 1-6 - Jiayang Liu, Kai Wang, Zheng Wang, Xing Xu:
SADA: Self-Adaptive Domain Adaptation From Black-Box Predictors. 1-6 - Jin Chen, Jiahe Tian, Cai Yu, Xi Wang, Zhaoxing Li, Yesheng Chai, Jiao Dai, Jizhong Han:
ConfR: Conflict Resolving for Generalizable Deepfake Detection. 1-6 - Wentao Ma, Anni Tang, Jun Ling, Han Xue, Huiheng Liao, Yunhui Zhu, Li Song:
SingAvatar: High-fidelity Audio-driven Singing Avatar Synthesis. 1-6 - Yuchen Wang, Xiaoguang Li, Li Yang, Lu Zhou, Jianfeng Ma, Hui Li:
Adaptive Oriented Adversarial Attacks on Visible and Infrared Image Fusion Models. 1-6 - Xin Li, Haizhuang Liu, Rongquan Wang, Bochao Zou, Yuxin Lin, Huimin Ma:
EMo Transformer: Transformer-Based Depression Detection via Eye Movements. 1-6 - Lin Bie, Shouan Pan, Kai Cheng, Li Han:
Build a Cross-modality Bridge for Image-to-Point Cloud Registration. 1-6 - Yibowen Zhao, Yonghui Xu, Ning Liu, Yixin Zhang, Wei Guo, Xudong Lu, Lizhen Cui:
Causal Denoising Framework for Generalizable Recommendation System using Graph Neural Network. 1-6 - Ting Cai, Yu Xiong, Chengyang He, Chao Wu, Song Zhou:
TBU: A Large-scale Multi-mask Video Dataset for Teacher Behavior Understanding. 1-6 - Ying Ren, Kailai Shen, Zhe Ye, Diqun Yan:
EventTrojan: Manipulating Non-Intrusive Speech Quality Assessment via Imperceptible Events. 1-6 - Ziqiang Shi, Rujie Liu:
Multimedia Generative Modelling with High-Order Langevin Dynamics. 1-6 - Ye Bai, Chenxing Li, Hao Li, Yuanyuan Zhao, Xiaorui Wang:
Jointly Recognizing Speech and Singing Voices Based on Multi-Task Audio Source Separation. 1-6 - Zekun Xu, Yipeng Zhou, Quan Z. Sheng, Chao Li, Tongtong Lou, Weipeng Jing:
Adaptive Global-local Fusion Network Based Deep Unsupervised Hashing for Remote Sensing Image Retrieval. 1-6 - Chen Wu, Zhuoran Zheng, Pengwen Dai, Chenggang Shan, Xiuyi Jia:
Rethinking Image Deraining via Text-guided Detail Reconstruction. 1-6 - Hanlin Li, Yueyi Zhang, Guanting Dong, Shida Sun, Zhiwei Xiong:
Joint Flow Estimation from Point Clouds and Event Streams. 1-6 - Yulin Zhao, Xiangling Ding:
One-Class HEVC Double Compression Detection with Same Coding Parameters. 1-6 - Sumei Li, Xiaoxuan Chen, Peiming Lin:
A Lightweight CNN and Spatial-Channel Transformer Hybrid Network for Image Super-Resolution. 1-6 - Yunzhe Xiao, Xueqiong Li, Shaowu Yang, Wenjing Yang, Yong Dou:
CRNet: Cross-Reconstruction Network for Inconsistent Point Cloud Registration. 1-6 - Biao Wu, Haitao Wang, Hejun Wu:
Task-Aware Lipschitz Confidence Data Augmentation in Visual Reinforcement Learning From Images. 1-6 - Yaoxun Xu, Xingchen Song, Zhiyong Wu, Di Wu, Zhendong Peng, Binbin Zhang:
Hydraformer: One Encoder for All Subsampling Rates. 1-6 - Hao Deng, Shengmei Chen, Cheng Liu, Bo Jiang, Lin Wang:
Geo GCN: Geometric-based Graph CNN for Learning on Point Cloud. 1-6 - Xiaotian Han, Yiqi Wang, Bohan Zhai, Quanzeng You, Hongxia Yang:
COCO is "ALL" You Need for Visual Instruction Fine-tuning. 1-5 - Shuai Zhao, Shibin Liu, Boyuan Zhang, Yang Zhai, Ziyi Liu, Yahong Han:
A Patch-wise Adversarial Denoising Could Enhance the Robustness of Adversarial Training. 1-6 - Zixian Gao, Xun Jiang, Hua Chen, Yujie Li, Yang Yang, Xing Xu:
Uncertainty-Debiased Multimodal Fusion: Learning Deterministic Joint Representation for Multimodal Sentiment Analysis. 1-6 - Shifeng Liu, Xinglong Mao, Sirui Zhao, Chaoyou Fu, Ying Yu, Tong Xu, Enhong Chen:
TGMAE: Self-supervised Micro-Expression Recognition with Temporal Gaussian Masked Autoencoder. 1-6 - Tianci Xun, Wei Chen, Yulin He, Di Wu, Yuanming Gao, Jiuyuan Zhu, Weiwei Zheng:
Distinguishing Textual Prompt Importance: Image-Guided Text Weighting for CLIP-Based Few-shot Learning. 1-6 - Xinyu Xiao, Yun Hu, Eryun Liu:
Local-to-Global Self-Consistency Learning for Temporal Action Localization. 1-6 - Gakusei Sato, Taketo Akama:
Annotation-Free Automatic Music Transcription with Scalable Synthetic Data and Adversarial Domain Confusion. 1-6 - Ruisheng Yuan, Minzhe Tang, Dongliang Kou, Mingyang Sun, Dingkang Yang, Xiao Zhao, Lihua Zhang:
IIPC: Intra-Inter Patch Correlations for Garment Collision Handling. 1-6 - Haoyu Tang, Shuaike Zhang, Ming Yan, Ji Zhang, Mingzhu Xu, Yupeng Hu, Liqiang Nie:
Two-Stage Information Bottleneck For Temporal Language Grounding. 1-6 - Zhixiang Yuan, Kaixin Zhang, Tao Huang:
Positive Label Is All You Need for Multi-Label Classification. 1-6 - Stephen D. Voran:
Why Some Audio Signal Short-Time Fourier Transform Coefficients Have Nonuniform Phase Distributions. 1-6 - Yixuan Guan, Xuefeng Liu, Tao Ren, Jianwei Niu:
FedMDC: Enabling Communication-Efficient Federated Learning over Packet Lossy Networks via Multiple Description Coding. 1-7 - Guosheng Cui, Fusheng Hao, Dan Wu, Ye Li:
Fast label prediction based on shrunk anchor graph for semi-supervised incomplete multiview classification. 1-6 - Xingbei Guo, Ziping Ma, Qing Wang, Pengxu Wei:
Towards Real-world Continuous Super-Resolution: Benchmark and Method. 1-6 - Feihu Jiang, Chuan Qin, Jingshuai Zhang, Kaichun Yao, Xi Chen, Dazhong Shen, Chen Zhu, Hengshu Zhu, Hui Xiong:
Towards Efficient Resume Understanding: A Multi-Granularity Multi-Modal Pre-Training Approach. 1-6 - Ruoyan Pi, Jinglin Xu, Yuxin Peng:
FE-VAD: High-Low Frequency Enhanced Weakly Supervised Video Anomaly Detection. 1-6 - Alysa Ziying Tan, Siwei Feng, Han Yu:
FL-Clip: Bridging Plasticity and Stability in Pre-Trained Federated Class-Incremental Learning Models. 1-6 - Mingzhou Wu, Shiqi Dai, Han Hu, Zhi Wang:
Collaborative Edge Caching in LEO Satellites Networks: A MAPPO Based Approach. 1-6 - Jiaxin Deng, Shiyao Wang, Dong Shen, Liqin Zhao, Fan Yang, Guorui Zhou, Gaofeng Meng:
A Multimodal Transformer for Live Streaming Highlight Prediction. 1-6 - Qilong Xu, Xiuyang Zhao:
Contour-Guided Modality Mitigation Network for Visible-Infrared Person Re-Identification. 1-6 - Xiaowen Ma, Jiawei Yang, Rui Che, Huanting Zhang, Wei Zhang:
DDLNet: Boosting Remote Sensing Change Detection with Dual-Domain Learning. 1-6 - Qi Jia, Shuilian Yao, Youcan Xu, Yu Liu, Dehao Kong, Longin Jan Latecki:
Fuzzy Boundary-Guided Network for Camouflaged Object Detection. 1-6 - Yutao Rao, Liwei Sun, Junjie Zhang, Haoran Jiang, Jian Zhang, Dan Zeng:
Densely Connected Transformer with Frequency Awareness and Sam Guidance for Semi-Supervised Hyperspectral Image Classification. 1-6 - Jinglin Zhao, Debin Liu, Laurence T. Yang, Ruonan Zhao, Zheng Wang, Zhe Li:
TD3D: Tensor-based Discrete Diffusion Process for 3D Shape Generation. 1-6 - Tingting Li, Gensheng Pei, Xinhao Cai, Qiong Wang, Huafeng Liu, Yazhou Yao:
Universal Organizer of Segment Anything Model for Unsupervised Semantic Segmentation. 1-6 - Jiabang He, Jia Liu, Lei Wang, Xiyao Li, Xing Xu:
MoCoSA: Momentum Contrast for Knowledge Graph Completion with Structure-Augmented Pre-trained Language Models. 1-6 - Zhichao Jiang, Hongsong Wang, Xi Teng, Baopu Li:
Robust 3D Face Alignment with Multi-Path Neural Architecture Search. 1-6 - Zongyuan Jiang, Jiayu Chen, Chongyu Liu, Ning Zhang, Jun Huang, Xue Gao, Lianwen Jin:
RISC: Boosting High-quality Referring Image Segmentation via Foundation Model CLIP. 1-6 - Zhuang Qi, Weihao He, Xiangxu Meng, Lei Meng:
Attentive Modeling and Distillation for Out-of-Distribution Generalization of Federated Learning. 1-6 - Wenyu Li, Zongxin Ye, Sidun Liu, Ziteng Zhang, Xi Wang, Peng Qiao, Yong Dou:
ParaSurRe: Parallel Surface Reconstruction with No Pose Prior. 1-6 - Pengfei Yao, Yinglong Zhu, Tianlu Mao, Hao Jiang, Zhaoqi Wang:
Modeling Scene-Agent Interaction for Pedestrian Trajectory Prediction. 1-6 - Yu Wang, Shengjie Zhao:
Weakly-Supervised Action Localization by Hierarchical Attention Mechanism with Multi-Scale Fusion Strategies. 1-6 - Liwen Hu, Lei Ma, Yijia Guo, Tiejun Huang:
SCSim: A Realistic Spike Cameras Simulator. 1-6 - Guiyu Zhao, Zewen Du, Zhentao Guo, Hongbin Ma:
VRHCF: Cross-Source Point Cloud Registration via Voxel Representation and Hierarchical Correspondence Filtering. 1-6 - Yihong Lu, Jianyi Liu, Ru Zhang:
An Images Regeneration Method for CG Anti-Forensics Based on Sensor Device Trace. 1-6 - Shuhua Wang, Ke Lu, Yang Zhao, Hengsheng Lun, Zehai Niu, Jian Xue:
VS3D: A Vote-Based Semi-Supervised 3D Object Detection Framework for Point Clouds. 1-6 - Ziming Cheng, Xiangning Ruan, Qixiang Yin, Zhicheng Zhao:
The Root Element of Human Poses is Radian: MCPRL is All You Need. 1-6 - Zheng Lin, Zheng-Peng Duan, Xuying Zhang, Luojun Lin:
No-Reference Segmentation Annotation Quality Assessment. 1-6 - Kangze Xu, Ziqiang He, Xiangui Kang, Z. Jane Wang:
Transferable and high-quality adversarial example generation leveraging diffusion model. 1-6 - Jiaxin Chen, Xin Liao, Zhenxing Qian, Zheng Qin:
Multi-domain Probability Estimation Network for Forgery Detection over Online Social Network Shared Images. 1-6 - Hengsheng Lun, Ke Lu, Liping Hou, Shuhua Wang, Jian Xue:
From 3D to 4D: Fixing the Erroneous Coupling between IoU and Angle for Optimizing 3D Object Detection. 1-6 - Xiaogang Du, Meng Yang, Tao Lei, Xuejun Zhang, Yingbo Wang, Asoke K. Nandi:
HSVFormer: Robust and Unsupervised HSV-based Transformer Framework for Low-Light Image Enhancement. 1-6 - Xin Zheng, Ziang Peng, Yuan Cao, Hongming Shan, Junping Zhang:
SIAM: A Simple Alternating Mixer for Video Prediction. 1-10 - Yu Cai, Shihao Gao, Songzhi Su, Xizhi Chen, Xi Wang:
MeshStyle: Text-driven Efficient and High-Quality 3D Mesh Stylization via Hypergraph Convolution. 1-6 - Yijie Wei, Bo Liu, Peng Luan, Yinchi Ma:
Multi-Scale Dense Description for Blind Image Quality Assessment. 1-6 - Zining Chen, Weiqiu Wang, Zhicheng Zhao, Fei Su, Aidong Men:
Selective Cross-Correlation Consistency Loss for Out-of-Distribution Generalization. 1-6 - Guangxing Wu, Junxi Chen, Qiu Li, Wentao Zhang, Wei-Shi Zheng, Ruixuan Wang:
Region Attention Fine-tuning with CLIP for Few-shot Classification. 1-6 - Yang Chen, Yueqi Duan, Runzhong Zhang, Yap-Peng Tan:
Adaptive Margin Contrastive Learning for Ambiguity-aware 3D Semantic Segmentation. 1-6 - Mingrui Xiao, Zijian Zeng, Yue Zheng, Shu Yang, Yali Li, Shengjin Wang:
A Dataset with Multi-Modal Information and Multi-Granularity Descriptions for Video Captioning. 1-6 - Haotian Hu, Bin Jiang, Chao Yang, Xinjiao Zhou, Xiaofei Huo:
ScribbleEditor: Guided Photo-realistic and Identity-preserving Image Editing with Interactive Scribble. 1-6 - Ying Liu, Ge Bai, Chenji Lu, Shilong Li, Zhang Zhang, Ruifang Liu, Wenbin Guo:
Eliminating the Language Bias for Visual Question Answering with fine-grained Causal Intervention. 1-6 - Tian Feng, Jiaheng Wang, Junao Shen, Qiangguo Jin, Zhiyuan Zhu, Xinyu Wang:
Retinal Vessel Segmentation via Cross-attention Feature Fusion. 1-6 - Juncheng Yang, Zuchao Li, Shuai Xie, Weiping Zhu, Wei Yu, Shijun Li:
Cross-Modal Adapter: Parameter-Efficient Transfer Learning Approach for Vision-Language Models. 1-6 - Ting Liu, Xuyang Liu, Siteng Huang, Honggang Chen, Quanjun Yin, Long Qin, Donglin Wang, Yue Hu:
DARA: Domain- and Relation-Aware Adapters Make Parameter-Efficient Tuning for Visual Grounding. 1-6 - Mengxi Zhang, Heqing Lian, Yiming Liu, Jie Chen:
HARIS: Human-Like Attention for Reference Image Segmentation. 1-6 - Jing Zhao, KokSheik Wong, Vishnu Monn Baskaran, Kiki Adhinugraha, David Taniar:
Music Form Analysis: A Case Study of The Theme and Variations Form. 1-6 - Zhigang Wang, Yunpeng Gao, Xun Li, Peipei Gu, Bin Zhao, Xuelong Li:
A Coarse-to-Fine Reconstruction Framework for Non-Lambertian Photometric Stereo. 1-6 - Xiaoxi Lu, Xingyue Wang, Jiansheng Fang, Na Zeng, Jingqi Huang, Chuangguang Huang, Jingfeng Zhang, Jianjun Zheng, Heng Meng, Jiang Liu:
3D Nodule Content-Based Metric Learning for Evidence-Based Lung Cancer Screening. 1-7 - Junjie Kang, Jinsong Wu, Shiqi Jiang:
Photorealistic image style transfer based on explicit affine transformation. 1-8 - Wenjing Wang, Si Li:
Consensus Co-teaching for Dynamically Learning with Noisy Labels. 1-6 - Bingheng Pang, Zhuoxuan Liang, Wei Li, Xiangxu Meng, Chenhao Wang, Yilin Ren:
Brain Waves Unleashed: Illuminating Neonatal Seizure Detection via Multi-scale Hierarchical Modeling. 1-6 - Xiao Fu, Wei Xi, Zhao Yang, Rui Jiang, Dianwen Ng, Jie Yang, Jizhong Zhao:
MRFER: Multi-Channel Robust Feature Enhanced Fusion for Multi-Modal Emotion Recognition. 1-6 - Jianbo Ma, Chuanming Tang, Fei Wu, Can Zhao, Jianlin Zhang, Zhiyong Xu:
STCMOT: Spatio-Temporal Cohesion Learning for UAV-Based Multiple Object Tracking. 1-6 - Zheng Wang, Junkun Zhao, BiFan Lai, XingHuai Zheng:
Structural Highlight Network for Camouflaged Object Detection. 1-6 - Qiong Chen, Yaochi Zhao, Yujia Chen, He Zhang, Zhuhua Hu:
Combining Soft and Hard Attentions for high-quality single-stage instance segmentation. 1-5 - Wajahat Khalid, Bin Liu, Muhammad Waqas:
Clothmix: A Cloth Augmentation Strategy for Cloth-Changing Person Re-Identification. 1-6 - Yujie Liu, Mingyue Li, Jiansen Jing, Yante Li, Guoying Zhao:
Clothing Sampling Based on Active Learning For Cloth-Changing Person Re-identification. 1-6 - Depei Liu, Hongjie Fan, Junfei Liu:
PGDM: Multimodal Panoramic Image Generation with Diffusion Models. 1-6 - Sijing Xie, Chengxin Zhao, Nan Sun, Wei Li, Hefei Ling:
Picking watermarks from noise (PWFN): an improved robust watermarking model against intensive distortions. 1-6 - Zhuo Xie, Haoran Mo, Chengying Gao:
Video-Driven Sketch Animation Via Cyclic Reconstruction Mechanism. 1-6 - Huanting Zhang, Mengting Ma, Xinyu Wang, Jiawei Yang, Xiangdong Li, Wei Zhang:
SSETPAN: Spatial-Spectral Enhanced Transformer based network for pansharpening. 1-6 - Xin Zhou, Tianyang Dong, Jing Fan, Wenyuan Ying, Hubin Kong:
ODNet: Orthogonal-Perception and Dense-dilation Enhanced Network for Segmenting Complex Tree Branch Structures. 1-6 - Ruizhou Liu, Zongsheng Cao, Zhe Wu, Qianqian Xu, Qingming Huang:
Multimodal Knowledge Graph Embeddings via Lorentz-based Contrastive Learning. 1-6 - Haitao Yao, Zhenwei Wang, Mingli Zhang, Wen Zhu, Lizhi Zhang, Lijun He, Jianxin Zhang:
Second-Order Self-Supervised Learning for Breast Cancer Classification. 1-6 - Daowu Yang, Ying Liu, Qiyun Yang, Ruihui Li:
Talking Portrait with Discrete Motion Priors in Neural Radiation Field. 1-6 - Junjie Yang, Hao Wu, Ji Zhang, Lianli Gao, Jingkuan Song:
Effective and Efficient Few-shot Fine-tuning for Vision Transformers. 1-6 - Yulun Wu, Yaolong Ju, Simon Lui, Jing Yang, Fan Fan, Xuhao Du:
Cycle Frequency-Harmonic-Time Transformer for Note-Level Singing Voice Transcription. 1-6 - Jie Luo, Xin Jin, Mingyu Liu, Yihui Fan:
TrafficScene: A Multi-modal Dataset including Light Field for Semantic Segmentation of Traffic Scenes. 1-6 - Yuxuan Mu, Shihao Zou, Kangning Yin, Zheng Tian, Li Cheng, Weinan Zhang, Jun Wang:
RACon: Retrieval-Augmented Simulated Character Locomotion Control. 1-6 - Yang Li, Songlin Yang, Wei Wang, Ziwen He, Bo Peng, Jing Dong:
Counterfactual Explanations for Face Forgery Detection via Adversarial Removal of Artifacts. 1-6 - Haiyan Jin, Yifan Shuai, Fengyuan Zuo, Haonan Su, Zhaolin Xiao, Bin Wang, Yuanlin Zhang:
A Channel-Wise Guidance Sparse Transformer for Effective Dark Image Enhancement. 1-6 - Zongyao He, Zhi Jin:
Dynamic Implicit Image Function for Efficient Arbitrary-Scale Super-Resolution. 1-6 - Yuebin Xie, Xiaochen He, Baoyao Yang, Fei Lyu, Siqi Liu:
CAM-Guided Translation for Unpaired Weakly-Supervised Medical Image Segmentation. 1-6 - Zihan Niu, Zheyong Xie, Tong Xu, Xiangfeng Wang, Yao Hu, Ying Yu, Enhong Chen:
Knowledge-Enhanced Multi-perspective Incongruity Perception Network for Multimodal Sarcasm Detection. 1-6 - Haoxuan Wang, Ping Wei, Shuaijia Chen, Zhimin Liao, Jialu Qin:
Local-to-Global Perception Network for Point Cloud Segmentation. 1-6 - Jiacheng Su, Kunhong Liu, Liyan Chen, Junfeng Yao, Qingsong Liu, Dongdong Lv:
Audio-driven High-resolution Seamless Talking Head Video Editing via StyleGAN. 1-6 - Meng Wang, Xiaojie Guo, Jiawan Zhang:
FNFORMER: A Transformer-Based Face Normal Estimator. 1-6 - Hanting Li, Hongjing Niu, Zhaoqing Zhu, Feng Zhao:
CLIPER: A Unified Vision-Language Framework for In-the-Wild Facial Expression Recognition. 1-6 - Chuanfei Hu, Hang Shao, Bo Dong, Zhe Wang, Yongxiong Wang:
ASD: Towards Attribute Spatial Decomposition for Prior-Free Facial Attribute Recognition. 1-9 - Dawei Dai, Yingge Liu, Shiyu Fu, Guoyin Wang:
Multimodal Image-Text Representation Learning for Sketch-Less Facial Image Retrieval. 1-6 - Junkun Hong, Yitian Long, Yueyi Luo, Qianqian Qi, Jun Long:
Multi-feature and Multi-branch Action Segmentation Framework for Modeling Long-Short-Term Dependencies. 1-6 - Chuqiao Wu, Haitao Huang, Wenming Yang:
Diffusion based Coarse-to-Fine Network for 3D Human Pose and Shape Estimation from monocular video. 1-6 - Jingru Wang, Xinguang Xiang:
Multi-scale Transformer with Prompt Learning for Remote Sensing Image Dehazing. 1-6 - Liman Jiang, Canlong Zhang, Lei Wu, Zhixin Li, Zhiwen Wang, Chunrong Wei:
Mask-guided Salient Feature Mining for Cloth-Changing Person Re-identification. 1-6 - Haoran Mo, Xusheng Lin, Chengying Gao, Ruomei Wang:
Text-Based Vector Sketch Editing with Image Editing Diffusion Prior. 1-6 - Zhibo Zhang, Ximing Yang, Weizhong Zhang, Cheng Jin:
ELiTe: Efficient Image-to-LiDAR Knowledge Transfer for Semantic Segmentation. 1-6 - Jianhao Fu, Xiang Ling, Yaguan Qian, Changjiang Li, Tianyue Luo, Jingzheng Wu:
Towards Query-Efficient Decision-Based Adversarial Attacks Through Frequency Domain. 1-6 - Zemin Tang, Min Shi, Zhibang Yang, Xu Zhou, Cen Chen, Joey Tianyi Zhou:
Sentiment Confidence Separation: A Trust-Optimized Framework for Multimodal Sentiment Classification. 1-6 - Yiming Tang, Yi Yu, Yan Qiu Chen:
Prototype-Guided Prior Enhancement and Rectification in Few-shot Semantic Segmentation. 1-6 - Qingfeng Zheng, Peijia Zheng, Weiqi Luo, Wei Lu:
A Fast and Tunable Privacy-Preserving Action Recognition Framework over Compressed Video. 1-6 - Jaakko Laitinen, Tero Partanen, Alexandre Mercat, Jarno Vanne, Miska M. Hannuksela, Honglei Zhang, Alireza Aminlou, Francesco Cricri:
Feasibility Study of Multi-Layer VVC Coding Scheme for Hybrid Machine-Human Consumption. 1-6 - Longjie Qi, Yue Ding, Hongtao Lu:
CGCUT: Unpaired Image-to-Image Translation via Cluster-Guided Contrastive Learning. 1-6 - Jiayi Lyu, Xing Lan, Guohong Hu, Hanyu Jiang, Wei Gan, Jian Xue:
ETAU: Towards Emotional Talking Head Generation Via Facial Action Unit. 1-6 - Yuhao Gao, Gensheng Pei, Mengmeng Sheng, Zeren Sun, Tao Chen, Yazhou Yao:
Relating CNN-Transformer Fusion Network for Remote Sensing Change Detection. 1-6 - Qianrui Teng, Rui Wang, Xing Cui, Peipei Li, Zhaofeng He:
Exploring 3D-aware Lifespan Face Aging via Disentangled Shape-Texture Representations. 1-6 - Xinlong Ding, Hongwei Yu, Jiansheng Chen, Jinlong Wang, Jintai Du, Huimin Ma:
Invisible Pedestrians: Synthesizing Adversarial Clothing Textures To Evade Industrial Camera-Based 3D Detection. 1-6 - Zhiwei Dong, Xi Zhu, Xiya Cao, Ran Ding, Caifa Zhou, Wei Li, Yongliang Wang, Qiangbo Liu:
BézierFormer: A Unified Architecture for 2D and 3D Lane Detection. 1-6 - Yicheng Pan, Zhenrong Zhang, Jiefeng Ma, Pengfei Hu, Jun Du, Qing Wang, Jianshu Zhang, Dan Liu, Si Wei:
Maths: Multimodal Transformer-Based Human-Readable Solver. 1-6 - Qi Li:
Parameter Efficient Fine-Tuning on Selective Parameters for Transformer-Based Pre-Trained Models. 1-6 - Jiancheng Huang, Mingfu Yan, Yifan Liu, Shifeng Chen:
Color-SD: Stable Diffusion Model Already has a Color Style Noisy Latent Space. 1-6 - Siyu Xing, Chen Gong, Hewei Guo, Xiao-Yu Zhang, Xinwen Hou, Yu Liu:
GAN Inversion for Image Editing via Unsupervised Domain Adaptation. 1-6 - Yongkang Ding, Rui Mao, Hanyue Zhu, Anqi Wang, Liyan Zhang:
Discriminative Pedestrian Features and Gated Channel Attention for Clothes-Changing Person Re-Identification. 1-6 - Zheng Wang, Bowen Tang, Yi Bin, Lei Zhu, Guoqing Wang, Yang Yang:
Shapley Ensemble Adversarial Attack. 1-6 - Haitao Cao, Baoping Cheng, Qiran Pu, Haocheng Zhang, Bin Luo, Yixiang Zhuang, Juncong Lin, Liyan Chen, Xuan Cheng:
DNPM: A Neural Parametric Model for the Synthesis of Facial Geometric Details. 1-6 - Yule Liu, Zhuben Dong, Shenglan Liu, Wujun Wen, Lin Feng:
Two-Step Temporal Divisive Clustering for Unsupervised Action Segmentation. 1-6 - Songlin Li, Xiuhong Li, Zhe Li, Hongbing Ma, Jiabao Sheng, Boyuan Li:
Dual Guidance Enhancing Camouflaged Object Detection via Focusing Boundary and Localization Representation. 1-6 - Xuanxi Chen, Ziqian Shao, Tong Lu:
SVT: Spectral Video Transformer for Video Restoration in Under-Display Camera. 1-6 - Pengfei Hu, Xiuzhe Wu, Yang Wu, Wenming Yang:
PortraitNeRF: A Single Neural Radiance Field for Complete and Coordinated Talking Portrait Generation. 1-6 - Shizhuo Deng, Da Teng, Zhubao Guo, Jiaqi Chen, Dongyue Chen, Tong Jia, Hao Wang:
Self-Supervised Federated Learning for Personalized Human Activity Recognition. 1-6 - Ke Cao, Xuanhua He, Keyu Yan, Tao Hu, Rui Li, Chengjun Xie, Jie Zhang:
Frequency Decomposition-Driven Network for JPEG Artifacts Removal. 1-6 - Xing Wei, Zhaoxin Ji, Bin Wen, Fan Yang, Chong Zhao, Yang Lu:
Unsupervised Multi-Target Domain Adaptation Incremental Method Based on Contrastive Learning. 1-6 - Zihang Huang, Yukun Yang, Tianyu Zhao, Xin Yang:
A Noise Robust Framework via Uncertainty Guidance for Medical Image Segmentation with Noisy Label. 1-6 - Seunghwan Lee, Gwanmo Park, Hyewon Son, Jiwon Ryu, Han Joo Chae:
InFusionSurf: Refining Neural RGB-D Surface Reconstruction Using Per-Frame Intrinsic Refinement and TSDF Fusion Prior Learning. 1-6 - Wuyang Chen, Kele Xu, Yong Dou, Tian Gao:
Voice-to-Face Generation: Couple of Self-Supervised Representation Learning with Diffusion Model. 1-6 - Xuan Wu, Liang Chen, Ming Tan, Yi Wu:
Convolutional Modulation Feature Distillation Network for Image Super-resolution. 1-6 - Yuyang Ji, Lianlei Shan:
LDNET: Semantic Segmentation Of High-Resolution Images Via Learnable Patch Proposal And Dynamic Refinement. 1-6 - Xinyu Li, Xing Wang, Xiaoxiao Yang, Suping Wu, Xiangzheng Li, Xitie Zhang, Zhiyuan Zhou, Xiang Zhang:
Towards Accurate 3D Face Alignment Under Extreme Scenarios Via Multi-Granularity Perturbation Relearning. 1-6 - Xinyu Zhang, Hefei Huang, Xu Jia, Wenyue Chen, Dong Wang, Shengming Li, Huchuan Lu:
Multi-Stage Fusion for Event-based Multimodal Tracker. 1-6 - Xiaotong Chen, Shikui Wei, Gangjian Zhang, Yao Zhao:
Multi-granular Semantic Mining for Composed Image Retrieval. 1-6 - Hangjie Yi, Yuhang Ming, Dongjun Liu, Wanzeng Kong:
Time-Frequency Jointed Imperceptible Adversarial Attack to Brainprint Recognition with Deep Learning Models. 1-6 - Shibiao Xu, ShuChen Zheng, Wenhao Xu, Rongtao Xu, Changwei Wang, Jiguang Zhang, Xiaoqiang Teng, Ao Li, Li Guo:
HCF-Net: Hierarchical Context Fusion Network for Infrared Small Object Detection. 1-6 - Zhangbin Qian, Jiawei Tan, Zhilong Ou, Hongxing Wang:
CLIP-Driven Multi-Scale Instance Learning for Weakly Supervised Video Anomaly Detection. 1-6 - Haoquan Wang, Shengbo Chen, Xijun Wang, Hong Rao, Yong Chen:
Defending Against Backdoor Attacks via Region Growing and Diffusion Model. 1-6 - Xiaorong Ma, Jiahe Tian, Yu Cai, Yesheng Chai, Zhaoxing Li, Jiao Dai, Liangjun Zang, Jizhong Han:
HIDD: Human-perception-centric Incremental Deepfake Detection. 1-6 - Le Zhang, Tong Li, Yao Lu, Mixiao Hou, Guangming Lu:
Efficient U-Shape Invertible Neural Network for Image Steganography. 1-7 - Xin Liu, Yali Li, Shengjin Wang:
Representation Distillation for Efficient Self-Supervised Learning. 1-6 - Haoran Zhang, Xi Lin, Suxian Xiang, Chenxi Huang, Lvqing Yang, Yan Wang:
Boundary Contrast Domain Adaptation for Cross-modality Medical Image Segmentation. 1-6 - Chenyi Zhu, Dengshi Li, Aolei Chen, Yu Gao, Wei Li, Xi Wang:
Noise Adaptive Fine-grained Speech Intelligibility Enhancement With Soft-label Guided Diffusion. 1-6 - Jinyang An, Wanqian Zhang, Dayan Wu, Zheng Lin, Jingzi Gu, Weiping Wang:
SD4Privacy: Exploiting Stable Diffusion for Protecting Facial Privacy. 1-6 - Yu Lu, Kevin Bui, Roummel F. Marcia:
Alternating Direction Method of Multipliers for Negative Binomial Model with the Weighted Difference of Anisotropic and Isotropic Total Variation. 1-6 - Wenjing Wang, Si Li:
Focusing on All Refined Attention Regions for Noisy Label Facial Expression Recognition. 1-6 - Ziang Li, Chengxiang Si, Zhenyu Cheng, Shuyuan Zhao, Yong Ding:
MTDM-MS: A Malicious Traffic Detection Model Based on Multi-Category Signals. 1-6 - Peilin Xiao, Yueyi Zhang, Dachun Kai, Yansong Peng, Zheyu Zhang, Xiaoyan Sun:
ESTME: Event-driven Spatio-temporal Motion Enhancement for Micro-Expression Recognition. 1-6 - Xiaolin Huang, Biqing Zeng, Jiahui Pan, Yujiang Yao, Zheng Zhou, Bingzhi Chen:
Ambiguity Consistency and Uncertainty Minimization for Semi-Supervised Medical Image Segmentation. 1-6 - Xiaoyu Qiu, Yuechen Wang, Jiaxin Shi, Wengang Zhou, Houqiang Li:
Cross-Lingual Transfer for Natural Language Inference via Multilingual Prompt Translator. 1-6 - Ben Chen, Xuechao Zou, Kai Li, Yu Zhang, Junliang Xing, Pin Tao:
High-Fidelity Lake Extraction Via Two-Stage Prompt Enhancement: Establishing A Novel Baseline and Benchmark. 1-6 - Yugan Chen, Lin Zhao, Yalong Xu, Honglei Zu, Xiaoqi An, Guangyu Li:
Domain Adaptive Pose Estimation Via Multi-level Alignment. 1-6 - Mengjiao Zhao, Mengting Ma, Xiangdong Li, Xiaowen Ma, Xinyu Wang, Ao Gao, Wei Zhang:
DuCoFPan: Dual-Condition Flow-based Network for Pan-sharpening. 1-6 - Laurie Van Bogaert, Armand Losfeld, Gauthier Lafruit, Mehrdad Teratani:
Single RGBD to Multilayer 3D Display Pipeline. 1-6 - Yanping Li, Zhaoshuai Qi, Xiuwei Zhang, Tao Zhuo, Yue Liang, Yanning Zhang:
Edge-Guided Detector-Free Network for Robust and Accurate Visible-Thermal Image Matching. 1-6 - Yulan Gao, Zhaoxiang Hou, Chengyi Yang, Zengxiang Li, Han Yu, Xiaoxiao Li:
The Prospect of Enhancing Large-Scale Heterogeneous Federated Learning with Foundation Models. 1-6 - Qingmao Wei, Bi Zeng, Guotian Zeng:
Learning Motion Priors with DETR for Visual Tracking. 1-6 - Yang Zhang, Yue Zhou, Zonghao Yang, Ao Chen:
Cross-modal Prominent Fragments Enhancement Aligning Network for Image-text Retrieval. 1-6 - Qingzhi He, Rong Quan, Weifeng Yang, Jie Qin:
Visual Feature Disentanglement for Zero-Shot Learning. 1-6 - Zenghao Guan, Yucan Zhou, Xiaoyan Gu, Bo Li:
GIE : Gradient Inversion with Embeddings. 1-6 - Haonan Lin, Wenbin An, Yan Chen, Feng Tian, Yuzhe Yao, Wei Ding, Qianying Wang, Ping Chen:
A Tri-Branch Network with Prototype-aware Matching for Universal Category Discovery. 1-6 - Xin Liu, Yue Xu, Kun He:
Improving the Sar Image Adversarial Transferability Through Dual-Loop Ensemble Gradient Attack. 1-6 - Yuan Liu, Shu Wang, Zhe Qu, Xingyu Li, Shichao Kan, Jianxin Wang:
FedGCA: Global Consistent Augmentation Based Single-Source Federated Domain Generalization. 1-6 - Chengjie Wang, Chengming Xu, Zhenye Gan, Yuxi Li, Jianlong Hu, Wenbing Zhu, Lizhuang Ma:
PSPU: Enhanced Positive and Unlabeled Learning by Leveraging Pseudo Supervision. 1-6 - Fengqi Li, Mengchao Guo, Fengqiang Xu, Renxuan Xiong, Xiaohong Yan, Qian Sun, Deguang Wang:
STformer: Advancing Video Deraining Network Integrating with Spatial Transformers and Multiscale Feature Extraction. 1-6 - Chenqu Ren, Yeheng Shao, Haolei Qiu:
PVRF: Single-Plane and Single-Vector for Memory-Efficient Radiance Fields. 1-6 - Leqi Shen, Tao He, Sicheng Zhao, Zhelun Shen, Yuchen Guo, Tianshi Xu, Guiguang Ding:
X-ReID: Cross-Instance Transformer for Identity-Level Person Re-Identification. 1-6 - Liang Shi, Fuyong Xu, Ru Wang, Yongqing Wei, Guangjin Wang, Bao Wang, Peiyu Liu:
Information Aggregate and Sentiment Enhance Network to Handle Missing Modalities for Multimodal Sentiment Analysis. 1-6 - Sumei Li, Hangwei Liang, Mingxuan Xie, Xiaofei He:
Multi-Scale and Multi-Patch Aggregation Network Based on Dual-Column Vision Fusion for Image Aesthetics Assessment. 1-6 - Shuai Zhao, Tuo Li, Boyuan Zhang, Yang Zhai, Ziyi Liu, Yahong Han:
Improving Transferability of Adversarial Examples with Adversaries Competition. 1-6 - Zhen Wang, Dianxi Shi, Chunping Qiu, Songchang Jin, Tongyue Li, Yanyan Shi:
ICF-Loc: An Infrared-Based Coarse-to-Fine Approach for UAV Visual Geolocation under GPS-Denied Environments. 1-6 - Yu Wu, Haiguang Wang, Mengxia Wu, Min Cao, Min Zhang:
LAIP: Learning Local Alignment from Image-Phrase Modeling for Text-based Person Search. 1-10 - Yuanwen Chen, Xinyao Zhang, Yaran Chen, Dongbin Zhao, Yunzhen Zhao, Zhe Zhao, Pengfei Hu:
Common Sense Language-Guided Exploration and Hierarchical Dense Perception for Instruction Following Embodied Agents. 1-6 - Qian Li, Cheng Wen, Rao Fu:
Improving Few-Shot Neural Radiance Field with Image Based Rendering. 1-6 - Liang He, Zhida Song, Shuanghong Liu, Mengqi Niu, Ying Hu, Hao Huang:
Speaker Recognition Based on Pre-Trained Model and Deep Clustering. 1-6 - Yige Wang, Risheng Huang, Haozhi Huang, Zongqing Lu:
FusionDreamer: Consistent Images Generation from Sparse-view Images. 1-6 - Shuoqian Wang, Mengbai Xiao, Yao Liu:
RoIRTC: Toward Region-of-Interest Reinforced Real-Time Video Communication. 1-6 - Yuejian Fang, Xiaodong Wang:
Enhancing Zero-shot 3D Photography via Mesh-represented Image Inpainting. 1-6 - Zezeng Li, Weimin Wang, Ziliang Wang, Na Lei:
Point Cloud Compression via Constrained Optimal Transport. 1-6 - Jingmou Xian, Jian Zhu, Haolin Liao, Si Li:
Frequency-regularized Neural Representation Method for Sparse-view Tomographic Reconstruction. 1-6 - Ziliang Gan, Lei Jin, Lei Nie, Zheng Wang, Li Zhou, Liang Li, Zhecan Wang, Jianshu Li, Junliang Xing, Jian Zhao:
ASQuery: A Query-based Model for Action Segmentation. i-vi - Yufeng Wang, Wensen Feng, Haoqian Wang:
EyebrowNet: High-Precision Eyebrow Reconstruction and Matting. 1-6 - Bingzhi Chen, Haoming Zhou, Yishu Liu, Biqing Zeng, Jiahui Pan, Guangming Lu:
Enhancing Few-Shot Classification without Forgetting Through Multi-level Contrastive Constraints. 1-6 - Xiangwen Deng, Yufeng Wang, Yuanhao Cai, Jingxiang Sun, Yebin Liu, Haoqian Wang:
ITportrait: Image-Text Coupled 3D Portrait Domain Adaptation. 1-6 - Naifu Xue, Qi Mao, Zijian Wang, Yuan Zhang, Siwei Ma:
Unifying Generation and Compression: Ultra-low bitrate Image Coding Via Multi-stage Transformer. 1-6 - Zhenqiang Zhang, Chuantao Li, Jian Song, Jialiang Lv, Chunxiao Wang, Zhigang Zhao, Jidong Huo:
STUI-NET: Semi-Supervised Transformer for Underwater Information Enhancement. 1-6 - Xiao Kang, Xingbo Liu, Xuening Zhang, Wen Xue, Xiushan Nie, Shaohua Wang, Yilong Yin:
Unsupervised Online Cross-modal Hashing With Multiple Association Exploitation. 1-6 - Yongkang Cheng, Mingjiang Liang, Shaoli Huang, Jifeng Ning, Wei Liu:
ExpGest: Expressive Speaker Generation Using Diffusion Model and Hybrid Audio-Text Guidance. 1-6 - Wudi Chen, Chao Zhang, Cheng Han, Yanjie Ma, Yongqing Cai:
Sttcnerf: Style Transfer of Neural Radiance Fields for 3d Scene Based on Texture Consistency Constraint. 1-6 - Anustup Choudhury, Praneet Singh, Guan-Ming Su:
NeRVA: Joint Implicit Neural Representations for Videos and Audios. 1-6 - Shengjia Zhang, Suping Wu:
GFAvatar: A High-Quality Facial Avatar Reconstruction Method. 1-6 - Zejun He, Fei Chen, Fan Jiang, Wanling Liu, Zhangyan Ye:
A Dual-Branch Network Based on Connectivity Mask for Retinal Vessel Segmentation. 1-6 - Tianyang Dong, Huanbo Zhang, Hubin Kong, Shuqian Lv, Fenghao Li:
Align-RDW: Alignment-based Redirected Walking for Multi-User VR scenarios. 1-6 - Boyuan Li, Xiuhong Li, Songlin Li, Yuye Zhang, Kangwei Liu:
Adaptive Feature Fusion Network for Infrared Small Target Detection. 1-6 - Fan Dai, Yun Zhu, Yaqi Shen, Jin Xie, Jianjun Qian:
Dense Voxel Representation Network for Implicit Scene Completion. 1-6 - Jilin Tang, Lincheng Li, Xingqun Qi, Yingfeng Chen, Changjie Fan, Xin Yu:
AS-NeRF: Learning Auxiliary Sampling for Generalizable Novel View Synthesis from Sparse Views. 1-6 - Huizhen Ji, Yaohua Zha, Qingmin Liao:
LR-MAE: Locate while Reconstructing with Masked Autoencoders for Point Cloud Self-supervised Learning. 1-6 - Tian Zhang, Kongming Liang, Ke Zhang, Zhanyu Ma:
Learning Conditional Prompt for Compositional Zero-Shot Learning. 1-6 - Zeqi Wu, Yuefeng Ma:
I2CL-ANE: A Novel Attribute Network Embedding based on Intra-Inter View Contrastive Learning. 1-6 - Njuod Alsudays, Jing Wu, Yu-Kun Lai, Ze Ji:
GRPSNET: Multi-Class Part Parsing Based on Graph Reasoning. 1-10 - Yung-Wei Fan, Sheng-Chun Huang, Shao-Yi Chien:
Graph Attention Convolutional Network for 3D Human Pose and Shape Estimation from Point Clouds. 1-6 - Zhiwei Xiong, Yunfan Zhang, Zhiqi Shen, Peiran Ren, Han Yu:
Multi-modal Learnable Queries for Image Aesthetics Assessment. 1-6 - Jiacheng Ruan, Jingsheng Gao, Mingye Xie, Daize Dong, Suncheng Xiang, Ting Liu, Yuzhuo Fu:
iDAT: inverse Distillation Adapter-Tuning. 1-6 - Yihang Zhang, Yun Liang, Shitong Weng, Hai Lin, Liping Chen, Shenlong Zheng:
Hierarchical Temporal Attention and Competent Teacher Network for Sound Event Detection. 1-6 - Dongze Hao, Qunbo Wang, Jing Liu:
Semantic-Visual Graph Reasoning for Visual Dialog. 1-6 - Haoran Jiang, Xiangjie Wang, Junjie Zhang, Jian Zhang, Dan Zeng:
DSENet: An Object-Wise Density-Informed Coarse-to-Fine Object Detector for Aerial Image. 1-6 - Xicheng Chen, Haibo Ye, Fangyu Zhou:
Class-Aware Feature Perturbation for Long-Tailed Visual Recognition. 1-6 - Yuhang Cheng, Ziyang Fan, Hongyu Wu, Xiaogang Wang:
High-Order Differential Regularizing Implicit Surface Representation of Point Cloud. 1-6 - Yaxin Liu, Yan Zhou, Ziming Li, Jinchuan Zhang, Yu Shang, Chenyang Zhang, Songlin Hu:
RNG: Reducing Multi-level Noise and Multi-grained Semantic Gap for Joint Multimodal Aspect-Sentiment Analysis. 1-6 - Haoyu Deng, Yanmei Fang, Fangjun Huang:
Enhancing Adversarial Transferability on Vision Transformer by Permutation-Invariant Attacks. 1-6 - Pengyu Wang, Jianmin Li, Wenbo Ding, Jiachen Zhong, Jianyong Ai:
Correcting Pseudo Labels in Semi Supervised Object Detection with SAM. 1-6 - Min Zhang, Zifeng Zhuang, Zhitao Wang, Donglin Wang:
RotoGBML: Towards Out-of-distribution Generalization for Gradient-based Meta-learning. 1-6 - Qi Jia, Zikun Zhao, Xiaomei Feng, Jinyuan Liu, Yu Liu, Xinwei Xue:
Joint edge detection learning for recurrent homography estimation. 1-6 - Xuewei Li, Yujie Diao, Mei Yu, Chenhan Wang, Jie Gao, Ruiguo Yu:
Area Intervention for Enhancing Class Activation Maps in Weakly Supervised Semantic Segmentation. 1-6 - Kang Zhu, Cunhang Fan, Jianhua Tao, Jun Xue, Heng Xie, Xuefei Liu, Yongwei Li, Zhengqi Wen, Zhao Lv:
Dual-View Multimodal Interaction in Multimodal Sentiment Analysis. 1-6 - Wang Yang, Lingchen Zhao, Dengpan Ye:
Reputation Defender: Local Black-Box Adversarial Attack against Image-Translation-Based DeepFake. 1-6 - Haifei Duan, Shenglan Liu, Chenwei Tan, Yuning Ding, Jirui Tian, Feilong Wang:
Decoupling Spatio-Temporal Network for Fine-Grained Temporal Action Segmentation. 1-6 - Wenjing Zhu, Sining Sun, Changhao Shan, Peng Fan, Qing Yang:
Skipformer: A Skip-and-Recover Strategy for Efficient Speech Recognition. 1-6 - Hao Li, Jinlong Wang, Hanxiang Yang, Xiongxin Tang, Fanjiang Xu:
Learning Semantic-aware Retinex Network with Spatial-Frequency Interaction for Low-light Image Enhancement. 1-6 - Yuxin Huang, Yiwei Yuan, Xiangyu Zeng, Ling Xie, Yiyu Fu, Guanghui Yue, Baoquan Zhao:
Full-Reference Motion Quality Assessment Based on Efficient Monocular Parametric 3D Human Body Reconstruction. 1-6 - Liang Zhao, Yukun Yuan, Qiongjie Xie, Ziyue Wang:
Anchor Based Multi-view Clustering for Partially View-Aligned Data. 1-5 - Li Fang, Kaijun Zou, Zhiye Chen, Long Ye:
HMDST: A Hybrid Model-Data Driven Approach for Spatio-Temporally Consistent Video Inpainting. 1-6 - Ke Chen, Zhihua Huang, Kexin Lu, Yonghong Yan:
CosDiff: Code-Switching TTS Model Based on A Multi-Task DDIM. 1-6 - Changsheng Chen, Yongyi Deng, Liangwei Lin, Zitong Yu, Zhimao Lai:
Multi-Modal Document Presentation Attack Detection with Forensics Trace Disentanglement. 1-8 - Hongjing Su, Fuxiang Lu:
HctMAE: Hybrid Convolution-Transformer Meets Masked Autoencoder for Plant Recognition. 1-6 - Chengxiang Fan, Aohong Shen, Zhen Han, Cai Tong, Zhongyuan Wang, Dekang Yi:
Dual-Domain Multi-Model GAN Fingerprint Restoration for Compressed Fake Face Attribution. 1-6 - Zijian Zhang, Ruiguo Yu, Xi Wei, Jie Gao, Mei Yu, Xuewei Li, Zhiqiang Liu:
Unsupervised Domain Adaptation Semantic Segmentation on Thyroid Ultrasound Images Based on Task-Oriented Feature Disentanglement. 1-6 - Chenglin Liu, Binquan Wang, Ming Zhu:
ReCo-CXR: A Self-Supervised Pre-Training Framework for Pulmonary Nodule Detection in X-Ray Images. 1-6 - Yang Yu, Chen Xu, Kai Wang:
TS-SAM: Fine-Tuning Segment-Anything Model for Downstream Tasks. 1-6 - Yaoxin Wu, Hongwei Ding, Yunqi Liu, Zerui Wen, Xiaohui Cui:
Synthetic Data Augmentation for Infrared Small Target Detection via Exploring Frequency Components and Targets Prior. 1-6 - Xuewan He, Jielei Wang, Qianxin Xia, Guoming Lu, Yuan Tang, Hongxia Lu:
Cross-Domain Feature Semantic Calibration for Zero-Shot Sketch-Based Image Retrieval. 1-6 - Dan Yang, Xiuhong Li, Zhe Li, Chenyu Zhou, Xiaofan Wang, Fan Chen:
Prompt Fusion Interaction Transformer For Aspect-Based Multimodal Sentiment Analysis. 1-6 - Kaihao Lin, Guoqing Wang, Yuhui Wu, Shuhang Gu, Xing Xu, Yang Yang:
Domain Prompt Learning Framework for Real Image Dehazing. 1-6 - Jintai Du, Jinlong Wang, Jiansheng Chen, Xinlong Ding, Jiehui Wu, Tianyu Hu, Huimin Ma:
Analyzing Behavior and Intention in Multi-Agent Systems Using Graph Neural Networks. 1-6 - Haichuan Song, Zhihong Zheng, Zhizhong Zhang, Yuan Xie, Guchu Zou, Zhenyi Qi, Xin Tan:
Mutual Positive and Negative Learning for Weakly-supervised Point Cloud Semantic Segmentation. 1-6 - Zhao Wu, Dunbo Ning, Wenjing Chen, Hao Sun, Wei Xie, Ming Dong:
Spatial Dual Context Learning for Weakly-supervised Group Activity Recognition in Still-images. 1-6 - Taizhang Hu, Fan Yang, Xing Wei, Chong Zhao, Li Meng, Bin Wen, Yang Lu:
BTC: Bilateral-Branch Vision Transformer via Hilbert Patch Embedding for Image Clustering. 1-6 - Chenbin Zhang, Zhiqiang Hu, Shuyu Dai, Qingyuan He, Defeng Liu, Kun Yan, Ping Wang:
Boundary-Aware Contrastive Learning for Single-Source Domain Generalization in Medical Image Segmentation. 1-6 - Honghui Xu, Yueqian Quan, Chuangjie Fang, Jianwei Zheng:
Robust Principal Component Analysis via High-Order Self-Learning Transform Tensor Nuclear Norm. 1-6 - Yuming Yang, Dongsheng Zou:
AdaStyleSpeech: A Fast Stylized Speech Synthesis Model Based on Adaptive Instance Normalization. 1-6 - Xinfa Zhu, Yuke Li, Yi Lei, Ning Jiang, Guoqing Zhao, Lei Xie:
Boosting Multi-Speaker Expressive Speech Synthesis with Semi-Supervised Contrastive Learning. 1-6 - Bo Kong, Shengquan Liu, Liang He, Liruizhi Jia, Yi Liang:
CSMA-CNER: Multi-modal Chinese NER task with Cross- and Self-Modality Attention. 1-6 - Rao Fu, Qian Li, Cheng Wen, Ning An, Fulin Tang:
A Region-Growing Supervised Geometry-Weighted Transformer for Normal Estimation. 1-6 - Haozheng Zhang, Yanhong Yang, Zhixuan Jing, Shengyong Chen:
DA-LGNet: Enhancing Spatial-Spectral feature representation with Dual-Attention Local-General Network for Hyperspectral images and Multispectral images Fusion. 1-6 - Gongxin Yao, Xinyang Li, Yixin Xuan, Yu Pan:
MaFreeI2P: A Matching-Free Image-to-Point Cloud Registration Paradigm with Active Camera Pose Retrieval. 1-6 - Luojun Lin, Qipeng Liu, Xiangwei Zheng, Zheng Lin:
Slow-Fast Adaptation for Source-Free Object Detection. 1-6 - Shaoqi Yu, Lili Chen, Xiaolin Zhang, Jiamao Li:
VTR: Bidirectional Video-Textual Transmission Rail for CLIP-based Video Recognition. 1-6 - Yulin He, Wei Chen, Zhengfa Liang, Ke Liang, Yusong Tan, Tianrui Liu, Yulan Guo:
Don't Turn a Blind Eye to Localization Noise: Localization Pseudo-label Correction and Learning for Semi-Supervised Object Detection. 1-6 - Qin Lei, Rui Yang, Jiang Zhong, Rongzhen Li, Muyang He, Mianxiong Dong, Kaoru Ota:
Expanding Crack Segmentation Dataset with Crack Growth Simulation and Feature Space Diversity. 1-6 - Yuan-Yuan Liu, Song-Lu Chen, Qi Liu, Feng Chen, Xu-Cheng Yin:
Towards Low-resource License Plate Recognition via Feature Shuffling. 1-6 - Yadang Chen, Wentao Zhu, Zhi-Xin Yang, Enhua Wu:
Space-time Reinforcement Network for Video Object Segmentation. 1-6 - Ying Tang, Wei Yang, Junqing Yu, Zikai Song:
Agnostic Feature Compression with Semantic Guided Channel Importance Analysis. 1-6 - Yuwei Feng, Gang Zhou, Sen Yang, Jiang Zhang, Jing Ma, Zhenhong Jia:
Intermediate Domain Meets Natural Hazy Tracking. 1-6 - Fanxu Min, Shaoxiang Guo, Hao Fan, Junyu Dong:
GaitMA: Pose-guided Multi-modal Feature Fusion for Gait Recognition. 1-6 - Lintao Zhang, Xiangcheng Du, LeoWu TomyEnrique, Yiqun Wang, Yingbin Zheng, Cheng Jin:
Minutes to Seconds: Speeded-up DDPM-based Image Inpainting with Coarse-to-Fine Sampling. 1-6 - Ru Zhen, Xingtao Zhang, Chao Min, Biao Li:
Winner Takes It All: An Efficient Overlap-Aware Hybrid Online Diarization with Partial Backtracking Mechanism. 1-6 - Yucheng Shu, Jiaxin Xie, Lihong Qiao, Bin Xiao, Weisheng Li, Xinbo Gao:
C3T: Contrastive Consistency Cross-Network Learning for Semi-Supervised Semantic Segmentation. 1-6 - Mingyu Wu, Zhiyi Tan, Bing-Kun Bao:
Inferring the effectiveness of epidemic prevention measures based on spatial heterogeneity modeling. 1-6 - Jiawei Feng, Ruomei Wang, Mingyang Liu, Yuanmao Luo, Fuwei Zhang:
Frequency-Domain Enhanced Cross-modal Interaction Mechanism for Joint Video Moment Retrieval and Highlight Detection. 1-8 - Xueqiang Sun, Jin Wang, Jiade Chen, Yunhui Shi, Nam Ling, Baocai Yin:
MC-PCGC: A Space-Channel Mixed Contextual Coding for Point Cloud Geometry Compression. 1-6 - Siqi Deng, Liu Yang:
Enhancing Consistent Federated Learning Objectives Through Uniform Feature Distributions. 1-6 - Ziheng Xu, Jianwei Niu, Qingfeng Li, Tao Ren, Chen Chen:
NID-SLAM: Neural Implicit Representation-based RGB-D SLAM In Dynamic Environments. 1-6 - Ruiting Wang, Enguang Zuo, Chen Chen, Cheng Chen, Junyi Yan, Jie Zhong, Ziwei Yan, Xiaoyi Lv:
SMAE: A Split Masked Graph Autoencoder. 1-6 - Xiaolong Wang, Ping Hu, Rongyao Hu, Xiaofeng Zhu:
GATrack: Group-Aware features for multiple object tracking. 1-6 - Tao He, Leqi Shen, Guiguang Ding, Zhiheng Zhou, Tianshi Xu, Xiaofeng Jin, Yuheng Huang:
Balanced Active Sampling for Person Re-identification. 1-6 - Xin Yan, Chi-Man Pun, Haolun Li, Mengqi Liu, Hao Gao:
Hierarchical Local Temporal Feature Enhancing for Transformer-Based 3D Human Pose Estimation. 1-6 - Bosheng Qin, Juncheng Li, Siliang Tang, Tat-Seng Chua, Yueting Zhuang:
InstructVid2Vid: Controllable Video Editing with Natural Language Instructions. 1-6 - Hao-Yuan Ma, Li Zhang:
Multi-head multi-scale pixel localization network for crowd counting with highly dense and small-scale samples. 1-5 - Penghui Wen, Kun Hu, Dong Yuan, Zhiyuan Ning, Changyang Li, Zhiyong Wang:
Radio Frequency Signal based Human Silhouette Segmentation: A Sequential Diffusion Approach. 1-6 - Dongmei Zhang, Ray Zhang, Fan Yang, Yuan Li, Huizhu Jia, Xiaodong Xie, Shanghang Zhang:
VLUReID: Exploiting Vision-Language Knowledge for Unsupervised Person Re-Identification. 1-6 - Xiaoli Tang, Han Yu, Xiaoxiao Li:
Agent-Oriented Joint Decision Support for Data Owners in Auction-Based Federated Learning. 1-6 - Xuewei Liu, Shaofei Huang, Ruipu Wu, Hengyuan Zhao, Duo Xu, Xiaoming Wei, Jizhong Han, Si Liu:
Reference Prompted Model Adaptation for Referring Camouflaged Object Detection. 1-6 - Ching-Chia Kao, Cheng-Yi Lee, Chun-Shien Lu, Chia-Mu Yu, Chu-Song Chen:
On the Higher Moment Disparity of Backdoor Attacks. 1-6 - Liyan Guo, Kaiyu Song, Mengying Xu, Hanjiang Lai:
DNAF: Diffusion with Noise-Aware Feature for Pose-Guided Person Image Synthesis. 1-6 - Cai Yu, Shan Jia, Xiaomeng Fu, Jin Liu, Jiahe Tian, Jiao Dai, Xi Wang, Siwei Lyu, Jizhong Han:
Explicit Correlation Learning for Generalizable Cross-Modal Deepfake Detection. 1-6 - Wang Xia, Yao Lu, Shunzhou Wang, Wenjing Wang, Ziqi Wang, Peiqi Xia:
Omni Spatial-Angular Correlations Exploration for Light Field Image Super-Resolution. 1-6 - Zhiyi Pan, Guoqing Liu, Wei Gao, Thomas H. Li:
EPContrast: Effective Point-level Contrastive Learning for Large-scale Point Cloud Understanding. 1-6 - Yuxiang Yang, Lu Wen, Yuanyuan Xu, Jiliu Zhou, Yan Wang:
Adaptive Prompt Learning with Negative Textual Semantics and Uncertainty Modeling for Universal Multi-Source Domain Adaptation. 1-6 - Donghui Zhang, Xiaobing Li, Di Lu, Yun Tie, Yan Gao, Lin Qi:
Multitrack Emotion-Based Music Generation Network Using Continuous Symbolic Features. 1-6 - Ying Zhong, Ke-Ao Zhao, Leping Zhang, Fangming Zhao, Wentao Wei, Feilin Han:
The Correlation Analysis Between Cybersickness and Postural Behavior in Immersive VR Experience. 1-6 - Chengxin Zhao, Hefei Ling, Sijing Xie, Han Fang, Yaokun Fang, Nan Sun:
SSyncOA: Self-synchronizing Object-aligned Watermarking to Resist Crop-paste Attacks. 1-6 - Ning Pang, Wansen Wu, Yue Hu, Kai Xu, Quanjun Yin, Long Qin:
Enhancing Multimodal Sentiment Analysis via Learning from Large Language Model. 1-6 - Yifang Xu, Yunzhuo Sun, Benxiang Zhai, Zien Xie, Youyao Jia, Sidan Du:
Multi-Modal Fusion and Query Refinement Network for Video Moment Retrieval and Highlight Detection. 1-6 - Shunkai Zhou, Canlong Zhang, Zhixin Li, Zhiwen Wang, Chunrong Wei:
Person Re-identification utilizing Text to Search Video. 1-6 - Shuo Zhang, Xiongpeng Hu, Jing Liu:
TranBF: Deep Transformer Networks and Bayesian Filtering for Time Series Anomalous Signal Detection in Cyber-physical Systems. 1-6 - Jin Wang, Yahong Han:
Symmetrical Two-Stream with Selective Sampling for Diversifying Video Captions. 1-6 - Hao Wu, Ke Lu, Yuqiu Li, Junhao Huang, Jian Xue:
MISTA: A Large-Scale Dataset for Multi-Modal Instruction Tuning on Aerial Images. 1-6 - Zhaochen Li, Kedian Mu:
Disentangling and Aggregating: A Data-Centric Training Framework for Cross-Domain Few-Shot Classification. 1-6 - Peng Yan, Guodong Long:
Client-Supervised Federated Learning: Towards One-Model-for-All Personalization. 1-6 - Zhiyu Zhang, Guo Lu, Huanxiong Liang, Anni Tang, Qiang Hu, Li Song:
Efficient Dynamic-NeRF Based Volumetric Video Coding with Rate Distortion Optimization. 1-6 - Qi Li, Yucan Zhou, Jiang Zhou, Xiaoyan Gu, Bo Li:
Tackling Feature Skew in Heterogeneous Federated Learning with Semantic Enhancement. 1-6 - Shuwen Yang, Tianyu Huai, Anran Wu, Xingjiao Wu, Wenxin Hu, Liang He:
Enhancing Out-of-Distribution Generalization in VQA through Gini Impurity-guided Adaptive Margin Loss. 1-6 - Yiwei Lou, Jiayu Zhang, Dexuan Xu, Yongzhi Cao, Hanpin Wang, Yu Huang:
No-Reference MRI Quality Assessment via Contrastive Representation: Spatial and Frequency Domain Perspectives. 1-6 - Ren Nie, Jin Ding, Lingxiao He, Xue Zhou:
Latent Distribution Alignment for Domain Generalizable Person Re-identification. 1-6 - Yan Li, Qiong Wang:
Leveraging Hybrid Referring Expressions for Referring Video Object Segmentation. 1-6 - Bingyu Duan, Wanqian Zhang, Dayan Wu, Zheng Lin, Jingzi Gu, Weiping Wang:
Exploiting Vision-Language Model for Visible-Infrared Person Re-identification via Textual Modality Alignment. 1-6 - Liman Wang, Hanyang Zhong:
FENet: Focusing Enhanced Network for Lane Detection. 1-6 - Bo Qian, Yang Wen, Bin Sheng:
Self-Paced Co-Training and Foundation Model for Semi-Supervised Medical Image Segmentation. 1-6 - Jingjing Lu, Yunchuan Qin, Fan Wu, Zhizhong Liu, Kenli Li, Ruihui Li:
DeformingNet: Deforming Multiple Uniform 3D Priors for 3D Point Cloud Completion. 1-6 - Zhaozhi Xie, Bochen Guan, Weihao Jiang, Muyang Yi, Yue Ding, Hongtao Lu, Lei Zhang:
PA-SAM: Prompt Adapter SAM for High-Quality Image Segmentation. 1-6 - Xianpeng Cao, Weixing Xie, Xianxing Cao, Qiqin Lin, Rongzhou Zhou, Junfeng Yao, Qingqi Hong:
ICR-Net: Semi-Supervised Medical Image Segmentation Guided By Intra-Sample Cross Reconstruction. 1-6 - Yu Wang, Bingchen Zhao, Yongchun Lu, Guoqiang Xiao, Quan Lu:
Debiased Prototypical Learning Improves Generalized Category Discovery. 1-6 - Mohan Chen, Yiren Zhang, Jueqi Wei, Yuejie Zhang, Rui Feng, Tao Zhang, Shang Gao:
Temporal Feature Aggregation for Efficient 2D Video Grounding. 1-6 - Na Jiang, Yuxuan Qiu, Wei Song, Jiawei Liu, Zhiping Shi, Liyang Wang:
Joint Visual-Textual Reasoning and Visible-Infrared Modality Alignment for Person Re-Identification. 1-6 - Yongsheng Yu, Jiebo Luo:
Chain-of-Thought Prompting for Demographic Inference with Large Multimodal Models. 1-7 - You Wu, Zhixin Li:
Mining Similarity Relationships for Unsupervised Cross-Modal Hashing. 1-6 - Yeheng Zhu, Zhijian Wu, Jun Li, Jianhua Xu:
HURDNet: Heterogeneous UNet Structure With Range-Null Space Decomposition for Hyperspectral Image Reconstruction. 1-6 - Hui Wang, Jie Sun, Tianyu Wo, Xudong Liu:
FedFRR: Federated Forgetting-Resistant Representation Learning. 1-6 - Peiming Lin, Sumei Li, Zilin Zhao, Huilin Zhang:
I2GSRnet: Iterative Interaction Guidance Network for Stereo Image Super-Resolution. 1-6 - Ruitao Xie, Limai Jiang, Xiaoxi He, Yi Pan, Yunpeng Cai:
A Weakly Supervised and Globally Explainable Learning Framework for Brain Tumor Segmentation. 1-6 - Yaoxin Li, Deepak Sridhar, Hanwen Liang, Alexander Wong:
Spot the Difference! Temporal Coarse to Fine to Finer Difference Spotting for Action Recognition in Videos. 1-6 - Yong Tang, Qiang Huang, Yingying Zhu:
C2F-CCPE: Coarse-to-Fine Cross-View Camera Pose Estimation. 1-6 - Zicheng Zhang, Yu Fan, Wei Sun, Xiongkuo Min, Xiaohong Liu, Chunyi Li, Haoning Wu, Weisi Lin, Ning Liu, Guangtao Zhai:
Optimizing Projection-Based Point Cloud Quality Assessment with Human Preferred Viewpoints Selection. 1-6 - Kang Xiao, Xu Wang, Yulin He, Baoliang Chen, Xuelin Shen:
Sliced Maximal Information Coefficient: A Training-Free Approach for Image Quality Assessment Enhancement. 1-6 - Haoran Zhang, Xiangdong Su, Xingxiang Zhou, Guanglai Gao:
MEMix: Improving HMER with Diverse Formula Structure Augmentation. 1-6 - Yichi Zhang, Zhihao Duan, Yuning Huang, Fengqing Zhu:
Theoretical Bound-Guided Hierarchical Vae For Neural Image Codecs. 1-6 - Yanyu Li, Jiangbo Xu, Ruoyu Zou:
Research on Image Aesthetic Assessment based on Graph Convolutional Network. 1-6 - Tiancheng Zhang, Xinyi Zhang:
Multi-contrast MRI Reconstruction with Deformable Attention and Invertible Network. 1-6 - Pei Wang, Yun Yang, Zhenyu Yu:
Multi-batch Nuclear-norm Adversarial Network for Unsupervised Domain Adaptation. 1-6 - Helin Zhao, Wei Chen, Peng Zhou:
Deep Self-paced Active Learning for Image Clustering. 1-6 - Rui Ma, Mengxi Guo, Peidong Jia, Chenxuan Li, Yi Hou, Yuan Li, Xiaodong Xie, Shanghang Zhang:
Enhanced Blind Watermarking Against Black-Box Noise: Leveraging CIN Framework. 1-6 - Xiaolin Chen, Daoguang Zan, Wei Li, Bei Guan, Yongji Wang:
FIA-TE: Feature Inference Attack on Decision Tree Ensembles in Vertical Federated Learning. 1-6 - Yuzhou Zhao, Xinyu Zhou, Haijing Guo, Qianyu Guo, Yan Zuo, Shaoli Song, Shuyong Gao, Wenqiang Zhang:
Attention in Attention for PET-CT Modality Consensus Lung Tumor Segmentation. 1-7 - Bingzhi Chen, Shuobin Lin, Yishu Liu, Zheng Zhang, Guangming Lu, Lewei He:
Rethinking Adversarial Robustness Distillation VIA Strength-Dependent Adaptive Regularization. 1-6 - Meng Wang, Yue Qi:
Efficient Sampling and Volume Rendering Strategy for Neural Field SLAM. 1-6 - Md. Ershadul Haque, Manoranjan Paul:
Block-Wise Compression Of The Quantum Gray-Scale Image Using Lossy Preparation Approach. 1-6 - Jorge Kessler-Martín, Pablo Fernández-Lagos, David García-Lucas, Gabriel Cebrián-Márquez, Belén Ríos-Sánchez, Guillermo Vigueras, Antonio Jesús Díaz-Honrubia:
Saliency Dataset and Predictive Model for Areas of Interest in VVC Perceptual Coding. 1-6 - Qiancheng Yang, Yong Luo, Bo Du:
Training-Free Robust Neural Network Search Via Pruning. 1-6 - Xianzhou Zeng, Hao Qin, Ming Kong, Luyuan Chen, Qiang Zhu:
Probablistic Restoration with Adaptive Noise Sampling for 3D Human Pose Estimation. 1-6 - Yi Pan, Jun-Jie Huang, Zihan Chen, Wentao Zhao, Ziyue Wang:
SVASTIN: Sparse Video Adversarial Attack via Spatio-Temporal Invertible Neural Networks. 1-6 - Jiazhe Miao, Tao Peng, Fei Fang, Xinrong Hu, Ping Zhu, Feng Yu, Minghua Jiang:
SmPhy: Generating smooth and physically plausible 3D garment animations. 1-6 - Rui Zhang, Junxiao Xue, Feng Lin, Qing Zhang, Pavel Smirnov, Xiao Ma, Xiaoran Yan:
Enhancing Human Action Recognition with Fine-grained Body Movement Attention. 1-6 - Yuting Hu, Yue Ming, Panzi Zhao, Boyang Lyu, Kai Hong:
LMGSNet: A Lightweight Multi-scale Group Shift Fusion Network for Low-quality 3D Face Recognition. 1-6 - Hao Niu, Yun Xiong, Xiaosu Wang, Biao Yang, Yao Zhang:
How Does Textual Information Selection Influence Time Series Forecasting? A Cross-modal Perspective on Financial Volatility Prediction. 1-6 - Sanhita Pathak, Vinay Kaushik, Brejesh Lall:
Single Stage Warped Cloth Learning and Semantic-Contextual Attention Feature Fusion for Virtual Tryon. 1-6 - Beibei Li, Beihong Jin, Yisong Yu, Yiyuan Zheng, Jiageng Song, Wei Zhuo, Tao Xiang:
Orthogonal Hyper-category Guided Multi-interest Elicitation for Micro-video Matching. 1-6 - Wenwen Zhang, Jie Lian, Bingying Dong:
Multi-Scale Position-Aware Cell Nucleus Mask Attention for Tumor Budding Detection. 1-6 - Ming Guo, Wenrui Li, Chao Wang, Yuxin Ge, Chongjun Wang:
Smile: Spiking Multi-Modal Interactive Label-Guided Enhancement Network for Emotion Recognition. 1-6 - Yucheng Shu, Longjin Cheng, Bin Xiao, Lihong Qiao, Weisheng Li, Xinbo Gao:
Focal-Guided Multi-Consistency for Unsupervised Partial-to-Partial Point Cloud Registration. 1-6 - Si Li, Jiaxing Liu, Peilin Li, Dichucheng Li, Xinlu Liu, Yongwei Gao, Wei Li:
Improving Drum Source Separation with Temporal-Frequency Statistical Descriptors. 1-6 - Zexian Yang, Dayan Wu, Wanqian Zhang, Jingzi Gu, Zheng Lin, Weiping Wang:
Privacy-Preserving Replay and Adaptive Relation Distillation for Camera Incremental Person Re-Identification. 1-6 - Dingbang Li, Wenzhou Chen, Xin Lin:
Tina: Think, Interaction, and Action Framework for Zero-Shot Vision Language Navigation. 1-6 - Bo Gao, Junchi Ren, Fei Shen, Mengwan Wei, Zijun Huang:
Exploring Warping-Guided Features via Adaptive Latent Diffusion Model for Virtual try-on. 1-6 - Lichao Cui, Shanliang Yang:
Enhancing Multimodal Sentiment Recognition Based on Cross-Modal Contrastive Learning. 1-6 - Wen Xue, Xingbo Liu, Xiao Kang, Xuening Zhang, Xiushan Nie, Shaohua Wang, Yilong Yin:
Fast Multi-view Clustering With Binary Anchor Graph. 1-6 - Xuening Zhang, Xingbo Liu, Xiao Kang, Wen Xue, Xiushan Nie, Shaohua Wang, Yilong Yin:
Completely Unpaired Cross-Modal Hashing Based on Coupled Subspace. 1-6 - Shuang Liang, Long Zhang, Chi Xie, Lili Chen:
Causal Intervention for Panoptic Scene Graph Generation. 1-6 - Xu Wang, Kairui Zhang:
Adaptive Style Transfer Learning for Generalizable Person Re-identification. 1-6 - Fengyuan Zhang, Zhaopei Huang, Xinjie Zhang, Qin Jin:
Adaptive Temporal Motion Guided Graph Convolution Network for Micro-expression Recognition. 1-6 - Yongjie Guo, Siya Chen, Hongjian You:
Continual Semantic Segmentation via Mask-Based Class Rebalancing. 1-6 - Menglong Yang, Hanyong Wang, Yang Ren:
A Self-Attention Network for Stereo Matching. 1-10 - Ke Jia, Yonghong Song, Xiaomeng Wu, You Su:
Video Anomaly Detection Via Self-Supervised Learning With Frame Interval and Rotation Prediction. 1-6 - Weilong Peng, Yi Luo, Keke Tang, Kongyang Chen, Yangtao Wang, Ping Li, Meie Fang:
IE-aware Consistency Losses for Detailed 3D Face Reconstruction from Multiple Images in the Wild. 1-6 - Zhenggang Yang, Faming Fang, Qiaosi Yi, Guixu Zhang, Fang Li:
HFF-Net: A High-Frequency Fidelity Model for Accelerated Parallel MRI Reconstruction. 1-6 - Zhihang Wei, Jinxin Shi, Jing Yang, Jiabao Zhao:
VIP-FSCIL: A More Robust Approach for FSCIL. 1-6 - Meng Pang, Binghui Wang, Nanrun Zhou, Yintao Zhou, Wei Huang:
Reconstructing Prototype From Contaminated Face With Variations Across Heterogeneous Domains. 1-6 - Xiaoke Yang, Haixu Song, Xiangyu Lu, Shao-Lun Huang, Yueqi Duan:
AdaForensics: Learning A Characteristic-aware Adaptive Deepfake Detector. 1-6 - Kangnan Bai, Pengyi Gao, Kai Chen, Xin Nie, Shenghui Li, Bingqian Li:
Mutual Compromised Multi-feature Fusion Method for Cross-modal Hashing Retrieval. 1-7 - Qingfeng Wang, Lingyu Liang, Shuangping Huang:
Document Image Dewarping Guided by 3D Geometry and Layout Priors. 1-6 - Xuechun Wang, Wentao Chao, Fuqing Duan:
Point Cloud Reconstruction Optimization of Light Field Image based on Intra-class Distance. 1-6 - Jiawei Zhu, Meirong Ding, Yishu Liu, Biqing Zeng, Guangming Lu, Bingzhi Chen:
Robust Visual Question Answering With Contrastive-Adversarial Consistency Constraints. 1-6 - Zhuoxin Chen, Zhenyu Wu, Yang Ji:
Decoupled Federated Learning on Long-Tailed and Non-IID data with Feature Statistics. 1-6 - Kangwei Liu, Xiuhong Li, Boyuan Li, Yuye Zhang, Chao Che:
Lightweight Camouflaged Object Detection Network Based on Feature Complementation and Enhancement. 1-6 - Lei Wang, Tianfu Cai, Pinyi Huang, Xiyao Liu, Wangyang Cai:
Two-Stage Facial Expression Spotting with Spectrum-Based Post-Processing. 1-6 - Yanjie Sun, Kele Xu, Yong Dou, Tian Gao:
Self-Supervised Learning-Based General Fine-tuning Framework For Audio Classification and Event Detection. 1-6 - Wenxin Liang, Bingkai Liu, Han Liu, Hong Yu:
Boosting Node Injection Attack with Graph Local Sparsity. 1-6 - Xu Wang, Yanxia Wu, Ye Yuan, Yan Fu, Xue Zhang:
Unpaired image despeckling based on adversarial speckle generation. 1-6 - Minglang Huang, Yiyi Zhou, Gen Luo, Guannan Jiang, Weilin Zhuang, Xiaoshuai Sun:
Towards Omni-supervised Referring Expression Segmentation. 1-6 - Youqian Zhang, Chunxi Yang, Eugene Yujun Fu, Qinhong Jiang, Chen Yan, Sze-Yiu Chau, Grace Ngai, Hong Va Leong, Xiapu Luo, Wenyuan Xu:
Understanding Impacts of Electromagnetic Signal Injection Attacks on Object Detection. 1-6 - Baotong Su, Siyan Li, Wenguang Zheng, Yao Chen:
SFDE-net: A Spatial-Frequency Domain Feature Enhancement Network for Cloud Detection. 1-6 - Yuxuan Sun, Chenglu Zhu, Sunyi Zheng, Yunlong Zhang, Honglin Li, Lin Yang:
Context-Aware Text-Assisted Multimodal Framework for Cervical Cytology Cell Diagnosis and Chatting. 1-6 - Zhongzhan Huang, Senwei Liang, Mingfu Liang, Wei He, Haizhao Yang, Liang Lin:
Lottery Ticket Hypothesis for Attention Mechanism in Residual Convolutional Neural Network*. 1-6 - Jicheng Yuan, Anh Le-Tuan, Manfred Hauswirth, Danh Le Phuoc:
Cooperative Students: Navigating Unsupervised Domain Adaptation in Nighttime Object Detection. 1-6 - Nan Chen, Yonghe Wang, Xiangdong Su, Feilong Bao:
Efficient Speech-to-Text Translation: Progressive Pruning for Accelerated Speech Pre-trained Model. 1-6 - Lei Wang, Quan Zhang, Junyang Qiu, Jianhuang Lai:
Rotation Exploration Transformer for Aerial Person Re-identification. 1-6 - Xuan Dang, Guolong Wang, Xun Wu, Zheng Qin:
Improving Image Reconstruction and Synthesis by Balancing the Optimization from Frequency Perspective. 1-6 - Hengda Li, Yinglin Zheng, Qifeng Dai, Jintai Wang, Liang Song, Ming Zeng:
Multi-Modal Gait Recognition with Unidirectional Cross-modal Alignment. 1-6 - Yingxuan Li, Kiyoharu Aizawa, Yusuke Matsui:
Manga109Dialog: A Large-Scale Dialogue Dataset for Comics Speaker Detection. 1-6 - Huan Zhao, Yi Ju, Yingxue Gao:
Bilevel Relational Graph Representation Learning-based Multimodal Emotion Recognition in Conversation. 1-6 - Kangwei Liu, Xiaowei Yi, Xianfeng Zhao:
ProDub: Progressive Growing of Facial Dubbing Networks for Enhanced Lip Sync and Fidelity. 1-6 - Yulun Wu, Weixing Wei, Dichucheng Li, Mengbo Li, Yi Yu, Yongwei Gao, Wei Li:
Harmonic Frequency-Separable Transformer for Instrument-Agnostic Music Transcription. 1-6 - Xinxin Zhang, Xiankai Lu, Jizhou Li, Yongshun Gong, Qiangchang Wang, Yilong Yin:
Two-phase Parametric Registration for Retinal Images. 1-6 - Pengcheng Lei, Zaoming Yan, Tingting Wang, Faming Fang, Guixu Zhang:
Three-Stage Temporal Deformable Network for Blurry Video Frame Interpolation. 1-6 - Kaifen Cai, Kaiyu Song, Yan Pan, Hanjiang Lai:
MALIP: Improving Few-Shot Image Classification with Multimodal Fusion Enhancement. 1-6 - Zhenghao Ke, Sheng Liu, Chengyuan Ke, Yuan Feng, Shengyong Chen:
Cross-Modality Consistency Mining For Continuous Sign Language Recognition with Text-Domain Equivalents. 1-6 - Zhixuan Shen, Haonan Luo, Sijia Li, Tianrui Li:
Adversarial Training with OCR modality Perturbation for Scene-Text Visual Question Answering. 1-6 - Jinkang Ji, Junao Shen, Xinyu Wang, Tian Feng, Sensen Wu:
WirePAuS: Auxiliary-free Single-shot Wireframe Parsing. 1-6 - Hao Wu, Ruochong Li, Hao Wang, Hui Xiong:
COM3D: Leveraging Cross-View Correspondence and Cross-Modal Mining for 3D Retrieval. 1-6 - Xiao Liu, Guan Yuan, Rui Bing, Zhuo Cai, Shengshen Fu, Yonghao Yu:
When Skeleton Meets Motion: Adaptive Multimodal Graph Representation Fusion for Action Recognition. 1-6 - Guodong Li, Letu Qingge, Qingyi Pan, Pei Yang:
Edge-Guided Mural Image Inpainting by Integrating Local and Global Information and Multiple Color Spaces. 1-6 - Jialing Zou, Jiahao Mei, Xudong Nan, Jinghua Li, Daoguo Dong, Liang He:
TEAdapter: Supply Vivid Guidance for Controllable Text-to-Music Generation. 1-6 - Yongheng Zhang, Yuanqiang Cai, Danfeng Yan:
Restoring Real-World Images Affected by Varied Degradations Using a Semi-Supervised Domain Adaptation Network. 1-6 - Yizhu Wen, Yiwei Wang, Kai Yi, Jing Ke, Yiqing Shen:
Diffimpute: Tabular Data Imputation with Denoising Diffusion Probabilistic Model. 1-6 - Dahe Peng, Rongrong Shen, Zhixin Li:
Robust VQA via Internal and External Interaction of Modal Information and Question Transformation. 1-6 - Zhangfeng Hu, Wenming Zheng, Yuan Zong, Mengting Wei, Xingxun Jiang, Mengxin Shi:
A Novel Decoupled Prototype Completion Network for Incomplete Multimodal Emotion Recognition. 1-6 - Ping Xu, Jiangqun Ni, Jian Zhang, Yulin Zhang, Shiyuan Tang:
Diff-IFL: Towards General Image Forgery Localization using Diffusion Probabilistic Model. 1-6 - Han Fang, Xianghao Zang, Chao Ban, Zerun Feng, Lanxiang Zhou, Zhongjiang He, Yongxiang Li, Hao Sun:
ProTA: Probabilistic Token Aggregation for Text-Video Retrieval. 1-6 - Mingchen Xu, Jing Wu, Yu-Kun Lai, Ze Ji:
Fusion of Short-term and Long-term Attention for Video Mirror Detection. 1-9 - Xiao Liang, Zijian Zhao, Weichao Zeng, Yutong He, Fupeng He, Yiyi Wang, Chengying Gao:
PianoBART: Symbolic Piano Music Generation and Understanding with Large-Scale Pre-Training. 1-6 - Ye-Wen Wang, Chen-Chen Zong, Ming-Kun Xie, Sheng-Jun Huang:
Dirichlet-Based Coarse-to-Fine Example Selection For Open-Set Annotation. 1-6 - Long Huang, Zhiwei Dong, Song-Lu Chen, Ruiyao Zhang, Shutong Ti, Feng Chen, Xu-Cheng Yin:
HQOD: Harmonious Quantization for Object Detection. 1-6 - Fan Tian, Peichi Zhou, Chen Li, Changbo Wang:
Shadow Constrained DEM Refinement Based on Differentiable Rendering. 1-6 - Cheng Shang, Jidong Tian, Jiannan Ye, Xubo Yang:
Free-view Rendering of Dynamic Human from Monocular Video Via Modeling Temporal Information Globally and Locally among Adjacent Frames. 1-6 - Simiao Lai, Dong Wang, Huchuan Lu:
DepthRefiner: Adapting RGB Trackers to RGBD Scenes via Depth-Fused Refinement. 1-6 - Shuo Zhang, Xiongpeng Hu, Jing Liu:
Causal Fusion of Convolutional Neural Network and Vision Transformer for Image Anomaly Detection and Localization. 1-6 - Yueming Zhu, Qing Xu, Kai Zhen, Runlin Zhang, Shunbo Wang:
Quantitative Analysis of Eye-Tracking Data Based on Information-Theoretic Tools for Measuring Driver Drowsiness. 1-6 - Chen He, Shenshen Li, Zheng Wang, Fumin Shen, Yang Yang, Xing Xu:
Diverse Embedding Modeling with Adaptive Noise Filter for Text-based Person Retrieval. 1-6 - Zhengyang Li, Shanshan Huang, Jiawei Liu, Laiming Jiang, Shen Chen, Yi Zhang, Jun Liao, Shu Wang, Li Liu:
Recognizing Cognitive Load by a Multi-instance Causal Learning Model from Multi-channel Physiological Data. 1-6 - Kun Hu, Zizhuo Wang, Zixuan Hu, Heng Gao, Xingjun Wang:
Stega-Matting: Irregular Matting Protection via Steganography. 1-6 - Zhenrong Cheng, Jiayan Guo, Hao Sun, Yan Zhang:
Boosting Disfluency Detection with Large Language Model as Disfluency Generator. 1-6 - Chen Liang, Zhiqian Dong, Sheng Yang, Peng Zhou:
Jointly Learn the Base Clustering and Ensemble for Deep Image Clustering. 1-6 - Jinyi Wang, Fei Ben, Huangjie Zheng, Jiangchao Yao, Ya Zhang, Yanfeng Wang:
MVTexGen: Synthesising 3D Textures Using Multi-View Diffusion. 1-6 - Xiyao Liu, Fengkai Dong, Xin Liao, Yuhan Guo, Jianbiao He, Jian Zhang, Gerald Schaefer, Hui Fang:
Multi-Strategy Adversarial Learning for Robust Face Forgery Detection Under Heterogeneous and Composite Attacks. 1-6 - Shaoyao Huang, Luozheng Qin, Ziqiang Cao, Qian Qiao:
STRA: A Simple Token Replacement Strategy Alleviating Exposure Bias in Text Generation. 1-6 - Pan Mu, Binjia Zhou, Qirui Wang, Zhiying Du, Xiaoyan Wang:
BFMEF: Brightness-Free Multi-exposure Image Fusion via Adaptive Correction. 1-6 - Hanglin Li, Peng Yin, Xiaosu Zhu, Lianli Gao, Jingkuan Song:
BFD: Binarized Frequency-enhanced Distillation for Vision Transformer. 1-6 - Jianshe Duan, Yachao Zhang, Yanyun Qu:
Source-Free Domain Adaptation for Point Cloud Semantic Segmentation. 1-6 - Quoc-Huy Trinh, Minh-Van Nguyen, Phuoc-Thao Vo Thi:
KDAS: Knowledge Distillation via Attention Supervision Framework for Polyp Segmentation. 1-6 - Yunfei Yang, Xiaojun Chen, Yuexin Xuan, Zhendong Zhao:
DualCOS: Query-Efficient Data-Free Model Stealing with Dual Clone Networks and Optimal Samples. 1-6 - Junkai Li, Huicheng Lai, Jun Ma, Tongguan Wang, Hutuo Quan, Dongji Chen:
Efficient Guided Query Network for Human-Object Interaction Detection. 1-6 - Dongyang Gao, Chen Chen, Yichao Zhou, Haotian Zhang, Xiyuan Hu:
TS-SAM: Two Small Steps for SAM, One Giant Leap for Abnormal detections. 1-6 - Zhenping Li, Si Wu, Xindian Wei, Qianfen Jiao, Cheng Liu, Rui Li:
Reference-conditional Makeup-aware Discrimination for Face Image Beautification. 1-6 - Zhenyu Yu, Pei Wang:
CaPAN: Class-aware Prototypical Adversarial Networks for Unsupervised Domain Adaptation. 1-6 - Yiran Liu, Zhanjie Wu, Mengjingcheng Mo, Ji Gan, Jiaxu Leng, Xinbo Gao:
Dual Space Embedding Learning For Weakly Supervised Audio-Visual Violence Detection. 1-6 - Yifei Pu, Chi Wang, Xiaofeng Hou, Cheng Xu, Jiacheng Liu, Jing Wang, Minyi Guo, Chao Li:
M2SN: Adaptive and Dynamic Multi-modal Shortcut Network Architecture for Latency-Aware Applications. 1-6 - Yuanwu Xu, Mohan Chen, Yuejie Zhang, Rui Feng, Tao Zhang, Shang Gao:
Memory-Augmented Transformer for Efficient End-to-End Video Grounding. 1-6 - Yan Feng, Tian Jiang, Yunqi Liu, Zijian Huang, Xiaohui Cui:
Multimodal Semantic Fusion for Zero-Shot Learning. 1-6 - Xin Zhang, Teodor Boyadzhiev, Jinglei Shi, Jufeng Yang:
ICFRNet: Image Complexity Prior Guided Feature Refinement for Real-time Semantic Segmentation. 1-6 - Haifan Gong, Wenhao Huang, Huan Zhang, Yu Wang, Xiang Wan, Hong Shen, Guanbin Li, Haofeng Li:
Intensity Confusion Matters: An Intensity-Distance Guided Loss For Bronchus Segmentation. 1-6 - Soumyya Kanti Datta, Shan Jia, Siwei Lyu:
Exposing Lip-syncing Deepfakes from Mouth Inconsistencies. 1-6 - Ila Gokarn, Yigong Hu, Tarek F. Abdelzaher, Archan Misra:
JIGSAW: Edge-based Streaming Perception over Spatially Overlapped Multi-Camera Deployments. 1-6 - Yijia Zhang, Lingran Zhao, Shijie Cao, Sicheng Zhang, Wenqiang Wang, Ting Cao, Fan Yang, Mao Yang, Shanghang Zhang, Ningyi Xu:
Integer or Floating Point? New Outlooks for Low-Bit Quantization on Large Language Models. 1-6 - Dongyue Li, Songlin Du:
Beyond Global Cues: Unveiling the Power of Fine Details in Image Matching. 1-6 - Yin Tang, Guang Yang, Xili Wan:
SDViT: Towards Efficient Visual Foundation Model via Unifying Sparse and Dense Representation Learning. 1-6 - Jinhe Long, Zekai Chen, Fuyi Wang, Jianping Cai, Ximeng Liu:
FedCL: Detecting Backdoor Attacks in Federated Learning with Confidence Levels. 1-6 - Bowen Qu, Haohui Li, Wei Gao:
Bringing Textual Prompt to AI-Generated Image Quality Assessment. 1-6 - Zhaofei Wang, Weijia Zhang, Min-Ling Zhang:
Proposal Feature Learning Using Proposal Relations for Weakly Supervised Object Detection. 1-6 - Fengshuo Zhang:
Multi-Hop Distillation for Efficient Cross-Layer Knowledge Transfer. 1-7 - Shengyang Sun, Xiaojin Gong:
Multi-scale Bottleneck Transformer for Weakly Supervised Multimodal Violence Detection. 1-6 - Qianyun Gong, Kunheng Jiang, Jingjing Wen, Xinjing Yuan, Jianxin Shi, Lingjun Pu:
TailClip: Mitigating Tail Latency in Cloud Gaming via Smart Video Frame Generation. 1-6 - Mufan Liu, Le Yang, Yiling Xu, Ye-Kui Wang, Jenq-Neng Hwang:
EVAN: Evolutional Video Streaming Adaptation via Neural Representation. 1-6 - Duo Liu, Linglan Zhao, Zhongqiang Zhang, Fuhan Cai, Xiangzhong Fang:
Distillation Excluding Positives for Few-Shot Class-Incremental Learning. 1-6 - Tienyi Hsieh, Qijun Zhao, Fan Pan, Pubu Danzeng, Dingguo Gao, Dorji Gesang:
Text and Edge Guided Thangka Image Inpainting with Diffusion Model. 1-10 - Xinyu Liu, Yong Yi, Ye Luo:
A Cascade Multimodal Fine-Grained MRI Image Grading Network For Preoperative Microvascular Invasion In Hepatocellular Carcinoma. 1-6 - Enqi Liu, Liyuan Pan:
A Lightweight Multi-Level Relation Network for Few-shot Action Recognition. 1-6 - Jiangbin Zheng, Stan Z. Li:
Progressive Multi-Modality Learning for Inverse Protein Folding. 1-6 - Zheng Cui, Yongli Hu, Jiapu Wang, Junbin Gao, Yanfeng Sun, Baocai Yin:
Common-Memory Bridged Cross-Modal Adaptive Graph Embedding for Image-Text Retrieval. 1-6 - Hengyu Zhang, Hang Lv, Yanchao Tan, Guofang Ma, Fan Wang, Carl Yang:
ExpertODE: Continuous Diagnosis Prediction with Expert Enhanced Neural Ordinary Differential Equations. 1-6 - Zhuangzi Li, Shan Liu, Ge Li:
PointELM: Fast Point Cloud Classification Using Deep Random Mapping Based Extreme Learning Machines. 1-6 - Lan Yan, Kenli Li:
Unknown Instance Learning for Person Search. 1-6 - Zhijian Wu, Dingjiang Huang:
Ultralight-weight Binary Neural Network with 1K Parameters for Image Super-Resolution. 1-6 - Diwen Wan, Jiaxiang Tang, Jingbo Wang, Xiaokang Chen, Lingyun Gan, Gang Zeng:
Open-set Hierarchical Semantic Segmentation for 3D Scene. 1-6 - Ziyi Huang, Binbin Yan, Shuo Chen, Dongliang Wang, Lu Yang:
Focal Stack Alignment Enhancement Network For Light Field Salient Object Detection. 1-6 - Zihan Ma, Huan Liu, Zhi Zeng, Hao Guo, Xiang Zhao, Minnan Luo:
Learning Multimodal Attention Mixed with Frequency Domain Information as Detector for Fake News Detection. 1-6 - Ryandhimas E. Zezario, Yu-Wen Chen, Szu-Wei Fu, Yu Tsao, Hsin-Min Wang, Chiou-Shann Fuh:
A Study On Incorporating Whisper For Robust Speech Assessment. 1-6 - Xuan Long, Meiqin Liu, Qi Tang, Chao Yao, Jian Jin, Yao Zhao:
Noisy-Residual Continuous Diffusion Models for Real Image Denoising. 1-6 - Chunyi Li, Haoning Wu, Zicheng Zhang, Hongkun Hao, Kaiwei Zhang, Lei Bai, Xiaohong Liu, Xiongkuo Min, Weisi Lin, Guangtao Zhai:
Q-Refine: A Perceptual Quality Refiner for AI-Generated Image. 1-6 - Yuxuan Jiang, Guobin Zhu, Yi Ding, Zhen Qin, Minghui Pang:
Gradient Saliency-aware CutMix for Semi-Supervised Medical Image Segmentation. 1-6 - Thanh Hai Phung, Hung-Jen Chen, Hong-Han Shuai:
Hierarchically Aggregated Identification Transformer Network for Camouflaged Object Detection. 1-6 - Peiqi Xia, Yao Lu, Sijia Zhang, Shunzhou Wang, Ziqi Wang, Wang Xia:
Revisiting Large Kernel Convolution for Light Field Image Angular Super-Resolution. 1-6 - Lifeng Zhou, Yuke Li:
Coarse-to-fine Alignment Makes Better Speech-image Retrieval. 1-6 - Yuan Yao, Yuanhan Zhang, Zhenfei Yin, Jiebo Luo, Wanli Ouyang, Xiaoshui Huang:
3D Point Cloud Pre-Training with Knowledge Distilled from 2D Images. 1-6 - Da Ai, Kai Jia, Yunqiao Wang, Ying Liu:
NIR-VIS Image Translation for the Cross-Spectral and Cross-Distance Face Recognition. 1-6 - Ruixue Qi, Chen Pang, Mengyang Zhang, Lei Lyu:
EGLA-Net: Edge Guided with Lesion Aware Network for Medical image segmentation. 1-6 - Li Jin, Xibin Song, Jia Li, Changhe Tu, Xueying Qin:
CSS-Net: Domain Generalization in Category-level Pose Estimation via Corresponding Structural Superpoints. 1-6 - Ke Ning, Rongrong Shen, Zhixin Li:
Robust Knowledge Distillation and Self-Contrast Reasoning for Debiased Visual Question Answering. 1-6 - Lirong Xue, Kang-Yang Huang, Rong Chao, Jhih-Ciang Wu, Hong-Han Shuai, Yung-Hui Li, Wen-Huang Cheng:
Learning Efficient Interaction Anchor for HOI Detection. 1-6 - Chenyang Li, Xing Wei, Huazheng Zhao:
MultiQ: Multi-model Joint Learning via Synthetic Data for Data-Free Quantization. 1-6 - Jiaye Zhang, Zili Meng, Mingwei Xu:
Beimin: Serverless-based Adaptive Real-Time Video Processing. 1-6 - Yu Lu, Yizhou Jin, Yuyu Chen, Gang Zhou, Zhenghui Hu, Qingjie Liu, Di Huang, Yunhong Wang:
Fast Textile Pilling Classification Based on a Lightweight Network and 3D Point Clouds. 1-6 - Haoyu Wang, Zilong Yin, Hangling Sun, Xin Guo:
Enhancing Vital Sign Monitoring with Reinforcement Learning and Wavelet Analysis in Sleep Disorders. 1-6 - Xiufeng Liu, Zhongqiu Zhao, Chen Ding:
Style-ACAE: Adversarial Capsule Autoencoder with Styles. 1-6 - Weitian Zhang, Sijing Wu, Yichao Yan, Ben Xue, Wenhan Zhu, Xiaokang Yang:
HQ-Avatar: Towards High-Quality 3D Avatar Generation via Point-based Representation. 1-6 - Lihong Qiao, Rui Wang, Yucheng Shu, Ximing Xu, Baobin Li, Weisheng Li, Xinbo Gao:
Re3adapter: Efficient Parameter Fing-Tuning with Triple Reparameterization for Adapter without Inference Latency. 1-6 - Pengxiang Ouyang, Jianan Chen, Qing Ma, Zheng Wang, Cong Bai:
Distinguishing Visually Similar Images: Triplet Contrastive Learning Framework for Image-text Retrieval. 1-6 - Yifan Zhang, Meiqin Liu, Chenming Xu, Qi Tang, Chao Yao, Yao Zhao:
TLVC: Temporal Bit-rate Allocation for Learned Video Compression. 1-6 - Yang Xu, Yifan Feng, Yu Jiang:
Structure-aware Residual-center Representation for Self-Supervised Open-set 3D Cross-modal Retrieval. 1-6 - Jiachen Luo, Huy Phan, Lin Wang, Joshua D. Reiss:
Enhanced Speech Emotion Recognition Incorporating Speaker-Sensitive Interactions in Conversations. 1-6 - Songpei Xu, Xuri Ge, Chaitanya Kaul, Roderick Murray-Smith:
HpEIS: Learning Hand Pose Embeddings for Multimedia Interactive Systems. 1-6 - Xin Chen, Bin Wang, Yongsheng Gao:
Multiscale Binary-Pattern Dependency: A Novel Co-Occurrence Texture Descriptor for Fine-Grained Leaf Image Retrieval. 1-6 - Will Kerr, Crescent Jicol, Tom S. F. Haines, Wenbin Li:
Camera Chameleon - The Creative Impact of Tracked Tangible Interfaces for Virtual Film Pre-Production. 1-6 - Yixiao Li, Xiaoyuan Yang, Jun Fu, Guanghui Yue, Wei Zhou:
Deep Bi-directional Attention Network for Image Super-Resolution Quality Assessment. 1-6 - Hongzhang Mu, Shuili Zhang, Quangang Li, Tingwen Liu, Hongbo Xu:
Dynamic Multi-Modal Representation Learning For Topic Modeling. 1-6 - Zihao He, Shengchuan Zhang:
ESR-DDLN : Enhanced Single Image Super-Resolution Via Dual-Domain Learning Network. 1-6 - Junyuan Guo, Teng Wang, Chao Wang:
Mixed 3D Gaussian for Dynamic Scenes Representation and Rendering. 1-6 - Jiacheng Wang, Ping Liu, Wei Xu:
Unified Diffusion-Based Rigid and Non-Rigid Editing with Text and Image Guidance. 1-6 - Davide Berghi, Craig Cieciura, Farshad Einabadi, Maxine Glancy, Oliver C. Camilleri, Philip Foster, Asmar Nadeem, Faegheh Sardari, Jinzheng Zhao, Marco Volino, Armin Mustafa, Philip J. B. Jackson, Adrian Hilton:
ForecasterFlexOBM: A Multi-View Audio-Visual Dataset for Flexible Object-Based Media Production. 1-6 - Yingying Zhu, Dafeng Li, Zhihang Liu, Hong Zhou:
ClipComb: Global-Local Composition Network based on CLIP for Composed Image Retrieval. 1-6 - Jiaxin Qiu, Guoyu Yang, Jie Lei, Zunlei Feng, Ronghua Liang:
Visual-guided Query with Temporal Interaction for Video Object Segementation. 1-6 - Yiheng Duan, Yunjie Ge, Zixuan Wang, Jiayi Yu, Shenyi Zhang, Libing Wu:
Enhancing the Transferability of Adversarial Examples with Noise Injection Augmentation. 1-6 - Yang Yao, Xin Wang, Yijian Qin, Ziwei Zhang, Wenwu Zhu, Hong Mei:
Customized Cross-device Neural Architecture Search with Images. 1-6 - Yaori Zhang, Shujin Lin, Fan Zhou, Ruomei Wang:
Hierarchical Attention Feature Fusion and Refinement Network for Point Cloud Upsampling. 1-8 - Chen Cai, Runzhong Zhang, Jianjun Gao, Kejun Wu, Kim-Hui Yap, Yi Wang:
Temporal Sentence Grounding with Temporally Global Textual Knowledge. 1-6 - Jiayu Li, Xuechao Zou, Shiying Wang, Ben Chen, Junliang Xing, Pin Tao:
A Parallel Attention Network For Cattle Face Recognition. 1-6 - Minghao Han, Xukun Zhang, Dingkang Yang, Tao Liu, Haopeng Kuang, Jinghui Feng, Lihua Zhang:
Multi-Scale Heterogeneity-Aware Hypergraph Representation for Histopathology Whole Slide Images. 1-6 - Yabin Zhang, Xu Chen:
Enhancing Sequential Recommendation Modeling Via Adversarial Training. 1-6 - Yuteng Wang, Xing Wu, Zhongshi He, Peng Wang, Haidong Wang, Hongqian Wang:
US-SAM: An Automatic Prompt Sam For Ultrasound Image. 1-6 - Jiabo Ye, Junfeng Tian, Xiaoshan Yang, Zhenru Zhang, Anwen Hu, Ming Yan, Ji Zhang, Liang He, Xin Lin:
VG-Annotator: Vision-Language Models as Query Annotators for Unsupervised Visual Grounding. 1-6 - Kailai Feng, Minheng Ni, Jiaxiu Jiang, Zhilu Zhang, Wangmeng Zuo:
Multi-Attentional Distance for Zero-Shot Classification with Text-to-Image Diffusion Model. 1-6 - Kaiyue Tian, Chen Chen, Yichao Zhou, Xiyuan Hu:
Illumination Enlightened Spatial-temporal Inconsistency for Deepfake Video Detection. 1-6 - Hantao Zhou, Runze Hu, Xiu Li:
Video Object Segmentation with Dynamic Query Modulation. 1-6 - Ziyu Gong, Chengcheng Mai, Yihua Huang:
AsCL: An Asymmetry-sensitive Contrastive Learning Method for Image-Text Retrieval with Cross-Modal Fusion. 1-6 - Zixuan Hu, Kun Hu, Zizhuo Wang, Ranran Pan, Xingjun Wang:
OWR: Optimizing Watermark Robustness for Screen Recording. 1-6 - Zhiyuan Zhu, Zhiyuan Ning, Hui Cui, Junao Shen, Jiaheng Wang, Xinyu Wang, Tian Feng:
MuMoSNet: 3D MRI-based Brain Tumor Segmentation via Multi-modal and Multi-scale Feature Fusion. 1-6 - Kaiyu Jin, Chenwang Wu, Defu Lian:
Out-of-Distribution Generalization via Style and Spuriousness Eliminating. 1-6 - Mingdong Yu, Xiaofeng Jin, Guirong Wang, Bo Wang, Jiaqi Chen:
SPformer: Hybrid Sequential-Parallel Architectures for Automatic Speech Recognition. 1-5 - Jiahao Nie, Shan Lin, Alex C. Kot:
Color Space Learning for Cross-Color Person Re-Identification. 1-6 - Xuan Hai, Xin Liu, Zhaorun Chen, Yuan Tan, Song Li, Weina Niu, Gang Liu, Rui Zhou, Qingguo Zhou:
Ghost-in-Wave: How Speaker-Irrelative Features Interfere DeepFake Voice Detectors. 1-6 - Changjuan Ran, Yeting Guo, Fang Liu, Shenglan Cui, Yunfan Ye:
FedStyle: Style-Based Federated Learning Crowdsourcing Framework for Art Commissions. 1-6 - Yanchao Tan, Zhenghong Lin, Sujie Pan, Siying Xu, Weiming Liu, Guofang Ma, Shiping Wang:
Heterogeneous Hypergraph Structure Learning for Multimedia Recommendation. 1-6 - Liqing Zhu, Xun Jiang, Fumin Shen, Guoqing Wang, Yang Yang, Xing Xu:
Temporal Self-Paced Proposal Learning for Weakly-Supervised Video Moment Retrieval and Highlight Detection. 1-6 - Wei Han, Zhili Qin, Junming Shao:
Interpretable Function Embedding and Module in Convolutional Neural Networks. 1-6 - Zhicheng Cai, Qiu Shen:
Encoding Semantic Priors into the Weights of Implicit Neural Representation. 1-6 - Yiru Wang, Qianqian Li, Xinyue Wang, Qiao Yang, Shunli Zhang:
Unveiling the Significance of Width Dimension in Bird's-Eye View Segmentation. 1-6 - R. Gnana Praveen, Jahangir Alam:
Cross-Attention is not always needed: Dynamic Cross-Attention for Audio-Visual Dimensional Emotion Recognition. 1-6 - Songshi Dou, Xianhao Chen, Kwan L. Yeung:
Enabling Practical and Pervasive Content Delivery from Emerging LEO Mega-Constellations. 1-6 - Bingxin Li, Ying Li, Shihui Ying:
Cross-Evaluation and Re-weighting for Multi-Source-Free Domain Adaptation. 1-6 - Yujiao Jiang, Qingmin Liao, Zhaolong Wang, Xiangru Lin, Zongqing Lu, Yuxi Zhao, Hanqing Wei, Jingrui Ye, Yu Zhang, Zhijing Shao:
SMPLX-Lite: A Realistic and Drivable Avatar Benchmark with Rich Geometry and Texture Annotations. 1-6 - Zipeng Guo, Yuchen Zhou, Chao Gou:
DrivingGen: Efficient Safety-Critical Driving Video Generation with Latent Diffusion Models. 1-6 - Md Adnan Faisal Hossain, Zhihao Duan, Fengqing Zhu:
Flexible Mixed Precision Quantization for Learne Image Compression. 1-8 - Chao Long, Mengning Yang, Kai Li, Zhifu Deng, Kunyuan Jian, Simin Wang:
LPTCGAN: Laplace Pyramid three-layer cyclic high definition image enhancement network. 1-6 - Yiqun Wang, Zhao Zhou, Xiangcheng Du, Xingjiao Wu, Yingbin Zheng, Cheng Jin:
Fine-Grained Scene Image Classification with Modality-Agnostic Adapter. 1-6 - Xinxin Jiao, Liejun Wang, Yinfeng Yu:
MFHCA: Enhancing Speech Emotion Recognition Via Multi-Spatial Fusion and Hierarchical Cooperative Attention. 1-5 - Yunqi Zhao, Yuchen Guo, Zheng Cao, Kai Ni, Ruqi Huang, Lu Fang:
DynamicTrack: Advancing Gigapixel Tracking in Crowded Scenes. 1-6 - Xiaoyuan Guan, Zhiyong Gan, Ling Deng, Wei Shi, Jiankang Chen, Shenshen Bu, Chunliang Zhao, Jianfang Hu, Yuren Zhou, Wei-Shi Zheng, Ruixuan Wang:
Out-of-Distribution Detection by Principal Component Correspondence. 1-6 - Tianjiao Du, Jun Chen, Jiasheng Lu, Qinmei Xu, Huan Liao, Yupeng Chen, Zhiyong Wu:
Controllable Text-to-Audio Generation with Training-Free Temporal Guidance Diffusion. 1-6 - Zichuan Liu, Ke Wang, Mingyuan Wu, Lantao Yu, Klara Nahrstedt, Xin Lu:
I-Matting: Improved Trimap-Free Image Matting. 1-6 - Chenyang Bu, Yunpeng Hong, Shiji Zang, Guojie Chang, Xindong Wu:
Automatic Fusion for Multimodal Entity Alignment: A New Perspective from Automatic Architecture Search. 1-6 - Fei Wang, Jianqiang Sheng, Kai Jiang, Zhineng Zhang, Juepeng Zheng, Baoquan Zhao:
Single Free-Hand Sketch Guided Free-Form Deformation For 3D Shape Generation. 1-6 - Ugochukwu Ejike Akpudo, Yongsheng Gao, Jun Zhou, Andrew Lewis:
Coherentice: Invertible Concept-Based Explainability Framework for CNNs beyond Fidelity. 1-6 - Haoyu Huang, Linxuan He, Faqiang Liu, Rong Zhao, Luping Shi:
Neural Dynamics Pruning for Energy-Efficient Spiking Neural Networks. 1-6 - Jianjun Sun, Yan Zhao, Xinbo Li, Shigang Wang, Jian Wei, Shibo Wang:
Fractional Order Spectrum in SAR Image Registration. 1-6 - Yiwen Tu, Wen Tan, Youneng Bao, Genhong Wang, Fanyang Meng, Yongsheng Liang:
Enhanced Interpretability in Learned Image Compression via Convolutional Sparse Coding. 1-6 - ZhiMin Weng, Jinpu Zhang, Yuehuan Wang:
Joint Language Prompt and Object Tracking. 1-6 - Xingzhe Su, Daixi Jia, Fengge Wu, Junsuo Zhao, Changwen Zheng, Wenwen Qiang:
Unbiased Image Synthesis via Manifold Guidance in Diffusion Models. 1-6 - Wen-Li Wei, Jen-Chun Lin:
Multi-Candidate Motion Modeling for 3D Human Pose and Shape Estimation from Monocular Video. 1-6 - Clement Bled, François Pitié:
Lightweight Video Denoising Using a Classic Bayesian Backbone. 1-6 - Sumei Li, Xiaofei He, Hangwei Liang:
Top-Down Guidance Based ViT-CNN Network Considering Theme Information for Image Aesthetic Assessment. 1-6 - Xudong Zhou, Tianxiang Chen:
FREQFORMER: Efficient Polyp Segmentation via Wavelet Transform. 1-6 - Yi Fan, Yu-Bin Yang:
Training-free Neural Architecture Search on Hybrid Convolution-attention Networks. 1-6 - Bowen Zhao, Licheng Zhang, Lei Zhang, Zhendong Mao:
Neighborhood-Adaptive Context Enhancement Learning For Scene Graph Generation. 1-6 - Shilv Cai, Xiaoguo Liang, Shuning Cao, Luxin Yan, Sheng Zhong, Liqun Chen, Xu Zou:
Powerful Lossy Compression for Noisy Images. 1-6 - Pochun Chen, Nan Zhang, Guoqing Liu, Ge Li:
MFITrack: Multi-Frame Integration Strategy for Enhanced Motion-Centric Single Object Tracking. 1-6 - Yijia Guo, Yuanxi Bai, Liwen Hu, Mianzhi Liu, Ziyi Guo, Lei Ma, Tiejun Huang:
Spike-NeRF: Neural Radiance Field Based On Spike Camera. 1-6 - Hongzhao Li, Hongyu Wang, Xia Sun, Hua He, Jun Feng:
Prompt-Guided Generation of Structured Chest X-Ray Report Using a Pre-trained LLM. 1-6 - Jicheng Yang, Qing Zhang, Yilin Zhao, Yuetong Li, Zeming Liu:
Bi-directional Boundary-object interaction and refinement network for Camouflaged Object Detection. 1-6 - Shenghao Chen, Zhe Liu, Jun Chen, Yuqing Song, Yi Liu, Qiaoying Teng:
Tutor Assisted Feature Distillation. 1-6 - Fengqiang Wan, Xiangyu Wu, Zhihao Guan, Yang Yang:
CoVLR: Coordinating Cross-Modal Consistency and Intra-Modal Relations for Vision-Language Retrieval. 1-6 - Xiaolong Xiong, Jinhan Cui, Jiaxiong Liu, Shuzhan Guo, Jun Zhou:
Inverse Optimization for Multi-View Multiple Clustering. 1-6 - Qipeng Zhu, Jie Chen, Junping Zhang, Jian Pu:
G-MIMO: Empowering GNNs with Diverse Sub-Networks for Graph Classification. 1-6 - Fuyang Yu, Runze Tian, Zhen Wang, Xiaochuan Wang, Xiaohui Liang:
CUS3D: Clip-Based Unsupervised 3D Segmentation via Object-Level Denoise. 1-6 - Yaxiong Chen, Xueping Zhang, Yunfei Zi, Shengwu Xiong:
Adaptive Learning via a Negative Selection Strategy for Few-Shot Bioacoustic Event Detection. 1-6 - Jiefeng Lin, Chenlin Fu, Qiang Huang, Yingying Zhu:
Contextual Interaction Enhancement Network for Smoke Detection. 1-6 - Jie Zhang, Hao Xiong, Hecang Zang, Meng Zhou, Dong Liu, Zhonghua Liu, Hualei Shen:
AuxSegCount: Auxiliary Seg-Attention Based Network for Wheat Ears Counting in Field Conditions. 1-6 - Liang Wen, Lizhong Wang, Yuxing Zheng, Weijing Shi, Kwang Pyo Choi:
FT-CSR: Cascaded Frequency-Time Method for Coded Speech Restoration. 1-6 - Chuang Ding, Yang Wu, Huihui Song, Kaihua Zhang, Xu Zhang, Zhenhua Guo:
Language-Guided Semantic Alignment for Co-saliency Detection. 1-6 - Yewei Gu, Xianfeng Zhao, Xiaowei Yi:
RLVC: Robust and Lightweight Voice Conversion Using Cross-Adaptive Instance Normalization. 1-6 - Chenhao Shuai, Rizhao Cai, Bandara Dissanayake, Amanda Newman, Dayan Guan, Dennis Sng, Ling Li, Alex C. Kot:
Controllable and Gradual Facial Blemishes Retouching Via Physics-Based Modelling. 1-6 - Weimin Wang, Yingxu Deng, Zezeng Li, Yu Liu, Na Lei:
MergeNet: Explicit Mesh Reconstruction from Sparse Point Clouds via Edge Prediction. 1-6 - Wei Wang, Zhi Jin:
CAPformer: Compression-Aware Pre-trained Transformer for Low-Light Image Enhancement. 1-6 - Kanghui Wu, Dongyan Guo:
Semantic Bridging and Feature Anchoring for Class Incremental Learning. 1-6 - Haixu Song, Fangfu Liu, Chenyu Zhang, Yueqi Duan:
ToW3D: Consistency-aware Interactive Point-based Mesh Editing on GANs. 1-6 - Sheng Chen, Fei Yang, Aimin Pan, Zhewei Mei:
Wi-Fi based Gait Recognition using Spectrogram and Phase. 1-6 - Jihao Dong, Hua Yang, Renjie Pan:
Exploring Interactive Semantic Alignment for Efficient HOI Detection with Vision-language Model. 1-6 - Suwei Zhang, Tai Ma, Ying Wen:
RC-Block: Refinement Coefficient for Rectifying Deformation Field. 1-6 - Haixiang Zhu, Jing Ye, Jianbing Tang, Yiping Song:
DiffuStra: A Diffusion Model for Dialog Strategy in Non-Collaborative Dialog Systems. 1-6 - Nesryne Mejri, Pavel Chernakov, Polina Kuleshova, Enjie Ghorbel, Djamila Aouada:
Facial Region-Based Ensembling for Unsupervised Temporal Deepfake Localization. 1-6 - Rukai Wei, Heng Cui, Yu Liu, Yanzhao Xie, Yufeng Hou, Ke Zhou:
Contrastive masked auto-encoders based self-supervised hashing for 2D image and 3D point cloud cross-modal retrieval. 1-6 - Jianing Han, Jiangrong Shen, Qi Xu, Jian Liu, Huajin Tang:
The Balanced Multi-Modal Spiking Neural Networks with Online Loss Adjustment and Time Alignment. 1-6 - Yuwen Yang, Yuxiang Lu, Suizhi Huang, Shalayiding Sirejiding, Chang Liu, Muyang Yi, Zhaozhi Xie, Yue Ding, Hongtao Lu:
BARTENDER: A simple baseline model for task-level heterogeneous federated learning. 1-6 - Huan Li, Xinpeng Huang, Ping An:
Low Bitrate Light Field Video Compression with Two-step Refinement Reconstruction. 1-6 - Fanxiao Li, Ping Wei, Tingchao Fu, Yu Lin, Wei Zhou:
Imperceptible Text Steganography based on Group Chat. 1-6 - Xiaoke Zhu, Danyang Li, Xiaopan Chen, Fumin Qi, Fan Zhang, Xiao-Yuan Jing:
Similarity Mining via Implicit Matching Pattern Learning for Kinship Verification. 1-6 - Xiao Liang, Siyuan Duan, Lijie Zheng, Yuqian Zeng:
Unsupervised Monte Carlo Denoising via Learning Contrastive Disentanglement Representation. 1-6 - Ruihang Li, Shanding Ye, Zhe Yin, Tao Li, Zehua Zhang, KaiKai Xiao, Zhijie Pan:
M2Depth: A Novel Self-Supervised Multi-Camera Depth Estimation with Multi-Level Supervision. 1-6 - Daowan Peng, Wei Wei:
Overcoming Language Priors for Visual Question Answering Based on Knowledge Distillation. 1-6 - Jiaxu Leng, Zhanjie Wu, Mengjingcheng Mo, Mingpi Tan, Shuang Li, Xinbo Gao:
Modality-Free Violence Detection via Cross-Modal Causal Attention and Feature Distillation. 1-6 - Laiming Jiang, Jiawei Liu, Shu Wang, Jun Liao, Qingsong Li, Zhengyang Li, Shen Chen, Li Liu:
Multi-channel Spatio-Temporal Causal Representation Model for Cognitive Load Assessment in Physiological Signals. 1-6 - Bin Kang, Bin Chen, Junjie Wang, Weizhi Xian, Huifeng Chang:
Multi-Attribute Consistency Driven Visual Language Framework for Surface Defect Detection. 1-5 - Rui Deng, Yuke Li:
DeCMG: Denoise with Cross-modality Guidance Makes Better Text-Video Retrieval. 1-6 - Qianyu Li, Xiaoli Tang, Siyao Zhou, Han Yu, Hengjie Song, Lizhen Cui, Xiaoxiao Li:
FedRMS: Privacy-Preserving Federated Knowledge Graph Embedding Through Randomization. 1-6 - Yuan Gao, Zilei Wang, Yixin Zhang:
Delve into Source and Target Collaboration in Semi-supervised Domain Adaptation for Semantic Segmentation. 1-6 - Kaiyue Zhou, Ming Dong, Peiyuan Zhi, Shengjin Wang:
Cascaded Network with Hierarchical Self-Distillation for Sparse Point Cloud Classification. 1-6 - Keli Wen, Nan Zhang, Ge Li, Wei Gao:
MPVNN: Multi-resolution Point-Voxel Non-parametric Network for 3D Point Cloud Processing. 1-6 - Tianlong Zhang, Zhe Xue, Yuchen Dong, Junping Du, Meiyu Liang:
A Multi-View Double Alignment Hashing Network with Weighted Contrastive Learning. 1-6 - Li Keyao, Kai Liu, Min Peng, Bo Zhao, Li Jiangyuanhong, Jiahui Zhu:
MACFAN: A multi-channel fusion network for subjective aesthetic attributes with automated comments labeling pipeline. 1-6 - Chengji Wang, Zhiming Luo, Shaozi Li:
Omni-Granularity Embedding Network for Text-to-Image Person Retrieval. 1-6 - Zeyun Zhao, Rong Wang, Jianzhe Gao, Zhiming Luo, Shaozi Li:
Mask Matching Network for Self-supervised Few-shot Medical Image Segmentation. 1-6 - Chaoxiang He, Yimiao Zeng, Xiaojing Ma, Bin Benjamin Zhu, Zewei Li, Shixin Li, Hai Jin:
MysticMask: Adversarial Mask for Impersonation Attack Against Face Recognition Systems. 1-6 - Han Cao, Lingwei Wei, Wei Zhou, Songlin Hu:
Multi-source Knowledge Enhanced Graph Attention Networks for Multimodal Fact Verification. 1-6 - Tong Zhang, Wenxue Cui, Shaohui Liu, Feng Jiang:
SC-HVPPNet: Spatial and Channel Hybrid-Attention Video Post-Processing Network with CNN and Transformer. 1-6 - Chuanming Tang, Kai Wang, Joost van de Weijer:
IterInv: Iterative Inversion for Pixel-Level T2I Models. 1-6 - Weijie Li, Luwei Xiao, Xingjiao Wu, Tianlong Ma, Jiabao Zhao, Liang He:
Artistry in Pixels: FVS - A Framework for Evaluating Visual Elegance and Sentiment Resonance in Generated Images. 1-6 - Guoxuan Mao, Ting Cao, Ziyang Li, Yuan Dong:
Enhancing Shape Perception and Segmentation Consistency for Industrial Image Inspection. 1-6 - Shu Wang, Zhe Qu, Yuan Liu, Shichao Kan, Yixiong Liang, Jianxin Wang:
FedMMR: Multi-Modal Federated Learning via Missing Modality Reconstruction. 1-6 - Xinyu Feng, Cong Li, Qingni Shen, Jisheng Dong, Wenjun Qian, Yuejian Fang, Zhonghai Wu:
HyPRE: Hybrid Proxy Re-Encryption for Secure Multimedia Data Sharing on Mobile Devices. 1-6 - Xiangru Lin, Shenghua Zhong, Yan Liu, Gong Chen:
Sal-Guide Diffusion: Saliency Maps Guide Emotional Image Generation through Adapter. 1-6 - Yuwu Lu, Chunzhi Liu:
Pseudolabel Distillation with Adversarial Contrastive Learning for Semisupervised Domain Adaptation. 1-6 - Zhenzhe Gao, Zhenjun Tang, Zhaoxia Yin, Baoyuan Wu, Yue Lu:
Fragile Model Watermark for integrity protection: leveraging boundary volatility and sensitive sample-pairing. 1-6 - Bingfei Fu, Xiangyang Xue:
Unsupervised Object Discovery Via Object-Centric Representation. 1-6 - Sisi You, Bing-Kun Bao:
Dynamic Scene Graph Generation with Unified Temporal Modeling. 1-6 - Shuang Cheng, Zhanyu Ma, Jian Ye:
A Benchmark of Zero-Shot Cross-Lingual Task-Oriented Dialogue Based on Adversarial Contrastive Representation Learning. 1-6 - Zhongzhu Yang, Liang Luo, Yu Gu, Fuji Ren:
K-Face Net: A Two-Stage Framework for Balanced Feature Space in Facial Expression Recognition. 1-6 - Jiaqi Guo, Sitong Su, Junchen Zhu, Lianli Gao, Jingkuan Song:
Training-Free Semantic Video Composition via Pre-trained Diffusion Model. 1-6 - Qiqin Lin, Weixing Xie, Rongzhou Zhou, Xianpeng Cao, Jingze Chen, Junfeng Yao, Qingqi Hong:
DPP-Net: Difficulty Perception-Processing Heterogeneous Network for Semi-supervised Medical Image Segmentation. 1-6 - Zheng Zhou, Zongxin Liu, Yongyong Chen, Bingzhi Chen, Biqing Zeng, Yicong Zhou:
Deep Unfolding 3D Non-Local Transformer Network for Hyperspectral Snapshot Compressive Imaging. 1-6 - Zhenhu Zhang, Li Jin, Dan Song, Jiahua Dong, Ruofeng Tong:
FedDGP: Disentangling Global and Personal Models for Federated Learning. 1-6 - Ling Li, Junliang Xing, Xinchun Yu, Xiao-Ping Zhang:
Deviation Wing Loss for High-Performance 2D Pose Estimation. 1-6 - Jintao Tan, Xize Cheng, Lingyu Xiong, Lei Zhu, Xiandong Li, Xianjia Wu, Kai Gong, Minglei Li, Yi Cai:
Landmark-guided Diffusion Model for High-fidelity and Temporally Coherent Talking Head Generation. 1-6 - Ning Xu, Jingqiu Li, Lanjun Wang, Anan Liu:
Rumor Detection Framework Based on Multi-source Knowledge Adaptation. 1-6 - Mingzhe Yu, Lei Wu, Changshuo Wang, Lei Meng, Xiangxu Meng:
LayoutDM: Precision Multi-Scale Diffusion for Layout-to-Image. 1-6 - Yuxin Tian, Mouxing Yang, Yunfan Li, Dayiheng Liu, Xingzhang Ren, Xi Peng, Jiancheng Lv:
An Empirical Study of Parameter Efficient Fine-tuning on Vision-Language Pre-train Model. 1-6 - Yan Jiang, Guisheng Yin, Ye Yuan, Jingjing Chen, Zhipeng Wei:
Cross-Point Adversarial Attack Based on Feature Neighborhood Disruption Against Segment Anything Model. 1-6 - Zhenyu Li, Congju Du, Huijuan Zhao, Li Yu:
Offset-based Disentangled Representation for Efficient Human Pose Estimation. 1-6 - Zhihang Zhu, Yunfeng Yan, Yi Chen, Haoyuan Jin, Xuesong Nie, Donglian Qi, Xi Chen:
SAMP: Adapting Segment Anything Model for Pose Estimation. 1-7 - Zhongqiang Zhang, Fuhan Cai, Duo Liu, Ge Liu, Xiangzhong Fang:
Mix background and foreground separately: Transformer-based Augmentation Strategies for Domain Generalization. 1-6 - Yanchao Liang, Xiangqian Wu:
Do Keypoints Contain Crucial Information? Mining Keypoint Information to Enhance Cross-View Geo-Localization. 1-6 - Jiayang Gu, Xovee Xu, Yulu Tian, Yurun Hu, Jiadong Huang, Wenliang Zhong, Fan Zhou, Lianli Gao:
RRE: A Relevance Relation Extraction Framework for Cross-domain Recommender System at Alipay. 1-6 - Hongfei Xue, Qijie Shao, Kaixun Huang, Peikun Chen, Jie Liu, Lei Xie:
SSHR: Leveraging Self-supervised Hierarchical Representations for Multilingual Automatic Speech Recognition. 1-6 - Zhenrong Huang, Bin Chen:
Unsupervised Multi-Modal Medical Image Registration via query-selected attention and decoupled Contrastive Learning. 1-6 - Shuai Yu, Xiaoliang He, Yanting Zhang:
RevNet: A Review Network with Group Aggregation Fusion for Singing Melody Extraction. 1-6 - Zhen Liang, Enyu Che, Guoqiang Xiao, Jingwei Qu:
Multi-granularity Correlation Refinement for Semantic Correspondence. 1-6 - Xiao Liang, Tao Shi, Yaoyuan Liang, Te Tao, Shao-Luo Huang:
Exploring Iterative Refinement with Diffusion Models for Video Grounding. 1-6 - Junqing Huang, Xiaochen Yuan, Chan-Tong Lam, Wei Ke:
MSFGNet: Multi-Scale Features Gathering Network for Change Detection of Remote Sensing Images. 1-6 - Gang Wu, Junjun Jiang, Kui Jiang, Xianming Liu:
Exploiting Self-Supervised Constraints in image Super-Resolution. 1-6 - Gang Liu, Jing Jia, Rui Mao, Yan Ji:
FedCA: Federated learning based on classification layer alignment. 1-6 - Hongyan Xu, Xiu Su, Arcot Sowmya, Ian Katz, Dadong Wang:
SCD-NAS: Towards Zero-Cost Training in Melanoma Diagnosis. 1-6 - Chuanfeng Yang, Kaiheng Li, Jiahui Chen, Qingqi Hong:
FFnsr: Fast and Fine Neural Surface Reconstruction. 1-6 - Yuxuan Chen, Chengbo Wang, Xiuying Wang:
CMSCL: Cross-Modal Spatial Contrastive Learning for 3D Medical Image Classification. 1-6 - Ya Jiang, Qing Wang, Jun Du, Maocheng Hu, Pengfei Hu, Zeyan Liu, Shi Cheng, Zhaoxu Nian, Yuxuan Dong, Mingqi Cai, Xin Fang, Chin-Hui Lee:
Exploring Audio-Visual Information Fusion for Sound Event Localization and Detection In Low-Resource Realistic Scenarios. 1-6 - Chun Wang:
Data Standardization for Robust Lip Sync. 1-6 - Yi Fan, Yu-Bin Yang:
Training-free Neural Architectural Search on Transformer via Evaluating Expressivity and Trainability. 1-6 - Chuang Liu, Haogang Zhu, Xiu Su:
DomainVoyager: Embracing The Unknown Domain by Prompting for Automatic Augmentation. 1-7 - Dongming Zhou, Zhengbin Pang:
Heuristic Action-aware and Priority Communication for Multi-agent Path Finding. 1-6 - Songping Wang, Hanqing Liu, Haochen Zhao:
Public-Domain Locator for Boosting Attack Transferability on Videos. 1-6 - Tao He, Leqi Shen, Guiguang Ding, Zhiheng Zhou, Tianshi Xu, Xiaofeng Jin, Yuheng Huang:
Camera Bias Regularization for Person Re-identification. 1-6 - Chenyue Liang, Jiabei Zeng, Mingjie He, Dongmei Jiang, Shiguang Shan:
Facial Action Unit Detection with the Semantic Prompt. 1-6 - Tingyu Li, Junpeng Bao, Jiaqi Qin, Yuping Liang, Ruijiang Zhang, Jason Wang:
Multi-modal Intent Detection with LVAMoE: the Language-Visual-Audio Mixture of Experts. 1-6 - Ziqi Wang, Yao Lu, Shunzhou Wang, Wang Xia, Peiqi Xia, Wenjing Wang:
Trident Transformer for Light Field Image Super-Resolution. 1-6 - Jisheng Bai, Han Yin, Mou Wang, Dongyuan Shi, Woon-Seng Gan, Jianfeng Chen, Susanto Rahardja:
Audiolog: LLMs-Powered Long Audio Logging with Hybrid Token-Semantic Contrastive Learning. 1-6 - Jiehang Xie, Xuanbai Chen, Shao-Ping Lu:
An Aesthetic-Guided Multimodal Framework for Video Summarization. 1-6 - Junyang Qiu, Zhanxiang Feng, Lei Wang, Jianhuang Lai:
Salient Part-Aligned and Keypoint Disentangling Transformer for Person Re-Identification in Aerial Imagery. 1-6 - Yuchen Li, Fan Wan, Yang Long:
SID-NERF: Few-Shot Nerf Based on Scene Information Distribution. 1-6
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.