default search action
27th ICPR 2024: Kolkata, India - Part XXI
- Apostolos Antonacopoulos, Subhasis Chaudhuri, Rama Chellappa, Cheng-Lin Liu, Saumik Bhattacharya, Umapada Pal:
Pattern Recognition - 27th International Conference, ICPR 2024, Kolkata, India, December 1-5, 2024, Proceedings, Part XXI. Lecture Notes in Computer Science 15321, Springer 2025, ISBN 978-3-031-78304-3 - Zujun Fu, Hanjiang Lai, Yan Pan:
Long-Tailed Hashing with Wasserstein Quantization. 1-14 - Shun Obikane, Haruna Tagawa, Yoshimitsu Aoki:
Unsupervised Metric Learning for Expressing Color and Shape Information to Uncover Abstract Connections within Image Datasets. 15-30 - Jimin Sohn, Haeji Jung, Zhiwen Yan, Vibha Masti, Xiang Li, Bhiksha Raj:
Fashion Image Retrieval with Occlusion. 31-46 - Jun Ding, Jiaoyan Wang, Alimjan Aysa, Xuebin Xu, Kurban Ubul:
Oracle Bone Inscription Image Retrieval Based on Improved ResNet Network. 47-62 - Shuming Zhang, Xiaojun Wu, Tianyang Xu, Donglin Zhang:
Novel Clustering Aggregation and Multi-grained Alignment for Image-Text Matching. 63-79 - Debojyoti Misra, Suryansh Goel, Tushar Sandhan:
Ensembling YOLO and ViT for Plant Disease Detection. 80-94 - Chandrakanth Gudavalli, Erik Rosten, Lakshmanan Nataraj, Shivkumar Chandrasekaran, B. S. Manjunath:
CIMGEN: Controlled Satellite Image Manipulation by Finetuning Pretrained Generative Models on Limited Data. 95-110 - Silvia Cappelletti, Lorenzo Baraldi, Federico Cocchi, Marcella Cornia, Lorenzo Baraldi, Rita Cucchiara:
Adapt to Scarcity: Few-Shot Deepfake Detection via Low-Rank Adaptation. 111-126 - Md Shamim Seraj, Shayok Chakraborty:
Multi-source Deep Domain Adaptation for Deepfake Detection. 127-142 - Beibei Dong, Bo Peng, Jing Dong:
SPI2I: Structure-Preserved Image-to-Image Translation with Diffusion Models. 143-159 - Mamadou Keita, Wassim Hamidouche, Hessen Bougueffa Eutamene, Abdelmalik Taleb-Ahmed, Abdenour Hadid:
FIDAVL: Fake Image Detection and Attribution Using Vision-Language Model. 160-176 - Orazio Pontorno, Luca Guarnera, Sebastiano Battiato:
DeepFeatureX Net: Deep Features eXtractors Based Network for Discriminating Synthetic from Real Images. 177-193 - Tai-Ming Huang, Yue-Hua Han, Ernie Chu, Shu-Tzu Lo, Kai-Lung Hua, Jun-Cheng Chen:
Generalized Image-Based Deepfake Detection Through Foundation Model Adaptation. 194-210 - Taiba Majid Wani, Irene Amerini:
Audio Deepfake Detection: A Continual Approach with Feature Distillation and Dynamic Class Rebalancing. 211-227 - Rajjeshwar Ganguly, Mamadou Dian Bah, Mohamed Dahmane:
Diffusion Models as a Representation Learner for Deepfake Image Detection. 228-241 - Mohammad Ahangar Kiasari, Khan Muhammad, Sambit Bakshi, Ik Hyun Lee:
Hybrid Transformer-CNN-Based Attention in Video Turbulence Mitigation (HATM). 242-256 - Ao Wei, Hanbin Zhang, Erhu Zhao:
DereflectFormer: Vision Transformers for Single Image Reflection Removal. 257-274 - Qijun Shi, Hongjian Zhan, Yangfu Li, Weijun Zou, Huasheng Li, Umapada Pal, Yue Lu:
LK-Net: Efficient Large Kernel ConvNet for Document Enhancement. 275-290 - Alik Pramanick, Utsav Bheda, Arijit Sur:
ML-CrAIST: Multi-scale Low-High Frequency Information-Based Cross Attention with Image Super-Resolving Transformer. 291-307 - Mohd Ubaid Wani, Md Raqib Khan, Ashutosh Kulkarni, Shruti S. Phutke, Santosh Kumar Vipparthi, Subrahmanyam Murala:
Attentive Color Fusion Transformer Network (ACFTNet) for Underwater Image Enhancement. 308-324 - Yujie Wang, Bing Li, Jie Huang, Feng Zhao:
Unsupervised Low-Light Image Enhancement with Dual Contrastive Learning. 325-338 - Vaishnavi Ravi, Siddharth Parlapalli, Sameer Ranjan, Rama Krishna Gorthi:
Transformer-Based Fringe Restoration for Shadow Mitigation in Fringe Projection Profilometry. 339-354 - Kazi Reyazul Hasan, Muhammad Abdullah Adnan:
EMPATH: MediaPipe-Aided Ensemble Learning with Attention-Based Transformers for Accurate Recognition of Bangla Word-Level Sign Language. 355-371 - Lipisha Chaudhary, Fei Xu, Ifeoma Nwogu:
Cross-Attention Based Influence Model for Manual and Nonmanual Sign Language Analysis. 372-386 - Ming-Han Lee, Yu-Chen Zhang, Kun-Ru Wu, Yu-Chee Tseng:
GolfPose: From Regular Posture to Golf Swing Posture. 387-402 - Deepak Kumar, Piyush Dhamdhere, Balasubramanian Raman:
Fusing Multimodal Streams for Improved Group Emotion Recognition in Videos. 403-418 - Seonggwan Ko, Donghyeon Cho:
CSSR: Cross-and Self-feature Transformer with High-Frequency Feature Alignment for Reference-Based Super-Resolution. 419-434 - Lu Li, Yanjiao Shi, Jinyu Yang, Qiangqiang Zhou, Qing Zhang, Liu Cui:
Transformer-Based Depth Optimization Network for RGB-D Salient Object Detection. 435-450 - Tong Shi, Xuri Ge, Joemon M. Jose, Nicolas Pugeault, Paul Henderson:
Detail-Enhanced Intra- and Inter-modal Interaction for Audio-Visual Emotion Recognition. 451-465
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.