27th ICPR 2024: Kolkata, India - Part XIX
- Apostolos Antonacopoulos, Subhasis Chaudhuri, Rama Chellappa, Cheng-Lin Liu, Saumik Bhattacharya, Umapada Pal: Pattern Recognition - 27th International Conference, ICPR 2024, Kolkata, India, December 1-5, 2024, Proceedings, Part XIX. Lecture Notes in Computer Science 15319, Springer 2025, ISBN 978-3-031-78494-1
- Igor P. Maurell, Pedro L. Corçaque, Cris L. Froes, João Francisco S. S. Lemos, Felipe G. Oliveira, Paulo L. J. Drews-Jr: Geometric Deep Learning in Industrial Scenes: A Large-Scale 3D Synthetic Dataset. 1-19
- Aakansha Mishra, Ashish Anand, Prithwijit Guha: Visual Question Answering with Cascade of Self- and Co-Attention Blocks. 20-36
- Yu Weng, Xuming Ye, Tianjiao Xing, Zheng Liu, Chaomurilige, Xuan Liu: Facet-Aware Multimodal Summarization via Cross-Modal Alignment. 37-52
- Aniket Gurav, Narayanan C. Krishnan, Sukalpa Chanda: Word-Diffusion: Diffusion-Based Handwritten Text Word Image Generation. 53-72
- Marco Peer, Robert Sablatnig, Olga Serbaeva, Isabelle Marthot-Santaniello: KaiRacters: Character-Level-Based Writer Retrieval for Greek Papyri. 73-88
- Weijia Zhang, Jia-Hong Huang, Svitlana Vakulenko, Yumo Xu, Thilina Rajapakse, Evangelos Kanoulas: Beyond Relevant Documents: A Knowledge-Intensive Approach for Query-Focused Summarization Using Large Language Models. 89-104
- Honghui Yuan, Keiji Yanai: Font Style Translation in Scene Text Images with CLIPstyler. 105-121
- Markus Muth, Marco Peer, Florian Kleber, Robert Sablatnig: Advancing Handwritten Text Detection by Synthetic Text. 122-136
- Joseph Assaker, Stéphane Nicolas, Laurent Heutte: Learning-Based Sub-image Retrieval in Historical Document Images. 137-151
- Anna Scius-Bertrand, Michael Jungo, Lars Vögtlin, Jean-Marc Spat, Andreas Fischer: Zero-Shot Prompting and Few-Shot Fine-Tuning: Revisiting Document Image Classification Using Large Language Models. 152-166
- Minesh Mathew, Ajoy Mondal, C. V. Jawahar: Towards Deployable OCR Models for Indic Languages. 167-182
- Ayush Roy, Shivakumara Palaiahnakote, Umapada Pal, Apostolos Antonacopoulos, Michael Blumenstein: XLSI: A New Xception and Log Polar Transform Based Approach for Scene Text Script Identification. 183-198
- Kunal Purkayastha, Shashwat Sarkar, Palaiahnakote Shivakumara, Umapada Pal, Palash Ghosal, Xiao-Jun Wu: DITS: A New Domain Independent Text Spotter. 199-216
- Vaibhav Agrawal, Niharika Vadlamudi, Muhammad Waseem, Amal Joseph, Sreenya Chitluri, Ravi Kiran Sarvadevabhatla: LineTR: Unified Text Line Segmentation for Challenging Palm Leaf Manuscripts. 217-233
- Shuai Li, Xiao-Hui Li, Fei Yin, Lin-Lin Huang: Region-Level Layout Generation for Multi-level Pre-trained Model Based Visual Information Extraction. 234-249
- Said Nammneh, Boraq Madi, Nour Atamni, Shoshana Boardman, Daria Vasyutinsky Shapira, Irina Rabaev, Raid Saabni, Jihad El-Sana: Detecting Spiral Text Lines in Aramaic Incantation Bowls. 250-264
- Shreya Goswami, Naveen Saini, Saurabh Shukla: Incorporating Domain Knowledge in Multi-objective Optimization Framework for Automating Indian Legal Case Summarization. 265-280
- Ruddy Théodose, Jean-Christophe Burie: VisEmoComic: Visual Emotion Recognition in Comics Image. 281-296
- Kenny Davila, Rupak Lazarus, Fei Xu, Nicole Rodríguez Alcántara, Srirangaraj Setlur, Venu Govindaraju, Ajoy Mondal, C. V. Jawahar: CHART-Info 2024: A Dataset for Chart Analysis and Recognition. 297-315
- Piyush Kanti Samanta, Samit Biswas: AIO-HB: A Handwritten Text Image Dataset of Hindi and Bengali Indian Scripts for Handwritten Text Recognition. 316-332
- Ajoy Mondal, C. V. Jawahar: Unconstrained Camera Captured Indic Offline Handwritten Dataset. 333-348
- Bin Zhang, Chaofan Zou, Wenwen Song: FTC: A Novel Triplet Classification Model for Joint Entity and Relation Extraction. 349-364
- Liang Zhang, Nan Zheng: H2O2Net: A Novel Entity-Relation Linking Network for Joint Relational Triple Extraction. 365-382
- Xiaoping Qiu, Ke Yang, Shiling Du: BADA-LAT: Efficient Local Attention Transformer for Chinese Named Entity Recognition with Boundary and LLM-Based Data Augmentation. 383-398
- Faren Yan, Peng Yu, Xin Chen: LTNER: Large Language Model Tagging for Named Entity Recognition with Contextualized Entity Marking. 399-411
- Yuxuan Hu, Chenwei Zhang, Min Yang, Xiaodan Liang, Chengming Li, Xiping Hu: Learning to Generalize Unseen Domains via Multi-source Meta Learning for Text Classification. 412-428
- Krishnendu Ghosh, Munmun Patra, Eshita Dey: Enhancing Bengali Text-to-Speech Synthesis Through Transformer-Driven Text Normalization. 429-444
- Cristiano Mesquita Garcia, Alessandro Lameiras Koerich, Alceu de Souza Britto Junior, Jean Paul Barddal: Improving Sampling Methods for Fine-Tuning SentenceBERT in Text Streams. 445-459
- Arjun Ramesh Kaushik, R. P. Sunil Rufus, Nalini K. Ratha: Enhancing Authorship Attribution Through Embedding Fusion: A Novel Approach with Masked and Encoder-Decoder Language Models. 460-471
- Farhan Noor Dehan, Md Fahim, AKM Mahabubur Rahman, M. Ashraful Amin, Amin Ahsan Ali: TinyLLM Efficacy in Low-Resource Language: An Experiment on Bangla Text Classification Task. 472-487