16th DAS 2024: Athens, Greece
- Giorgos Sfikas, George Retsinas:
Document Analysis Systems - 16th IAPR International Workshop, DAS 2024, Athens, Greece, August 30-31, 2024, Proceedings. Lecture Notes in Computer Science 14994, Springer 2024, ISBN 978-3-031-70441-3
Document Analysis and Understanding
- Masaki Nakagawa, Hung Tuan Nguyen, Thanh-Nghia Truong, Nam Tuan Ly, Cuong Tuan Nguyen, Haruki Oka, Tsunenori Ishioka, Tomo Asakura, Hiroshi Miyazawa, Takahiro Yamamoto, Toshihiko Horie, Fumiko Yasuno:
Two Experiments for Automatic Scoring of Handwritten Descriptive Answers. 3-19
- Arooba Maqsood, Adnan Ul-Hasan, Faisal Shafait:
Transformer-Based Architecture for Judgment Prediction and Explanation in Legal Proceedings. 20-36
- Muhammad Saif Ullah Khan, Tahira Shehzadi, Rabeya Noor, Didier Stricker, Muhammad Zeshan Afzal:
Enhanced Bank Check Security: Introducing a Novel Dataset and Transformer-Based Approach for Detection and Verification. 37-54
Retrieval and VQA
- Qi Dong, Lei Kang, Dimosthenis Karatzas:
Multi-page Document VQA with Recurrent Memory Transformer. 57-70
- Tosin P. Adewumi, Nudrat Habib, Lama Alkhaled, Elisa Barney:
Instruction Makes a Difference. 71-88
- Artemis Llabrés, Arka Ujjal Dey, Dimosthenis Karatzas, Ernest Valveny:
Image-Text Matching for Large-Scale Book Collections. 89-102
Layout Analysis
- Zezhong Guo, Yongjian Zhang, Shibo Chen, Chiching Wei:
RCAM-Transformer: A Novel Approach to Table Reconstruction Using Row-Column Attention Mechanism. 105-123
- Zhangchi Gao, Shoubin Li, Yangyang Liu, Mingyang Li, Kai Huang, Yi Ren:
LD-DOC: Light-Weight Domain-Adaptive Document Layout Analysis. 124-141
- Talha Uddin Sheikh, Tahira Shehzadi, Khurram Azeem Hashmi, Didier Stricker, Muhammad Zeshan Afzal:
UnSupDLA: Towards Unsupervised Document Layout Analysis. 142-161
Document Classification
- Daichi Haraguchi, Brian Kenji Iwana, Seiichi Uchida:
What Text Design Characterizes Book Genres? 165-181
- Taylor Archibald, Tony Martinez:
Leveraging Semantic Segmentation Masks with Embeddings for Fine-Grained Form Classification. 182-195
- Ricardo Batista das Neves Junior, Byron Leite Dantas Bezerra, Cleber Zanchettin:
DocLightDetect: A New Algorithm for Occlusion Classification in Identification Documents. 196-210
OCR Correction and NLP
- Arthur Hemmer, Mickaël Coustaty, Nicola Bartolo, Jean-Marc Ogier:
Confidence-Aware Document OCR Error Detection. 213-228
- Rina Suzuki, Hisao Usui, Hiroaki Ozaki, Hung Tuan Nguyen, Kanako Komiya, Tsunenori Ishioka, Masaki Nakagawa:
Error Correction of Japanese Character-Recognition in Answers to Writing-Type Questions Using T5. 229-243
- João Macedo, Byron L. D. Bezerra, Cleber Zanchettin:
How Does Changing the Optical Character Recognition System Impact the Layout-Aware Named Entity Recognition Models? 244-257
- Laraib Kaleem, Arif Ur Rahman, Momina Moetesum:
RUATS: Abstractive Text Summarization for Roman Urdu. 258-273
Recognition Systems
- Daniel Parres, Dan Anitei, Roberto Paredes, Joan-Andreu Sánchez, José-Miguel Benedí:
Speed-Up Pre-trained Vision Encoder-Decoder Transformers by Leveraging Lightweight Mixer Layers for Text Recognition. 277-294
- Markus Muth, Marco Peer, Florian Kleber, Robert Sablatnig:
Maximizing Data Efficiency of HTR Models by Synthetic Text. 295-311
- Carlos Peñarrubia, Jose J. Valero-Mas, Jorge Calvo-Zaragoza:
Contrastive Self-Supervised Learning for Optical Music Recognition. 312-326
- Ali Yesilkanat, Yann Soullard, Bertrand Coüasnon, Nathalie Girard:
Full-Page Music Symbols Recognition: State-of-the-Art Deep Model Comparison for Handwritten and Printed Music Scores. 327-343
Historical Documents
- Adrià Molina, Oriol Ramos Terrades, Josep Lladós:
Fetch-A-Set: A Large-Scale OCR-Free Benchmark for Historical Document Retrieval. 347-362
- Hussein Mohammed, Mahdi Jampour:
From Detection to Modelling: An End-to-End Paleographic System for Analysing Historical Handwriting Styles. 363-376
- Florian Kordon, Nikolaus Weichselbaumer, Randall Herz, Janne van der Loop, Stephen Mossman, Edward Potten, Mathias Seuret, Martin Mayr, Fei Wu, Vincent Christlein:
fang: Fast Annotation of Glyphs in Historical Printed Documents. 377-392
- Giorgos Sfikas, Panagiotis Dimitrakopoulos, George Retsinas, Christophoros Nikou, Pinelopi Kitsiou:
Bessarion: Medieval Greek Inscriptions on a Challenging Dataset for Vision and NLP Tasks. 393-407
- Usman Nawaz, Liliana Lo Presti, Marianna Napolitano, Marco La Cascia:
Automatic Lemmatization of Old Church Slavonic Language Using A Novel Dictionary-Based Approach. 408-421
- Esma F. Bilgin Tasdemir, Zeynep Tandogan, S. Dogan Akansu, Firat Kizilirmak, M. Umut Sen, Aysu Akcan, Mehmet Kuru, Berrin Yanikoglu:
Automatic Transcription of Ottoman Documents Using Deep Learning. 422-435