default search action
17th ICDAR 2023: San José, CA, USA - Part V
- Gernot A. Fink, Rajiv Jain, Koichi Kise, Richard Zanibbi:
Document Analysis and Recognition - ICDAR 2023 - 17th International Conference, San José, CA, USA, August 21-26, 2023, Proceedings, Part V. Lecture Notes in Computer Science 14191, Springer 2023, ISBN 978-3-031-41733-7
Posters: Text and Document Recognition
- Francesc Net, Marc Folia, Pep Casals, Lluís Gómez:
Transductive Learning for Near-Duplicate Image Detection in Scanned Photo Collections. 3-17 - Tri-Cong Pham, Mickaël Coustaty, Aurélie Joseph, Vincent Poulain D'Andecy, Muriel Visani, Nicolas Sidere:
Incremental Learning and Ambiguity Rejection for Document Classification. 18-35 - Danlu Chen, Nan Jiang, Taylor Berg-Kirkpatrick:
EEBO-Verse: Sifting for Poetry in Large Early Modern Corpora Using Visual Features. 36-52 - Jilin Wang, Michael Krumdick, Baojia Tong, Hamima Halim, Maxim Sokolov, Vadym Barda, Delphine Vendryes, Chris Tanner:
A Graphical Approach to Document Layout Analysis. 53-69 - Soumi Das, Palaiahnakote Shivakumara, Umapada Pal, Raghavendra Ramachandra:
Gaussian Kernels Based Network for Multiple License Plate Number Detection in Day-Night Images. 70-87 - Mathieu Francois, Véronique Eglin:
Ensuring an Error-Free Transcription on a Full Engineering Tags Dataset Through Unsupervised Post-OCR Methods. 88-103 - Mirjam Cuper, Corine van Dongen, Tineke Koster:
Unraveling Confidence: Examining Confidence Scores as Proxy for OCR Quality. 104-120 - Joseph Attieh, Abraham Woubie Zewoudie, Vladimir Vlassov, Adrian Flanagan, Tom Bäckström:
Optimizing the Performance of Text Classification Models by Improving the Isotropy of the Embeddings Using a Joint Loss Function. 121-136 - Jiakun Tian, Gang Zhou, Yangxin Liu, En Deng, Zhenhong Jia:
FTDNet: Joint Semantic Learning for Scene Text Detection in Adverse Weather Conditions. 137-154 - Mohamed Dhouib, Ghassen Bettaieb, Aymen Shabou:
DocParser: End-to-end OCR-Free Information Extraction from Visually Rich Documents. 155-172 - Qi Song, Qianyi Jiang, Lei Wang, Lingling Zhao, Rui Zhang:
MUGS: A Multiple Granularity Semi-supervised Method for Text Recognition. 173-188 - Zhuoyao Zhong, Jiawei Wang, Haiqing Sun, Kai Hu, Erhan Zhang, Lei Sun, Qiang Huo:
A Hybrid Approach to Document Layout Analysis for Heterogeneous Document Images. 189-206 - Saifullah Saifullah, Stefan Agne, Andreas Dengel, Sheraz Ahmed:
ColDBin: Cold Diffusion for Document Image Binarization. 207-226 - William A. P. Smith, Toby Pillatt:
You Only Look for a Symbol Once: An Object Detector for Symbols and Regions in Documents. 227-243 - Junyi Zhang, Chang Liu, Chun Yang:
SAN: Structure-Aware Network for Complex and Long-Tailed Chinese Text Recognition. 244-258 - Venkatapathy Subramanian, Sagar Poudel, Parag Chaudhuri, Ganesh Ramakrishnan:
TACTFUL: A Framework for Targeted Active Learning for Document Analysis. 259-273 - Song-Lu Chen, Qi Liu, Feng Chen, Xu-Cheng Yin:
End-to-End Multi-line License Plate Recognition with Cascaded Perception. 274-289 - Timothée Fronteau, Arnaud Paran, Aymen Shabou:
Evaluating Adversarial Robustness on Document Image Classification. 290-304 - Abdur Rahman, Arjun Ghosh, Chetan Arora:
UTRNet: High-Resolution Urdu Text Recognition in Printed Documents. 305-324 - Najoua Rahal, Lars Vögtlin, Rolf Ingold:
Layout Analysis of Historical Document Images Using a Light Fully Convolutional Network. 325-341 - Mathias Seuret, Janne van der Loop, Nikolaus Weichselbaumer, Martin Mayr, Janina Molnar, Tatjana Hass, Vincent Christlein:
Combining OCR Models for Reading Early Modern Books. 342-357 - Gerasimos Matidis, Basilis Gatos, Anastasios L. Kesidis, Panagiotis Kaddas:
Detecting Text on Historical Maps by Selecting Best Candidates of Deep Neural Networks Output. 358-367
Posters: Graphics
- Brandon Smock, Rohith Pesala, Robin Abraham:
Aligning Benchmark Datasets for Table Structure Recognition. 371-386 - Jay Lal, Aditya Mitkari, Mahesh Bhosale, David S. Doermann:
LineFormer: Line Chart Data Extraction Using Instance Segmentation. 387-400 - Ayush Kumar Shah, Richard Zanibbi:
Line-of-Sight with Graph Attention Parser (LGAP) for Math Formulas. 401-419 - Muhammad Umer, Muhammad Ahmed Mohsin, Adnan Ul-Hasan, Faisal Shafait:
PyramidTabNet: Transformer-Based Table Recognition in Image-Based Documents. 420-437 - Omar Moured, Jiaming Zhang, Alina Roitberg, Thorsten Schwarz, Rainer Stiefelhagen:
Line Graphics Digitization: A Step Towards Full Automation. 438-453 - Philippe Bernet, Joseph Chazalon, Edwin Carlinet, Alexandre Bourquelot, Élodie Puybareau:
Linear Object Detection in Document Images Using Multiple Object Tracking. 454-471 - Youngmin Baek, Daehyun Nam, Jaeheung Surh, Seung Shin, Seonghyeon Kim:
TRACE: Table Reconstruction Aligned to Corner and Edges. 472-489 - Yusuke Nagata, Brian Kenji Iwana, Seiichi Uchida:
Contour Completion by Transformers and Its Application to Vector Font Data. 490-504 - Shreya Shukla, Prajwal Gatti, Yogesh Kumar, Vikash Yadav, Anand Mishra:
Towards Making Flowchart Images Machine Interpretable. 505-521 - Nam Quan Nguyen, Anh Duy Le, Anh Khoa Lu, Xuan Toan Mai, Tuan Anh Tran:
Formerge: Recover Spanning Cells in Complex Table Structure Using Transformer Network. 522-534 - Brandon Smock, Rohith Pesala, Robin Abraham:
GriTS: Grid Table Similarity Metric for Table Structure Recognition. 535-549
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.