default search action
16th ICDAR 2021: Lausanne, Switzerland - Part I
- Josep Lladós, Daniel Lopresti, Seiichi Uchida:
16th International Conference on Document Analysis and Recognition, ICDAR 2021, Lausanne, Switzerland, September 5-10, 2021, Proceedings, Part I. Lecture Notes in Computer Science 12821, Springer 2021, ISBN 978-3-030-86548-1
Historical Document Analsyis 1
- Abhishek Trivedi, Ravi Kiran Sarvadevabhatla:
BoundaryNet: An Attentive Deep Network with Fast Marching Distance Maps for Semi-automatic Layout Annotation. 3-18 - Anuj Rai, Narayanan C. Krishnan, Sukalpa Chanda:
Pho(SC)Net: An Approach Towards Zero-Shot Word Image Recognition in Historical Documents. 19-33 - Wassim Swaileh, Dimitrios Kotzinos, Suman Ghosh, Michel Jordan, Ngoc-Son Vu, Yaguan Qian:
Versailles-FP Dataset: Wall Detection in Ancient Floor Plans. 34-49 - Fabian Wolf, Andreas Fischer, Gernot A. Fink:
Graph Convolutional Neural Networks for Learning Attribute Representations for Word Spotting. 50-64 - Kai Brandenbusch, Eugen Rusakov, Gernot A. Fink:
Context Aware Generation of Cuneiform Signs. 65-79 - Xiao-Hui Li, Fei Yin, Xu-Yao Zhang, Cheng-Lin Liu:
Adaptive Scaling for Archival Table Structure Recognition. 80-95
Document Analysis Systems
- Liang Qiao, Zaisheng Li, Zhanzhan Cheng, Peng Zhang, Shiliang Pu, Yi Niu, Wenqi Ren, Wenming Tan, Fei Wu:
LGPMA: Complicated Table Structure Recognition with Local and Global Pyramid Mask Alignment. 99-114 - Peng Zhang, Can Li, Liang Qiao, Zhanzhan Cheng, Shiliang Pu, Yi Niu, Fei Wu:
VSR: A Unified Framework for Document Layout Analysis Combining Vision, Semantics and Relations. 115-130 - Zejiang Shen, Ruochen Zhang, Melissa Dell, Benjamin Charles Germain Lee, Jacob Carlson, Weining Li:
LayoutParser: A Unified Toolkit for Deep Learning Based Document Image Analysis. 131-146 - Shoaib Ahmed Siddiqui, Andreas Dengel, Sheraz Ahmed:
Understanding and Mitigating the Impact of Model Compression for Document Image Classification. 147-159 - Korlan Rysbayeva, Romain Giot, Nicholas Journet:
Hierarchical and Multimodal Classification of Images from Soil Remediation Reports. 160-175 - Daniel Lopresti, George Nagy:
Competition and Collaboration in Document Analysis and Recognition. 176-187
Handwriting Recognition
- Nam Tuan Ly, Hung Tuan Nguyen, Masaki Nakagawa:
2D Self-attention Convolutional Recurrent Network for Offline Handwritten Text Recognition. 191-204 - Likun Gao, Heng Zhang, Cheng-Lin Liu:
Handwritten Text Recognition with Convolutional Prototype Network and Most Aligned Frame Based CTC Training. 205-220 - William Mocaër, Éric Anquetil, Richard Kulpa:
Online Spatio-temporal 3D Convolutional Neural Network for Early Recognition of Handwritten Gestures. 221-236 - Jing Li, Qiu-Feng Wang, Rui Zhang, Kaizhu Huang:
Mix-Up Augmentation for Oracle Character Recognition with Imbalanced Data Distribution. 237-251 - Mobai Xue, Jun Du, Jianshu Zhang, Zi-Rui Wang, Bin Wang, Bo Ren:
Radical Composition Network for Chinese Character Generation. 252-267 - Alexander Mattick, Martin Mayr, Mathias Seuret, Andreas Maier, Vincent Christlein:
SmartPatch: Improving Handwritten Word Imitation with Patch Discriminators. 268-283
Scene Text Detection and Recognition
- Hui Jiang, Yunlu Xu, Zhanzhan Cheng, Shiliang Pu, Yi Niu, Wenqi Ren, Fei Wu, Wenming Tan:
Reciprocal Feature Learning via Explicit and Implicit Tasks in Scene Text Recognition. 287-303 - Deyang Wu, Xingfei Hu, Zhaozhi Xie, Haiyan Li, Usman Ali, Hongtao Lu:
Text Detection by Jointly Learning Character and Word Regions. 304-318 - Rowel Atienza:
Vision Transformer for Fast and Efficient Scene Text Recognition. 319-334 - Soumya Jahagirdar, Shankar Gangisetty, Anand Mishra:
Look, Read and Ask: Learning to Ask Questions by Reading Text in Images. 335-349 - Ziyin Zhang, Lemeng Pan, Lin Du, Qingrui Li, Ning Lu:
CATNet: Scene Text Recognition Guided by Concatenating Augmented Text Features. 350-365 - Lei Li, Chun Yuan, Kai Fan:
Explore Hierarchical Relations Reasoning and Global Information Aggregation. 366-381
Historical Document Analysis 2
- Christoph Wick, Christian Reul:
One-Model Ensemble-Learning for Text Recognition of Historical Printings. 385-399 - Tien-Nam Nguyen, Jean-Christophe Burie, Thi-Lan Le, Anne-Valérie Schweyer:
On the Use of Attention in Deep Learning Based Denoising Method for Ancient Cham Inscription Images. 400-415 - Brian L. Davis, Bryan S. Morse, Brian L. Price, Chris Tensmeyer, Curtis Wigington:
Visual FUDGE: Form Understanding via Dynamic Graph Editing. 416-431 - Anna Scius-Bertrand, Michael Jungo, Beat Wolf, Andreas Fischer, Marc Bui:
Annotation-Free Character Detection in Historical Vietnamese Stele Images. 432-447
Document Image Processing
- Shachar Klaiman, Marius Lehne:
DocReader: Bounding-Box Free Training of a Document Information Extraction Model. 451-465 - Guo-Wang Xie, Fei Yin, Xu-Yao Zhang, Cheng-Lin Liu:
Document Dewarping with Control Points. 466-480 - Ayantha Randika, Nilanjan Ray, Xiao Xiao, Allegra Latimer:
Unknown-Box Approximation to Improve Optical Character Recognition Performance. 481-496 - Meng Ling, Jian Chen, Torsten Möller, Petra Isenberg, Tobias Isenberg, Michael Sedlmair, Robert S. Laramee, Han-Wei Shen, Jian Wu, C. Lee Giles:
Document Domain Randomization for Deep Learning Document Layout Extraction. 497-513
NLP for Document Understanding
- Minghui Wang, Ping Xue, Ying Li, Zhonghai Wu:
Distilling the Documents for Relation Extraction by Topic Segmentation. 517-531 - Lukasz Garncarek, Rafal Powalski, Tomasz Stanislawek, Bartosz Topolski, Piotr Halama, Michal Turski, Filip Gralinski:
LAMBERT: Layout-Aware Language Modeling for Information Extraction. 532-547 - Weihong Lin, Qifang Gao, Lei Sun, Zhuoyao Zhong, Kai Hu, Qin Ren, Qiang Huo:
ViBERTgrid: A Jointly Trained Multi-modal 2D Document Representation for Key Information Extraction from Documents. 548-563 - Tomasz Stanislawek, Filip Gralinski, Anna Wróblewska, Dawid Lipinski, Agnieszka Kaliska, Paulina Rosalska, Bartosz Topolski, Przemyslaw Biecek:
Kleister: Key Information Extraction Datasets Involving Long Documents with Complex Layouts. 564-579
Graphics, Diagram, and Math Recognition
- Weihong Ma, Hesuo Zhang, Shuang Yan, Guangshun Yao, Yichao Huang, Hui Li, Yaqiang Wu, Lianwen Jin:
Towards an Efficient Framework for Data Extraction from Chart Images. 583-597 - Zhuoying Wang, Qingkai Fang, Yongtao Wang:
Geometric Object 3D Reconstruction from Single Line Drawing Image Based on a Network for Classification and Sketch Extraction. 598-613 - Bernhard Schäfer, Heiner Stuckenschmidt:
DiagramNet: Hand-Drawn Diagram Recognition Using Visual Arrow-Relation Detection. 614-630 - Ke Yuan, Liangcai Gao, Zhuoren Jiang, Zhi Tang:
Formula Citation Graph Based Mathematical Information Retrieval. 631-647
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.