


18th ICDAR 2024: Athens, Greece - Part VI
- Elisa H. Barney Smith, Marcus Liwicki, Liangrui Peng:
Document Analysis and Recognition - ICDAR 2024 - 18th International Conference, Athens, Greece, August 30 - September 4, 2024, Proceedings, Part VI. Lecture Notes in Computer Science 14809, Springer 2024, ISBN 978-3-031-70551-9
Music Recognition
- Adrian Rosello, Eliseo Fuentes-Martínez, María Alfaro-Contreras, David Rizo, Jorge Calvo-Zaragoza:
Source-Free Domain Adaptation for Optical Music Recognition. 3-19
- Antonio Ríos-Vila, Jorge Calvo-Zaragoza, Thierry Paquet:
Sheet Music Transformer: End-To-End Optical Music Recognition Beyond Monophonic Transcription. 20-37
- Tristan Repolusk, Eduardo E. Veas:
The KuiSCIMA Dataset for Optical Music Recognition of Ancient Chinese Suzipu Notation. 38-54
- Jirí Mayer, Milan Straka, Jan Hajic, Pavel Pecina:
Practical End-to-End Optical Music Recognition for Pianoform Music. 55-73
Visual Question Answering and Comics
- Kai Hu, Jiawei Wang, Weihong Lin, Zhuoyao Zhong, Lei Sun, Qiang Huo:
UniVIE: A Unified Label Space Approach to Visual Information Extraction from Form-Like Documents. 77-96
- Chao Liu, Jie Yang, Wanqing Li:
Extractive Question Answering with Contrastive Puzzles and Reweighted Clues. 97-112
- Ibrahim Souleiman Mahamoud, Mickaël Coustaty, Aurélie Joseph, Vincent Poulain D'Andecy, Jean-Marc Ogier:
CHIC: Corporate Document for Visual Question Answering. 113-127
- Emanuele Vivoli, Joan Lafuente Baeza, Ernest Valveny Llobet, Dimosthenis Karatzas:
Multimodal Transformer for Comics Text-Cloze. 128-145
- Khanh Nguyen, Dimosthenis Karatzas:
Federated Document Visual Question Answering: A Pilot Study. 146-163
Visual Question Answering and LLMs
- Solène Tarride, Christopher Kermorvant:
Revisiting N-Gram Models: Their Impact in Modern Neural Networks for Handwritten Text Recognition. 167-182
- Wangli Yang, Jie Yang, Wanqing Li, Yi Guo:
ConClue: Conditional Clue Extraction for Multiple Choice Question Answering. 183-198
- Rubèn Tito, Khanh Nguyen, Marlon Tobaben, Raouf Kerkouche, Mohamed Ali Souibgui, Kangsoo Jung, Joonas Jälkö, Vincent Poulain D'Andecy, Aurélie Joseph, Lei Kang, Ernest Valveny, Antti Honkela, Mario Fritz, Dimosthenis Karatzas:
Privacy-Aware Document Visual Question Answering. 199-218
- Lei Kang, Rubèn Tito, Ernest Valveny, Dimosthenis Karatzas:
Multi-page Document Visual Question Answering Using Self-attention Scoring Mechanism. 219-232
- Tianqing Zhang, Alimjan Aysa, Li Zhao, Kurban Ubul, Enguang Zuo:
Improving Retrieval-Based Dialogue Systems: Fine-Grained Post-training Prompt Adaptation and Pairwise Optimization Fine-Tuning Strategy. 233-247
- Hamza Gbada, Karim Kalti, Mohamed Ali Mahjoub:
Information Extraction from Visually Rich Documents Using Directed Weighted Graph Neural Network. 248-263
- Raghvendra Kumar, Deepak Prakash, Sriparna Saha, Shubham Sharma:
IndicBART Alongside Visual Element: Multimodal Summarization in Diverse Indian Languages. 264-280
- Shuo Zhang, Biao Yang, Zhang Li, Zhiyin Ma, Yuliang Liu, Xiang Bai:
Exploring the Capabilities of Large Multimodal Models on Dense Text. 281-298
Competitions
- Xudong Xie, Linger Deng, Zhifei Zhang, Zhaowen Wang, Yuliang Liu:
ICDAR 2024 Competition on Artistic Text Recognition. 301-314
- Silvia Zottin, Axel De Nardin, Gian Luca Foresti, Emanuela Colombi, Claudio Piciarelli:
ICDAR 2024 Competition on Few-Shot and Many-Shot Layout Segmentation of Ancient Manuscripts (SAM). 315-331
- Alicia Fornés, Jialuo Chen, Pau Torras, Carles Badal, Beáta Megyesi, Michelle Waldispühl, Nils Kopal, George Lasry:
ICDAR 2024 Competition on Handwriting Recognition of Historical Ciphers. 332-344
- Arthur Flor de Sousa Neto, Byron L. D. Bezerra, Sávio S. Araújo, Wiliane M. A. S. Souza, Kléberson F. Alves, Macileide F. Oliveira, Samara V. S. Lins, Hugo J. F. Hazin, Pedro H. V. Rocha, Alejandro H. Toselli:
ICDAR 2024 Competition on Handwritten Text Recognition in Brazilian Essays - BRESSAY. 345-362
- Zekun Li, Yijun Lin, Yao-Yi Chiang, Jerod Weinman, Solenn Tual, Joseph Chazalon, Julien Perret, Bertrand Duménieu, Nathalie Abadie:
ICDAR 2024 Competition on Historical Map Text Detection, Recognition, and Linking. 363-380
- Janne van der Loop, Florian Kordon, Martin Mayr, Vincent Christlein, Fei Wu, Dalia Rodríguez-Salas, Nikolaus Weichselbaumer, Mathias Seuret:
ICDAR 2024 Competition on Multi Font Group Recognition and OCR. 381-396
- Mingjun Chen, Hao Wu, Qikai Chang, Hanbo Cheng, Jiefeng Ma, Pengfei Hu, Zhenrong Zhang, Chenyu Liu, Changpeng Pi, Jinshui Hu, Baocai Yin, Bing Yin, Cong Liu, Jun Du:
ICDAR 2024 Competition on Recognition of Chemical Structures. 397-409
- Soumya Jahagirdar, Ajoy Mondal, Yuheng (Carl) Ren, Omkar M. Parkhi, C. V. Jawahar:
ICDAR 2024 Competition on Reading Documents Through Aria Glasses. 410-425
- Ajoy Mondal, Vijay Mahadevan, R. Manmatha, C. V. Jawahar:
ICDAR 2024 Competition on Recognition and VQA on Handwritten Documents. 426-442
