


17th ICDAR 2023: San José, CA, USA - Part V
- Gernot A. Fink, Rajiv Jain, Koichi Kise, Richard Zanibbi:
Document Analysis and Recognition - ICDAR 2023 - 17th International Conference, San José, CA, USA, August 21-26, 2023, Proceedings, Part V. Lecture Notes in Computer Science 14191, Springer 2023, ISBN 978-3-031-41733-7
Posters: Text and Document Recognition
- Francesc Net, Marc Folia, Pep Casals, Lluís Gómez:
Transductive Learning for Near-Duplicate Image Detection in Scanned Photo Collections. 3-17
- Tri-Cong Pham, Mickaël Coustaty, Aurélie Joseph, Vincent Poulain D'Andecy, Muriel Visani, Nicolas Sidere:
Incremental Learning and Ambiguity Rejection for Document Classification. 18-35
- Danlu Chen, Nan Jiang, Taylor Berg-Kirkpatrick:
EEBO-Verse: Sifting for Poetry in Large Early Modern Corpora Using Visual Features. 36-52
- Jilin Wang, Michael Krumdick, Baojia Tong, Hamima Halim, Maxim Sokolov, Vadym Barda, Delphine Vendryes, Chris Tanner:
A Graphical Approach to Document Layout Analysis. 53-69
- Soumi Das, Palaiahnakote Shivakumara, Umapada Pal, Raghavendra Ramachandra:
Gaussian Kernels Based Network for Multiple License Plate Number Detection in Day-Night Images. 70-87
- Mathieu Francois, Véronique Eglin:
Ensuring an Error-Free Transcription on a Full Engineering Tags Dataset Through Unsupervised Post-OCR Methods. 88-103
- Mirjam Cuper, Corine van Dongen, Tineke Koster:
Unraveling Confidence: Examining Confidence Scores as Proxy for OCR Quality. 104-120
- Joseph Attieh, Abraham Woubie Zewoudie, Vladimir Vlassov, Adrian Flanagan, Tom Bäckström:
Optimizing the Performance of Text Classification Models by Improving the Isotropy of the Embeddings Using a Joint Loss Function. 121-136
- Jiakun Tian, Gang Zhou, Yangxin Liu, En Deng, Zhenhong Jia:
FTDNet: Joint Semantic Learning for Scene Text Detection in Adverse Weather Conditions. 137-154
- Mohamed Dhouib, Ghassen Bettaieb, Aymen Shabou:
DocParser: End-to-end OCR-Free Information Extraction from Visually Rich Documents. 155-172
- Qi Song, Qianyi Jiang, Lei Wang, Lingling Zhao, Rui Zhang:
MUGS: A Multiple Granularity Semi-supervised Method for Text Recognition. 173-188
- Zhuoyao Zhong, Jiawei Wang, Haiqing Sun, Kai Hu, Erhan Zhang, Lei Sun, Qiang Huo:
A Hybrid Approach to Document Layout Analysis for Heterogeneous Document Images. 189-206
- Saifullah Saifullah, Stefan Agne, Andreas Dengel, Sheraz Ahmed:
ColDBin: Cold Diffusion for Document Image Binarization. 207-226
- William A. P. Smith, Toby Pillatt:
You Only Look for a Symbol Once: An Object Detector for Symbols and Regions in Documents. 227-243
- Junyi Zhang, Chang Liu, Chun Yang:
SAN: Structure-Aware Network for Complex and Long-Tailed Chinese Text Recognition. 244-258
- Venkatapathy Subramanian, Sagar Poudel, Parag Chaudhuri, Ganesh Ramakrishnan:
TACTFUL: A Framework for Targeted Active Learning for Document Analysis. 259-273
- Song-Lu Chen, Qi Liu, Feng Chen, Xu-Cheng Yin:
End-to-End Multi-line License Plate Recognition with Cascaded Perception. 274-289
- Timothée Fronteau, Arnaud Paran, Aymen Shabou:
Evaluating Adversarial Robustness on Document Image Classification. 290-304
- Abdur Rahman, Arjun Ghosh, Chetan Arora:
UTRNet: High-Resolution Urdu Text Recognition in Printed Documents. 305-324
- Najoua Rahal, Lars Vögtlin, Rolf Ingold:
Layout Analysis of Historical Document Images Using a Light Fully Convolutional Network. 325-341
- Mathias Seuret, Janne van der Loop, Nikolaus Weichselbaumer, Martin Mayr, Janina Molnar, Tatjana Hass, Vincent Christlein:
Combining OCR Models for Reading Early Modern Books. 342-357
- Gerasimos Matidis, Basilis Gatos, Anastasios L. Kesidis, Panagiotis Kaddas:
Detecting Text on Historical Maps by Selecting Best Candidates of Deep Neural Networks Output. 358-367
Posters: Graphics
- Brandon Smock, Rohith Pesala, Robin Abraham:
Aligning Benchmark Datasets for Table Structure Recognition. 371-386
- Jay Lal, Aditya Mitkari, Mahesh Bhosale, David S. Doermann:
LineFormer: Line Chart Data Extraction Using Instance Segmentation. 387-400
- Ayush Kumar Shah, Richard Zanibbi:
Line-of-Sight with Graph Attention Parser (LGAP) for Math Formulas. 401-419
- Muhammad Umer, Muhammad Ahmed Mohsin, Adnan Ul-Hasan, Faisal Shafait:
PyramidTabNet: Transformer-Based Table Recognition in Image-Based Documents. 420-437
- Omar Moured, Jiaming Zhang, Alina Roitberg, Thorsten Schwarz, Rainer Stiefelhagen:
Line Graphics Digitization: A Step Towards Full Automation. 438-453
- Philippe Bernet, Joseph Chazalon, Edwin Carlinet, Alexandre Bourquelot, Élodie Puybareau:
Linear Object Detection in Document Images Using Multiple Object Tracking. 454-471
- Youngmin Baek, Daehyun Nam, Jaeheung Surh, Seung Shin, Seonghyeon Kim:
TRACE: Table Reconstruction Aligned to Corner and Edges. 472-489
- Yusuke Nagata, Brian Kenji Iwana, Seiichi Uchida:
Contour Completion by Transformers and Its Application to Vector Font Data. 490-504
- Shreya Shukla, Prajwal Gatti, Yogesh Kumar, Vikash Yadav, Anand Mishra:
Towards Making Flowchart Images Machine Interpretable. 505-521
- Nam Quan Nguyen, Anh Duy Le, Anh Khoa Lu, Xuan Toan Mai, Tuan Anh Tran:
Formerge: Recover Spanning Cells in Complex Table Structure Using Transformer Network. 522-534
- Brandon Smock, Rohith Pesala, Robin Abraham:
GriTS: Grid Table Similarity Metric for Table Structure Recognition. 535-549
