


16th ICDAR 2021: Lausanne, Switzerland - Part II
- Josep Lladós, Daniel Lopresti, Seiichi Uchida:
16th International Conference on Document Analysis and Recognition, ICDAR 2021, Lausanne, Switzerland, September 5-10, 2021, Proceedings, Part II. Lecture Notes in Computer Science 12822, Springer 2021, ISBN 978-3-030-86330-2
Document Analysis for Literature Search
- Rongyu Cao, Hongwei Li, Ganbin Zhou, Ping Luo:
Towards Document Panoptic Segmentation with Pinpoint Accuracy: Method and Evaluation. 3-18
- Ayush Kumar Shah, Abhisek Dey, Richard Zanibbi:
A Math Formula Extraction and Evaluation Framework for PDF Documents. 19-34
- Laura E. Brandt, William T. Freeman:
Toward Automatic Interpretation of 3D Plots. 35-50
Document Summarization and Translation
- Marta Esther Vicente, Robiert Sepúlveda-Torres, Cristina Barros, Estela Saquete, Elena Lloret:
Can Text Summarization Enhance the Headline Stance Detection Task? Benefits and Drawbacks. 53-67
- Justin Wood, Wei Wang, Corey W. Arnold:
The Biased Coin Flip Process for Nonparametric Topic Modeling. 68-83
- Sayali Kulkarni, Sheide Chammas, Wan Zhu, Fei Sha, Eugene Ie:
CoMSum and SIBERT: A Dataset and Neural Model for Query-Based Multi-document Summarization. 84-98
- Tonghua Su, Shuchen Liu, Shengjie Zhou:
RTNet: An End-to-End Method for Handwritten Text Image Translation. 99-113
Multimedia Document Analysis
- Ziyi Zhu, Liangcai Gao, Yibo Li, Yilun Huang, Lin Du, Ning Lu, Xianfeng Wang:
NTable: A Dataset for Camera-Based Table Detection. 117-129
- Tianqi Ji, Jun Li, Jianhua Xu:
Label Selection Algorithm Based on Boolean Interpolative Decomposition with Sequential Backward Selection for Multi-label Classification. 130-144
- Quang Huy Ung, Cuong Tuan Nguyen, Hung Tuan Nguyen, Masaki Nakagawa:
GSSF: A Generative Sequence Similarity Function Based on a Seq2Seq Model for Clustering Online Handwritten Mathematical Answers. 145-159
- Vaibhavi Gupta, Vinay Detani, Vivek Khokar, Chiranjoy Chattopadhyay:
C2VNet: A Deep Learning Framework Towards Comic Strip to Audio-Visual Scene Synthesis. 160-175
- Jie He, Xingjiao Wu, Wenxin Hu, Jing Yang:
LSTMVAEF: Vivid Layout via LSTM-Based Variational Autoencoder Framework. 176-189
Mobile Text Recognition
- Andrii Grygoriev, Illya Degtyarenko, Ivan Deriuga, Serhii Polotskyi, Volodymyr Melnyk, Dmytro Zakharchuk, Olga Radyvonenko:
HCRNN: A Novel Architecture for Fast Online Handwritten Stroke Classification. 193-208
- Daniil Matalov, Elena Limonova, Natalya Skoryukina, Vladimir V. Arlazarov:
RFDoc: Memory Efficient Local Descriptors for ID Documents Localization and Classification. 209-224
- Haibo Qin, Chun Yang, Xiaobin Zhu, Xu-Cheng Yin:
Dynamic Receptive Field Adaptation for Attention-Based Text Recognition. 225-239
- Ryota Yoshihashi, Tomohiro Tanaka, Kenji Doi, Takumi Fujino, Naoaki Yamashita:
Context-Free TextSpotter for Real-Time and Mobile End-to-End Text Detection and Recognition. 240-257
- Yulia S. Chernyshova, Ekaterina Emelianova, Alexander Sheshkus, Vladimir V. Arlazarov:
MIDV-LAIT: A Challenging Dataset for Recognition of IDs with Perso-Arabic, Thai, and Indian Scripts. 258-272
- Konstantin B. Bulatov, Vladimir V. Arlazarov:
Determining Optimal Frame Processing Strategies for Real-Time Document Recognition Systems. 273-288
Document Analysis for Social Good
- Eugen Rusakov, Turna Somel, Gerfrid G. W. Müller, Gernot A. Fink:
Embedded Attributes for Cuneiform Sign Spotting. 291-305
- Adrià Molina, Pau Riba, Lluís Gómez, Oriol Ramos Terrades, Josep Lladós:
Date Estimation in the Wild of Scanned Historical Photos: An Image Retrieval Approach. 306-320
- Muhammad Osama Zeeshan, Imran Siddiqi, Momina Moetesum:
Two-Step Fine-Tuned Convolutional Neural Networks for Multi-label Classification of Children's Drawings. 321-334
- Tamal Chowdhury, Palaiahnakote Shivakumara, Umapada Pal, Tong Lu, Ramachandra Raghavendra, Sukalpa Chanda:
DCINN: Deformable Convolution and Inception Based Neural Network for Tattoo Text Detection Through Skin Region. 335-350
- Fatma Najar, Nizar Bouguila:
Sparse Document Analysis Using Beta-Liouville Naive Bayes with Vocabulary Knowledge. 351-363
- Sk Md Obaidullah, Mridul Ghosh, Himadri Mukherjee, Kaushik Roy, Umapada Pal:
Automatic Signature-Based Writer Identification in Mixed-Script Scenarios. 364-377
Indexing and Retrieval of Documents
- Pau Riba, Adrià Molina, Lluís Gómez, Oriol Ramos Terrades, Josep Lladós:
Learning to Rank Words: Optimizing Ranking Metrics for Word Spotting. 381-395
- Trung Tan Ngo, Hung Tuan Nguyen, Masaki Nakagawa:
A-VLAD: An End-to-End Attention-Based Neural Network for Writer Identification in Historical Documents. 396-409
- Nhu-Van Nguyen, Christophe Rigaud, Arnaud Revel, Jean-Christophe Burie:
Manga-MMTL: Multimodal Multitask Transfer Learning for Manga Character Analysis. 410-425
- Enrique Vidal, Alejandro H. Toselli:
Probabilistic Indexing and Search for Hyphenated Words. 426-442
Physical and Logical Layout Analysis
- Sieben Bocklandt, Gust Verbruggen, Thomas Winters:
SandSlide: Automatic Slideshow Normalization. 445-461
- Alejandro H. Toselli, Si Wu, David A. Smith:
Digital Editions as Distant Supervision for Layout Analysis of Printed Books. 462-476
- Prema Satish Sharan, Sowmya Aitha, Amandeep Kumar, Abhishek Trivedi, Aaron Augustine, Ravi Kiran Sarvadevabhatla:
Palmira: A Deep Deformable Network for Instance Segmentation of Dense and Uneven Layouts in Handwritten Manuscripts. 477-491
- Oldrich Kodym, Michal Hradis:
Page Layout Analysis System for Unconstrained Historic Documents. 492-506
- José Ramón Prieto, Enrique Vidal:
Improved Graph Methods for Table Layout Understanding. 507-522
- Berat Kurar Barakat, Ahmad Droby, Raid Saabni, Jihad El-Sana:
Unsupervised Learning of Text Line Segmentation by Differentiating Coarse Patterns. 523-537
Recognition of Tables and Formulas
- Yibo Li, Yilun Huang, Ziyi Zhu, Lemeng Pan, Yongshuai Huang, Lin Du, Zhi Tang, Liangcai Gao:
Rethinking Table Structure Recognition Using Sequence Labeling Methods. 541-553
- Harsh Desai, Pratik Kayal, Mayank Singh:
TabLeX: A Benchmark Dataset for Structure and Content Information Extraction from Scientific Tables. 554-569
- Wenqi Zhao, Liangcai Gao, Zuoyu Yan, Shuai Peng, Lin Du, Ziyin Zhang:
Handwritten Mathematical Expression Recognition with Bidirectionally Trained Transformer. 570-584
- Umar Khan, Sohaib Zahid, Muhammad Asad Ali, Adnan Ul-Hasan, Faisal Shafait:
TabAug: Data Driven Augmentation for Enhanced Table Structure Recognition. 585-601
- Haisong Ding, Kai Chen, Qiang Huo:
An Encoder-Decoder Approach to Handwritten Mathematical Expression Recognition with Multi-head Attention and Stacked Decoder. 602-616
- Cuong Tuan Nguyen, Thanh-Nghia Truong, Hung Tuan Nguyen, Masaki Nakagawa:
Global Context for Improving Recognition of Online Handwritten Mathematical Expressions. 617-631
- Koji Ichikawa:
Image-Based Relation Classification Approach for Table Structure Recognition. 632-647
- Shuai Peng, Liangcai Gao, Ke Yuan, Zhi Tang:
Image to LaTeX with Graph Neural Network for Mathematical Formula Recognition. 648-663
NLP for Document Understanding
- Badal Agrawal, Mohit Mishra, Varun Parashar:
A Novel Method for Automated Suggestion of Similar Software Incidents Using 2-Stage Filtering: Findings on Primary Data. 667-682
- Lianxi Wang, Xiaotian Lin, Nankai Lin:
Research on Pseudo-label Technology for Multi-label News Classification. 683-698
- Ahmed Hamdi, Elodie Carel, Aurélie Joseph, Mickaël Coustaty, Antoine Doucet:
Information Extraction from Invoices. 699-714
- Apoorva Singh, Sriparna Saha:
Are You Really Complaining? A Multi-task Framework for Complaint Identification, Emotion, and Sentiment Classification. 715-731
- Rafal Powalski, Lukasz Borchmann, Dawid Jurkiewicz, Tomasz Dwojak, Michal Pietruszka, Gabriela Palka:
Going Full-TILT Boogie on Document Understanding with Text-Image-Layout Transformer. 732-747
- Luisa März, Stefan Schweter, Nina Pörner, Benjamin Roth, Hinrich Schütze:
Data Centric Domain Adaptation for Historical Text with OCR Errors. 748-761
- Nafaa Haffar, Rami Ayadi, Emna Hkiri, Mounir Zrigui:
Temporal Ordering of Events via Deep Neural Networks. 762-777
- Rubèn Tito, Dimosthenis Karatzas, Ernest Valveny:
Document Collection Visual Question Answering. 778-792
- Jirí Martínek, Pavel Král, Ladislav Lenc:
Dialogue Act Recognition Using Visual Information. 793-807
- Oliver Tüselmann, Fabian Wolf, Gernot A. Fink:
Are End-to-End Systems Really Necessary for NER on Handwritten Document Images? 808-822
- Harsh Kohli:
Training Bi-Encoders for Word Sense Disambiguation. 823-837
- Freddy C. Chua, Nigel P. Duffy:
DeepCPCFG: Deep Learning and Context Free Grammars for End-to-End Information Extraction. 838-853
- Djedjiga Belhadj, Yolande Belaïd, Abdel Belaïd:
Consideration of the Word's Neighborhood in GATs for Information Extraction in Semi-structured Documents. 854-869
