Dimosthenis Karatzas
Person information
- affiliation: Universitat Autònoma de Barcelona, Spain
2020 – today
- 2024
- [c124]Qi Dong, Lei Kang, Dimosthenis Karatzas:
Multi-page Document VQA with Recurrent Memory Transformer. DAS 2024: 57-70 - [c123]Artemis Llabrés, Arka Ujjal Dey, Dimosthenis Karatzas, Ernest Valveny:
Image-Text Matching for Large-Scale Book Collections. DAS 2024: 89-102 - [c122]Lei Kang, Fei Yang, Kai Wang, Mohamed Ali Souibgui, Lluís Gómez, Alicia Fornés, Ernest Valveny, Dimosthenis Karatzas:
GRIF-DM: Generation of Rich Impression Fonts Using Diffusion Models. ECAI 2024: 226-233 - [c121]Lei Kang, Mohamed Ali Souibgui, Fei Yang, Lluís Gómez, Ernest Valveny, Dimosthenis Karatzas:
Machine Unlearning for Document Classification. ICDAR (4) 2024: 90-102 - [c120]Emanuele Vivoli, Joan Lafuente Baeza, Ernest Valveny Llobet, Dimosthenis Karatzas:
Multimodal Transformer for Comics Text-Cloze. ICDAR (6) 2024: 128-145 - [c119]Khanh Nguyen, Dimosthenis Karatzas:
Federated Document Visual Question Answering: A Pilot Study. ICDAR (6) 2024: 146-163 - [c118]Emanuele Vivoli, Irene Campaioli, Mariateresa Nardoni, Niccolo Biondi, Marco Bertini, Dimosthenis Karatzas:
Comics Datasets Framework: Mix of Comics Datasets for Detection Benchmarking. ICDAR (Workshops 1) 2024: 154-167 - [c117]Rubèn Tito, Khanh Nguyen, Marlon Tobaben, Raouf Kerkouche, Mohamed Ali Souibgui, Kangsoo Jung, Joonas Jälkö, Vincent Poulain D'Andecy, Aurélie Joseph, Lei Kang, Ernest Valveny, Antti Honkela, Mario Fritz, Dimosthenis Karatzas:
Privacy-Aware Document Visual Question Answering. ICDAR (6) 2024: 199-218 - [c116]Lei Kang, Rubèn Tito, Ernest Valveny, Dimosthenis Karatzas:
Multi-page Document Visual Question Answering Using Self-attention Scoring Mechanism. ICDAR (6) 2024: 219-232 - [c115]Jerod Weinman, Amelia Gómez Grabowska, Dimosthenis Karatzas:
Counting the Corner Cases: Revisiting Robust Reading Challenge Data Sets, Evaluation Protocols, and Metrics. ICDAR (4) 2024: 324-342 - [c114]Sergi Garcia-Bordils, Dimosthenis Karatzas, Marçal Rusiñol:
STEP - Towards Structured Scene-Text Spotting. WACV 2024: 872-881 - [i71]Emanuele Vivoli, Joan Lafuente Baeza, Ernest Valveny Llobet, Dimosthenis Karatzas:
Multimodal Transformer for Comics Text-Cloze. CoRR abs/2403.03719 (2024) - [i70]Arka Ujjal Dey, Artemis Llabrés, Ernest Valveny, Dimosthenis Karatzas:
Retrieval Augmented Verification: Unveiling Disinformation with Structured Representations for Zero-Shot Real-Time Evidence-guided Fact-Checking of Multi-modal Social media posts. CoRR abs/2404.10702 (2024) - [i69]Lei Kang, Rubèn Tito, Ernest Valveny, Dimosthenis Karatzas:
Multi-Page Document Visual Question Answering using Self-Attention Scoring Mechanism. CoRR abs/2404.19024 (2024) - [i68]Lei Kang, Mohamed Ali Souibgui, Fei Yang, Lluís Gómez, Ernest Valveny, Dimosthenis Karatzas:
Machine Unlearning for Document Classification. CoRR abs/2404.19031 (2024) - [i67]Khanh Nguyen, Dimosthenis Karatzas:
Federated Document Visual Question Answering: A Pilot Study. CoRR abs/2405.06636 (2024) - [i66]Emanuele Vivoli, Irene Campaioli, Mariateresa Nardoni, Niccolo Biondi, Marco Bertini, Dimosthenis Karatzas:
Comics Datasets Framework: Mix of Comics datasets for detection benchmarking. CoRR abs/2407.03540 (2024) - [i65]Emanuele Vivoli, Marco Bertini, Dimosthenis Karatzas:
CoMix: A Comprehensive Benchmark for Multi-Task Comic Understanding. CoRR abs/2407.03550 (2024) - [i64]Artemis Llabrés, Arka Ujjal Dey, Dimosthenis Karatzas, Ernest Valveny:
Image-text matching for large-scale book collections. CoRR abs/2407.19812 (2024) - [i63]Lei Kang, Fei Yang, Kai Wang, Mohamed Ali Souibgui, Lluís Gómez, Alicia Fornés, Ernest Valveny, Dimosthenis Karatzas:
GRIF-DM: Generation of Rich Impression Fonts using Diffusion Models. CoRR abs/2408.07259 (2024) - [i62]Emanuele Vivoli, Andrey Barsky, Mohamed Ali Souibgui, Artemis Llabrés, Marco Bertini, Dimosthenis Karatzas:
One missing piece in Vision and Language: A Survey on Comics Understanding. CoRR abs/2409.09502 (2024) - [i61]Emanuele Vivoli, Niccolo Biondi, Marco Bertini, Dimosthenis Karatzas:
ComiCap: A VLMs pipeline for dense captioning of Comic Panels. CoRR abs/2409.16159 (2024) - 2023
- [j20]Rubèn Tito, Dimosthenis Karatzas, Ernest Valveny:
Hierarchical multimodal transformers for Multipage DocVQA. Pattern Recognit. 144: 109834 (2023) - [c113]Khanh Nguyen, Ali Furkan Biten, Andrés Mafla, Lluís Gómez, Dimosthenis Karatzas:
Show, Interpret and Tell: Entity-Aware Contextualised Image Captioning in Wikipedia. AAAI 2023: 1940-1948 - [c112]Mohamed Ali Souibgui, Sanket Biswas, Andrés Mafla, Ali Furkan Biten, Alicia Fornés, Yousri Kessentini, Josep Lladós, Lluís Gómez, Dimosthenis Karatzas:
Text-DIAE: A Self-Supervised Degradation Invariant Autoencoder for Text Recognition and Document Enhancement. AAAI 2023: 2330-2338 - [c111]Stepán Simsa, Michal Uricár, Milan Sulc, Yash Patel, Ahmed Hamdi, Matej Kocián, Matyás Skalický, Jirí Matas, Antoine Doucet, Mickaël Coustaty, Dimosthenis Karatzas:
Overview of DocILE 2023: Document Information Localization and Extraction. CLEF 2023: 276-293 - [c110]Stepán Simsa, Michal Uricár, Milan Sulc, Yash Patel, Ahmed Hamdi, Matej Kocián, Matyás Skalický, Jirí Matas, Antoine Doucet, Mickaël Coustaty, Dimosthenis Karatzas:
Extended Overview of DocILE 2023: Document Information Localization and Extraction. CLEF (Working Notes) 2023: 546-571 - [c109]Soumya Jahagirdar, Minesh Mathew, Dimosthenis Karatzas, C. V. Jawahar:
Understanding Video Scenes through Text: Insights from Text-based Video Question Answering. ICCV (Workshops) 2023: 4648-4652 - [c108]Sergi Garcia-Bordils, Dimosthenis Karatzas, Marçal Rusiñol:
Accelerating Transformer-Based Scene Text Detection and Recognition via Token Pruning. ICDAR (6) 2023: 106-121 - [c107]George Tom, Minesh Mathew, Sergi Garcia-Bordils, Dimosthenis Karatzas, C. V. Jawahar:
Reading Between the Lanes: Text VideoQA on the Road. ICDAR (6) 2023: 137-154 - [c106]Stepán Simsa, Milan Sulc, Michal Uricár, Yash Patel, Ahmed Hamdi, Matej Kocián, Matyás Skalický, Jirí Matas, Antoine Doucet, Mickaël Coustaty, Dimosthenis Karatzas:
DocILE Benchmark for Document Information Localization and Extraction. ICDAR (2) 2023: 147-166 - [c105]Weijia Wu, Yuzhong Zhao, Zhuang Li, Jiahong Li, Mike Zheng Shou, Umapada Pal, Dimosthenis Karatzas, Xiang Bai:
ICDAR 2023 Competition on Video Text Reading for Dense and Small Text. ICDAR (2) 2023: 405-419 - [c104]Wenwen Yu, Mingyu Liu, Mingrui Chen, Ning Lu, Yinlong Wen, Yuliang Liu, Dimosthenis Karatzas, Xiang Bai:
ICDAR 2023 Competition on Reading the Seal Title. ICDAR (2) 2023: 522-535 - [c103]Wenwen Yu, Chengquan Zhang, Haoyu Cao, Wei Hua, Bohan Li, Huang Chen, Mingyu Liu, Mingrui Chen, Jianfeng Kuang, Mengjun Cheng, Yuning Du, Shikun Feng, Xiaoguang Hu, Pengyuan Lyu, Kun Yao, Yuechen Yu, Yuliang Liu, Wanxiang Che, Errui Ding, Cheng-Lin Liu, Jiebo Luo, Shuicheng Yan, Min Zhang, Dimosthenis Karatzas, Xing Sun, Jingdong Wang, Xiang Bai:
ICDAR 2023 Competition on Structured Text Extraction from Visually-Rich Document Images. ICDAR (2) 2023: 536-552 - [c102]George Tom, Minesh Mathew, Sergi Garcia-Bordils, Dimosthenis Karatzas, C. V. Jawahar:
ICDAR 2023 Competition on RoadText Video Text Detection, Tracking and Recognition. ICDAR (2) 2023: 577-586 - [c101]Soumya Jahagirdar, Minesh Mathew, Dimosthenis Karatzas, C. V. Jawahar:
Watching the News: Towards VideoQA Models that can Read. WACV 2023: 4430-4439 - [i60]Stepán Simsa, Milan Sulc, Michal Uricár, Yash Patel, Ahmed Hamdi, Matej Kocián, Matyás Skalický, Jirí Matas, Antoine Doucet, Mickaël Coustaty, Dimosthenis Karatzas:
DocILE Benchmark for Document Information Localization and Extraction. CoRR abs/2302.05658 (2023) - [i59]Weijia Wu, Yuzhong Zhao, Zhuang Li, Jiahong Li, Mike Zheng Shou, Umapada Pal, Dimosthenis Karatzas, Xiang Bai:
ICDAR 2023 Video Text Reading Competition for Dense and Small Text. CoRR abs/2304.04376 (2023) - [i58]Wenwen Yu, Mingyu Liu, Mingrui Chen, Ning Lu, Yinlong Wen, Yuliang Liu, Dimosthenis Karatzas, Xiang Bai:
ICDAR 2023 Competition on Reading the Seal Title. CoRR abs/2304.11966 (2023) - [i57]Wenwen Yu, Chengquan Zhang, Haoyu Cao, Wei Hua, Bohan Li, Huang Chen, Mingyu Liu, Mingrui Chen, Jianfeng Kuang, Mengjun Cheng, Yuning Du, Shikun Feng, Xiaoguang Hu, Pengyuan Lyu, Kun Yao, Yuechen Yu, Yuliang Liu, Wanxiang Che, Errui Ding, Cheng-Lin Liu, Jiebo Luo, Shuicheng Yan, Min Zhang, Dimosthenis Karatzas, Xing Sun, Jingdong Wang, Xiang Bai:
ICDAR 2023 Competition on Structured Text Extraction from Visually-Rich Document Images. CoRR abs/2306.03287 (2023) - [i56]George Tom, Minesh Mathew, Sergi Garcia, Dimosthenis Karatzas, C. V. Jawahar:
Reading Between the Lanes: Text VideoQA on the Road. CoRR abs/2307.03948 (2023) - [i55]Soumya Jahagirdar, Minesh Mathew, Dimosthenis Karatzas, C. V. Jawahar:
Understanding Video Scenes through Text: Insights from Text-based Video Question Answering. CoRR abs/2309.01380 (2023) - [i54]Sergi Garcia-Bordils, Dimosthenis Karatzas, Marçal Rusiñol:
STEP - Towards Structured Scene-Text Spotting. CoRR abs/2309.02356 (2023) - [i53]Rubèn Tito, Khanh Nguyen, Marlon Tobaben, Raouf Kerkouche, Mohamed Ali Souibgui, Kangsoo Jung, Lei Kang, Ernest Valveny, Antti Honkela, Mario Fritz, Dimosthenis Karatzas:
Privacy-Aware Document Visual Question Answering. CoRR abs/2312.10108 (2023) - 2022
- [c100]Josep Brugués i Pujolràs, Lluís Gómez i Bigorda, Dimosthenis Karatzas:
A Multilingual Approach to Scene Text Visual Question Answering. DAS 2022: 65-79 - [c99]Sergi Garcia-Bordils, George Tom, Sangeeth Reddy, Minesh Mathew, Marçal Rusiñol, C. V. Jawahar, Dimosthenis Karatzas:
Read While You Drive - Multilingual Text Tracking on the Road. DAS 2022: 756-770 - [c98]Ali Furkan Biten, Rubèn Tito, Lluís Gómez, Ernest Valveny, Dimosthenis Karatzas:
OCR-IDL: OCR Annotations for Industry Document Library Dataset. ECCV Workshops (4) 2022: 241-252 - [c97]Emanuele Vivoli, Ali Furkan Biten, Andrés Mafla, Dimosthenis Karatzas, Lluís Gómez:
MUST-VQA: MUltilingual Scene-Text VQA. ECCV Workshops (4) 2022: 345-358 - [c96]Sergi Garcia-Bordils, Andrés Mafla, Ali Furkan Biten, Oren Nuriel, Aviad Aberdam, Shai Mazor, Ron Litman, Dimosthenis Karatzas:
Out-of-Vocabulary Challenge Report. ECCV Workshops (4) 2022: 359-375 - [c95]Ali Furkan Biten, Lluís Gómez, Dimosthenis Karatzas:
Let there be a clock on the beach: Reducing Object Hallucination in Image Captioning. WACV 2022: 2473-2482 - [c94]Ali Furkan Biten, Andrés Mafla, Lluís Gómez, Dimosthenis Karatzas:
Is An Image Worth Five Sentences? A New Look into Semantics for Image-Text Matching. WACV 2022: 2483-2492 - [c93]Mohamed Ali Souibgui, Ali Furkan Biten, Sounak Dey, Alicia Fornés, Yousri Kessentini, Lluís Gómez, Dimosthenis Karatzas, Josep Lladós:
One-shot Compositional Data Generation for Low Resource Handwritten Text Recognition. WACV 2022: 2563-2571 - [c92]Minesh Mathew, Viraj Bagal, Rubèn Tito, Dimosthenis Karatzas, Ernest Valveny, C. V. Jawahar:
InfographicVQA. WACV 2022: 2582-2591 - [i52]Ali Furkan Biten, Rubèn Tito, Lluís Gómez, Ernest Valveny, Dimosthenis Karatzas:
OCR-IDL: OCR Annotations for Industry Document Library Dataset. CoRR abs/2202.12985 (2022) - [i51]Mohamed Ali Souibgui, Sanket Biswas, Andrés Mafla, Ali Furkan Biten, Alicia Fornés, Yousri Kessentini, Josep Lladós, Lluís Gómez, Dimosthenis Karatzas:
Text-DIAE: Degradation Invariant Autoencoders for Text Recognition and Document Enhancement. CoRR abs/2203.04814 (2022) - [i50]Sergi Garcia-Bordils, Andrés Mafla, Ali Furkan Biten, Oren Nuriel, Aviad Aberdam, Shai Mazor, Ron Litman, Dimosthenis Karatzas:
Out-of-Vocabulary Challenge Report. CoRR abs/2209.06717 (2022) - [i49]Emanuele Vivoli, Ali Furkan Biten, Andrés Mafla, Dimosthenis Karatzas, Lluís Gómez:
MUST-VQA: MUltilingual Scene-text VQA. CoRR abs/2209.06730 (2022) - [i48]Khanh Nguyen, Ali Furkan Biten, Andrés Mafla, Lluís Gómez, Dimosthenis Karatzas:
Show, Interpret and Tell: Entity-aware Contextualised Image Captioning in Wikipedia. CoRR abs/2209.10474 (2022) - [i47]Soumya Jahagirdar, Minesh Mathew, Dimosthenis Karatzas, C. V. Jawahar:
Watching the News: Towards VideoQA Models that can Read. CoRR abs/2211.05588 (2022) - [i46]Rubèn Tito, Dimosthenis Karatzas, Ernest Valveny:
Hierarchical multimodal transformers for Multi-Page DocVQA. CoRR abs/2212.05935 (2022) - 2021
- [j19]Minesh Mathew, Lluís Gómez, Dimosthenis Karatzas, C. V. Jawahar:
Asking questions on handwritten document collections. Int. J. Document Anal. Recognit. 24(3): 235-249 (2021) - [j18]Andrés Mafla, Rubèn Tito, Sounak Dey, Lluís Gómez, Marçal Rusiñol, Ernest Valveny, Dimosthenis Karatzas:
Real-time Lexicon-free Scene Text Retrieval. Pattern Recognit. 110: 107656 (2021) - [j17]Lluís Gómez, Ali Furkan Biten, Rubèn Pérez Tito, Andrés Mafla, Marçal Rusiñol, Ernest Valveny, Dimosthenis Karatzas:
Multimodal grid features and cell pointers for scene text visual question answering. Pattern Recognit. Lett. 150: 242-249 (2021) - [c91]Rubèn Tito, Minesh Mathew, C. V. Jawahar, Ernest Valveny, Dimosthenis Karatzas:
ICDAR 2021 Competition on Document Visual Question Answering. ICDAR (4) 2021: 635-649 - [c90]Rubèn Tito, Dimosthenis Karatzas, Ernest Valveny:
Document Collection Visual Question Answering. ICDAR (2) 2021: 778-792 - [c89]Minesh Mathew, Dimosthenis Karatzas, C. V. Jawahar:
DocVQA: A Dataset for VQA on Document Images. WACV 2021: 2199-2208 - [c88]Andrés Mafla, Rafael Sampaio de Rezende, Lluís Gómez, Diane Larlus, Dimosthenis Karatzas:
StacMR: Scene-Text Aware Cross-Modal Retrieval. WACV 2021: 2219-2229 - [c87]Andrés Mafla, Sounak Dey, Ali Furkan Biten, Lluís Gómez, Dimosthenis Karatzas:
Multi-Modal Reasoning Graph for Scene-Text Based Fine-Grained Image Classification and Retrieval. WACV 2021: 4022-4032 - [i45]Zheng Huang, Kai Chen, Jianhua He, Xiang Bai, Dimosthenis Karatzas, Shijian Lu, C. V. Jawahar:
ICDAR2019 Competition on Scanned Receipt OCR and Information Extraction. CoRR abs/2103.10213 (2021) - [i44]Minesh Mathew, Viraj Bagal, Rubèn Pérez Tito, Dimosthenis Karatzas, Ernest Valveny, C. V. Jawahar:
InfographicVQA. CoRR abs/2104.12756 (2021) - [i43]Rubèn Tito, Dimosthenis Karatzas, Ernest Valveny:
Document Collection Visual Question Answering. CoRR abs/2104.14336 (2021) - [i42]Mohamed Ali Souibgui, Ali Furkan Biten, Sounak Dey, Alicia Fornés, Yousri Kessentini, Lluís Gómez, Dimosthenis Karatzas, Josep Lladós:
One-shot Compositional Data Generation for Low Resource Handwritten Text Recognition. CoRR abs/2105.05300 (2021) - [i41]Minesh Mathew, Lluís Gómez, Dimosthenis Karatzas, C. V. Jawahar:
Asking questions on handwritten document collections. CoRR abs/2110.00711 (2021) - [i40]Ali Furkan Biten, Lluís Gómez i Bigorda, Dimosthenis Karatzas:
Let there be a clock on the beach: Reducing Object Hallucination in Image Captioning. CoRR abs/2110.01705 (2021) - [i39]Ali Furkan Biten, Andrés Mafla, Lluís Gómez, Dimosthenis Karatzas:
Is An Image Worth Five Sentences? A New Look into Semantics for Image-Text Matching. CoRR abs/2110.02623 (2021) - [i38]Rubèn Tito, Minesh Mathew, C. V. Jawahar, Ernest Valveny, Dimosthenis Karatzas:
ICDAR 2021 Competition on Document Visual Question Answering. CoRR abs/2111.05547 (2021) - 2020
- [c86]Raúl Gomez, Jaume Gibert, Lluís Gómez, Dimosthenis Karatzas:
Location Sensitive Image Retrieval and Tagging. ECCV (16) 2020: 649-665 - [c85]Klára Janousková, Jiri Matas, Lluís Gómez, Dimosthenis Karatzas:
Text Recognition - Real World Data and Where to Find Them. ICPR 2020: 4489-4496 - [c84]Sangeeth Reddy, Minesh Mathew, Lluís Gómez, Marçal Rusiñol, Dimosthenis Karatzas, C. V. Jawahar:
RoadText-1K: Text Detection & Recognition Dataset for Driving Videos. ICRA 2020: 11074-11080 - [c83]Raul Gomez, Yahui Liu, Marco De Nadai, Dimosthenis Karatzas, Bruno Lepri, Nicu Sebe:
Retrieval Guided Unsupervised Multi-domain Image to Image Translation. ACM Multimedia 2020: 3164-3172 - [c82]Raul Gomez, Jaume Gibert, Lluís Gómez, Dimosthenis Karatzas:
Exploring Hate Speech Detection in Multimodal Publications. WACV 2020: 1459-1467 - [c81]Andrés Mafla, Sounak Dey, Ali Furkan Biten, Lluís Gómez, Dimosthenis Karatzas:
Fine-grained Image Classification and Retrieval by Combining Visual and Locally Pooled Textual Features. WACV 2020: 2939-2948 - [e1]Xiang Bai, Dimosthenis Karatzas, Daniel Lopresti:
Document Analysis Systems - 14th IAPR International Workshop, DAS 2020, Wuhan, China, July 26-29, 2020, Proceedings. Lecture Notes in Computer Science 12116, Springer 2020, ISBN 978-3-030-57057-6 - [i37]Andrés Mafla, Sounak Dey, Ali Furkan Biten, Lluís Gómez, Dimosthenis Karatzas:
Fine-grained Image Classification and Retrieval by Combining Visual and Locally Pooled Textual Features. CoRR abs/2001.04732 (2020) - [i36]Sangeeth Reddy, Minesh Mathew, Lluís Gómez, Marçal Rusiñol, Dimosthenis Karatzas, C. V. Jawahar:
RoadText-1K: Text Detection & Recognition Dataset for Driving Videos. CoRR abs/2005.09496 (2020) - [i35]Lluís Gómez, Ali Furkan Biten, Rubèn Tito, Andrés Mafla, Marçal Rusiñol, Ernest Valveny, Dimosthenis Karatzas:
Multimodal grid features and cell pointers for Scene Text Visual Question Answering. CoRR abs/2006.00923 (2020) - [i34]Minesh Mathew, Dimosthenis Karatzas, R. Manmatha, C. V. Jawahar:
DocVQA: A Dataset for VQA on Document Images. CoRR abs/2007.00398 (2020) - [i33]Klára Janousková, Jiri Matas, Lluís Gómez, Dimosthenis Karatzas:
Text Recognition - Real World Data and Where to Find Them. CoRR abs/2007.03098 (2020) - [i32]Raul Gomez, Jaume Gibert, Lluís Gómez, Dimosthenis Karatzas:
Location Sensitive Image Retrieval and Tagging. CoRR abs/2007.03375 (2020) - [i31]Raul Gomez, Yahui Liu, Marco De Nadai, Dimosthenis Karatzas, Bruno Lepri, Nicu Sebe:
Retrieval Guided Unsupervised Multi-domain Image-to-Image Translation. CoRR abs/2008.04991 (2020) - [i30]Minesh Mathew, Rubèn Tito, Dimosthenis Karatzas, R. Manmatha, C. V. Jawahar:
Document Visual Question Answering Challenge 2020. CoRR abs/2008.08899 (2020) - [i29]Andrés Mafla, Sounak Dey, Ali Furkan Biten, Lluís Gómez, Dimosthenis Karatzas:
Multi-Modal Reasoning Graph for Scene-Text Based Fine-Grained Image Classification and Retrieval. CoRR abs/2009.09809 (2020) - [i28]Andrés Mafla, Rafael Sampaio de Rezende, Lluís Gómez, Diane Larlus, Dimosthenis Karatzas:
StacMR: Scene-Text Aware Cross-Modal Retrieval. CoRR abs/2012.04329 (2020)
2010 – 2019
- 2019
- [j16]Dena Bazazian, Raul Gomez, Anguelos Nicolaou, Lluís Gómez, Dimosthenis Karatzas, Andrew D. Bagdanov:
FAST: Facilitated and Accurate Scene Text Proposals through FCN Guided Pruning. Pattern Recognit. Lett. 119: 112-120 (2019) - [c80]Ali Furkan Biten, Lluís Gómez, Marçal Rusiñol, Dimosthenis Karatzas:
Good News, Everyone! Context Driven Entity-Aware Captioning for News Images. CVPR 2019: 12466-12475 - [c79]Ali Furkan Biten, Rubèn Tito, Andrés Mafla, Lluís Gómez i Bigorda, Marçal Rusiñol, C. V. Jawahar, Ernest Valveny, Dimosthenis Karatzas:
Scene Text Visual Question Answering. ICCV 2019: 4290-4300 - [c78]Helena Muñoz, Fernando Vilariño, Dimosthenis Karatzas:
Eye-Movements During Information Extraction from Administrative Documents. HDI@ICDAR 2019: 6-9 - [c77]Mohammed Al-Rawi, Ernest Valveny, Dimosthenis Karatzas:
Can One Deep Learning Model Learn Script-Independent Multilingual Word-Spotting? ICDAR 2019: 260-267 - [c76]Raul Gomez, Ali Furkan Biten, Lluís Gómez, Jaume Gibert, Dimosthenis Karatzas, Marçal Rusiñol:
Selective Style Transfer for Text. ICDAR 2019: 805-812 - [c75]Zheng Huang, Kai Chen, Jianhua He, Xiang Bai, Dimosthenis Karatzas, Shijian Lu, C. V. Jawahar:
ICDAR2019 Competition on Scanned Receipt OCR and Information Extraction. ICDAR 2019: 1516-1520 - [c74]Yipeng Sun, Dimosthenis Karatzas, Chee Seng Chan, Lianwen Jin, Zihan Ni, Chee Kheng Chng, Yuliang Liu, Canjie Luo, Chun Chet Ng, Junyu Han, Errui Ding, Jingtuo Liu:
ICDAR 2019 Competition on Large-Scale Street View Text with Partial Labeling - RRC-LSVT. ICDAR 2019: 1557-1562 - [c73]Ali Furkan Biten, Rubèn Tito, Andrés Mafla, Lluís Gómez, Marçal Rusiñol, Minesh Mathew, C. V. Jawahar, Ernest Valveny, Dimosthenis Karatzas:
ICDAR 2019 Competition on Scene Text Visual Question Answering. ICDAR 2019: 1563-1570 - [c72]Chee Kheng Chng, Errui Ding, Jingtuo Liu, Dimosthenis Karatzas, Chee Seng Chan, Lianwen Jin, Yuliang Liu, Yipeng Sun, Chun Chet Ng, Canjie Luo, Zihan Ni, ChuanMing Fang, Shuaitao Zhang, Junyu Han:
ICDAR2019 Robust Reading Challenge on Arbitrary-Shaped Text - RRC-ArT. ICDAR 2019: 1571-1576 - [c71]Rui Zhang, Mingkun Yang, Xiang Bai, Baoguang Shi, Dimosthenis Karatzas, Shijian Lu, C. V. Jawahar, Yongsheng Zhou, Qianyi Jiang, Qi Song, Nan Li, Kai Zhou, Lei Wang, Dong Wang, Minghui Liao:
ICDAR 2019 Robust Reading Challenge on Reading Chinese Text on Signboard. ICDAR 2019: 1577-1581 - [c70]Nibal Nayef, Cheng-Lin Liu, Jean-Marc Ogier, Yash Patel, Michal Busta, Pinaki Nath Chowdhury, Dimosthenis Karatzas, Wafa Khlif, Jiri Matas, Umapada Pal, Jean-Christophe Burie:
ICDAR2019 Robust Reading Challenge on Multi-lingual Scene Text Detection and Recognition - RRC-MLT-2019. ICDAR 2019: 1582-1587 - [c69]Yash Patel, Lluís Gómez, Marçal Rusiñol, Dimosthenis Karatzas, C. V. Jawahar:
Self-Supervised Visual Representations for Cross-Modal Retrieval. ICMR 2019: 182-186 - [i27]Raul Gomez, Lluís Gómez, Jaume Gibert, Dimosthenis Karatzas:
Self-Supervised Learning from Web Data for Multimodal Retrieval. CoRR abs/1901.02004 (2019) - [i26]Yash Patel, Lluís Gómez, Marçal Rusiñol, Dimosthenis Karatzas, C. V. Jawahar:
Self-Supervised Visual Representations for Cross-Modal Retrieval. CoRR abs/1902.00378 (2019) - [i25]Ali Furkan Biten, Lluís Gómez, Marçal Rusiñol, Dimosthenis Karatzas:
Good News, Everyone! Context driven entity-aware captioning for news images. CoRR abs/1904.01475 (2019) - [i24]Ali Furkan Biten, Rubèn Tito, Andrés Mafla, Lluís Gómez, Marçal Rusiñol, Ernest Valveny, C. V. Jawahar, Dimosthenis Karatzas:
Scene Text Visual Question Answering. CoRR abs/1905.13648 (2019) - [i23]Raul Gomez, Ali Furkan Biten, Lluís Gómez, Jaume Gibert, Marçal Rusiñol, Dimosthenis Karatzas:
Selective Style Transfer for Text. CoRR abs/1906.01466 (2019) - [i22]Ali Furkan Biten, Rubèn Tito, Andrés Mafla, Lluís Gómez, Marçal Rusiñol, Minesh Mathew, C. V. Jawahar, Ernest Valveny, Dimosthenis Karatzas:
ICDAR 2019 Competition on Scene Text Visual Question Answering. CoRR abs/1907.00490 (2019) - [i21]Nibal Nayef, Yash Patel, Michal Busta, Pinaki Nath Chowdhury, Dimosthenis Karatzas, Wafa Khlif, Jiri Matas, Umapada Pal, Jean-Christophe Burie, Cheng-Lin Liu, Jean-Marc Ogier:
ICDAR2019 Robust Reading Challenge on Multi-lingual Scene Text Detection and Recognition - RRC-MLT-2019. CoRR abs/1907.00945 (2019) - [i20]Chee Kheng Chng, Yuliang Liu, Yipeng Sun, Chun Chet Ng, Canjie Luo, Zihan Ni, ChuanMing Fang, Shuaitao Zhang, Junyu Han, Errui Ding, Jingtuo Liu, Dimosthenis Karatzas, Chee Seng Chan, Lianwen Jin:
ICDAR2019 Robust Reading Challenge on Arbitrary-Shaped Text (RRC-ArT). CoRR abs/1909.07145 (2019) - [i19]Yipeng Sun, Zihan Ni, Chee Kheng Chng, Yuliang Liu, Canjie Luo, Chun Chet Ng, Junyu Han, Errui Ding, Jingtuo Liu, Dimosthenis Karatzas, Chee Seng Chan, Lianwen Jin:
ICDAR 2019 Competition on Large-scale Street View Text with Partial Labeling - RRC-LSVT. CoRR abs/1909.07741 (2019) - [i18]Raul Gomez, Jaume Gibert, Lluís Gómez, Dimosthenis Karatzas:
Exploring Hate Speech Detection in Multimodal Publications. CoRR abs/1910.03814 (2019) - [i17]Xi Liu, Rui Zhang, Yongsheng Zhou, Qianyi Jiang, Qi Song, Nan Li, Kai Zhou, Lei Wang, Dong Wang, Minghui Liao, Mingkun Yang, Xiang Bai, Baoguang Shi, Dimosthenis Karatzas, Shijian Lu, C. V. Jawahar:
ICDAR 2019 Robust Reading Challenge on Reading Chinese Text on Signboard. CoRR abs/1912.09641 (2019) - 2018
- [c68]Dena Bazazian, Dimosthenis Karatzas, Andrew D. Bagdanov:
Word Spotting in Scene Images Based on Character Recognition. CVPR Workshops 2018: 1872-1874 - [c67]Dimosthenis Karatzas, Lluís Gómez, Anguelos Nicolaou, Marçal Rusiñol:
The Robust Reading Competition Annotation and Evaluation Platform. DAS 2018: 61-66 - [c66]Lluís Gómez, Marçal Rusiñol, Dimosthenis Karatzas:
Cutting Sayre's Knot: Reading Scene Text without Segmentation. Application to Utility Meters. DAS 2018: 97-102 - [c65]Raul Gomez, Lluís Gómez, Jaume Gibert, Dimosthenis Karatzas:
Learning to Learn from Web Data Through Deep Semantic Embeddings. ECCV Workshops (6) 2018: 514-529 - [c64]Raul Gomez, Lluís Gómez, Jaume Gibert, Dimosthenis Karatzas:
Learning from #Barcelona Instagram Data What Locals and Tourists Post About Its Neighbourhoods. ECCV Workshops (6) 2018: 530-544 - [c63]Lluís Gómez, Andrés Mafla, Marçal Rusiñol, Dimosthenis Karatzas:
Single Shot Scene Text Retrieval. ECCV (14) 2018: 728-744 - [c62]Mohammed Al-Rawi, Dimosthenis Karatzas:
On the Labeling Correctness in Computer Vision Datasets. IAL@PKDD/ECML 2018: 1-23 - [c61]Anguelos Nicolaou, Sounak Dey, Vincent Christlein, Andreas K. Maier, Dimosthenis Karatzas:
Non-deterministic Behavior of Ranking-Based Metrics When Evaluating Embeddings. RRPR 2018: 71-82 - [i16]Anguelos Nicolaou, Sounak Dey, Vincent Christlein, Andreas K. Maier, Dimosthenis Karatzas:
Non-deterministic Behavior of Ranking-based Metrics when Evaluating Embeddings. CoRR abs/1806.07171 (2018) - [i15]Yash Patel, Lluís Gómez i Bigorda, Raul Gomez, Marçal Rusiñol, Dimosthenis Karatzas, C. V. Jawahar:
TextTopicNet - Self-Supervised Learning of Visual Features Through Embedding Images on Semantic Text Spaces. CoRR abs/1807.02110 (2018) - [i14]Raul Gomez, Lluís Gómez, Jaume Gibert, Dimosthenis Karatzas:
Learning to Learn from Web Data through Deep Semantic Embeddings. CoRR abs/1808.06368 (2018) - [i13]Raul Gomez, Lluís Gómez, Jaume Gibert, Dimosthenis Karatzas:
Learning from #Barcelona Instagram data what Locals and Tourists post about its Neighbourhoods. CoRR abs/1808.06369 (2018) - [i12]Lluís Gómez, Andrés Mafla, Marçal Rusiñol, Dimosthenis Karatzas:
Single Shot Scene Text Retrieval. CoRR abs/1808.09044 (2018) - [i11]Dena Bazazian, Dimosthenis Karatzas, Andrew D. Bagdanov:
Soft-PHOC Descriptor for End-to-End Word Spotting in Egocentric Scene Images. CoRR abs/1809.00854 (2018) - 2017
- [j15]Lluís Gómez i Bigorda, Anguelos Nicolaou, Dimosthenis Karatzas:
Improving patch-based scene text script identification with ensembles of conjoined networks. Pattern Recognit. 67: 85-96 (2017) - [j14]Lluís Gómez i Bigorda, Dimosthenis Karatzas:
TextProposals: A text-specific selective search algorithm for word spotting in the wild. Pattern Recognit. 70: 60-74 (2017) - [c60]Lluis Gomez-Bigorda, Yash Patel, Marçal Rusiñol, Dimosthenis Karatzas, C. V. Jawahar:
Self-Supervised Learning of Visual Features through Embedding Images into Text Topic Spaces. CVPR 2017: 2017-2026 - [c59]Leonardo Galteri, Dena Bazazian, Lorenzo Seidenari, Marco Bertini, Andrew D. Bagdanov, Anguelos Nicolaou, Dimosthenis Karatzas, Alberto Del Bimbo:
Reading Text in the Wild from Compressed Images. ICCV Workshops 2017: 2399-2407 - [c58]Lluis Gomez-Bigorda, Marçal Rusiñol, Dimosthenis Karatzas:
LSDE: Levenshtein Space Deep Embedding for Query-by-String Word Spotting. ICDAR 2017: 499-504 - [c57]Raul Gomez, Baoguang Shi, Lluis Gomez-Bigorda, Lukás Neumann, Andreas Veit, Jiri Matas, Serge J. Belongie, Dimosthenis Karatzas:
ICDAR2017 Robust Reading Challenge on COCO-Text. ICDAR 2017: 1435-1443 - [c56]Chun Yang, Xu-Cheng Yin, Hong Yu, Dimosthenis Karatzas, Yu Cao:
ICDAR2017 Robust Reading Challenge on Text Extraction from Biomedical Literature Figures (DeTEXT). ICDAR 2017: 1444-1447 - [c55]Masakazu Iwamura, Naoyuki Morimoto, Keishi Tainaka, Dena Bazazian, Lluis Gomez-Bigorda, Dimosthenis Karatzas:
ICDAR2017 Robust Reading Challenge on Omnidirectional Video. ICDAR 2017: 1448-1453 - [c54]Nibal Nayef, Fei Yin, Imen Bizid, Hyunsoo Choi, Yuan Feng, Dimosthenis Karatzas, Zhenbo Luo, Umapada Pal, Christophe Rigaud, Joseph Chazalon, Wafa Khlif, Muhammad Muzzamil Luqman, Jean-Christophe Burie, Cheng-Lin Liu, Jean-Marc Ogier:
ICDAR2017 Robust Reading Challenge on Multi-Lingual Scene Text Detection and Script Identification - RRC-MLT. ICDAR 2017: 1454-1459 - [i10]Dena Bazazian, Raul Gomez, Anguelos Nicolaou, Lluís Gómez i Bigorda, Dimosthenis Karatzas, Andrew D. Bagdanov:
Improving Text Proposals for Scene Images with Fully Convolutional Networks. CoRR abs/1702.05089 (2017) - [i9]Lluís Gómez i Bigorda, Yash Patel, Marçal Rusiñol, Dimosthenis Karatzas, C. V. Jawahar:
Self-supervised learning of visual features through embedding images into text topic spaces. CoRR abs/1705.08631 (2017) - [i8]Dimosthenis Karatzas, Lluís Gómez i Bigorda, Marçal Rusiñol:
The Robust Reading Competition Annotation and Evaluation Platform. CoRR abs/1710.06617 (2017) - 2016
- [j13]Lluis Gomez-Bigorda, Dimosthenis Karatzas:
A fast hierarchical method for multi-script and arbitrary oriented scene text extraction. Int. J. Document Anal. Recognit. 19(4): 335-349 (2016) - [c53]Lluís Gómez i Bigorda, Dimosthenis Karatzas:
A Fine-Grained Approach to Scene Text Script Identification. DAS 2016: 192-197 - [c52]Dimosthenis Karatzas, Vincent Poulain D'Andecy, Marçal Rusiñol, Antonio Chica, Pere-Pau Vázquez Alcocer:
Human-Document Interaction Systems - A New Frontier for Document Image Analysis. DAS 2016: 369-374 - [c51]Anguelos Nicolaou, Andrew D. Bagdanov, Lluis Gomez i Bigorda, Dimosthenis Karatzas:
Visual Script and Language Identification. DAS 2016: 393-398 - [c50]Yash Patel, Lluís Gómez i Bigorda, Marçal Rusiñol, Dimosthenis Karatzas:
Dynamic Lexicon Generation for Natural Scene Images. ECCV Workshops (1) 2016: 395-410 - [i7]Anguelos Nicolaou, Andrew D. Bagdanov, Lluis Gomez-Bigorda, Dimosthenis Karatzas:
Visual Script and Language Identification. CoRR abs/1601.01885 (2016) - [i6]Lluís Gómez i Bigorda, Dimosthenis Karatzas:
A fine-grained approach to scene text script identification. CoRR abs/1602.07475 (2016) - [i5]Lluís Gómez i Bigorda, Anguelos Nicolaou, Dimosthenis Karatzas:
Boosting patch-based scene text script identification with ensembles of conjoined networks. CoRR abs/1602.07480 (2016) - [i4]Lluis Gomez-Bigorda, Dimosthenis Karatzas:
TextProposals: a Text-specific Selective Search Algorithm for Word Spotting in the Wild. CoRR abs/1604.02619 (2016) - 2015
- [j12]Faisal Shafait, Dimosthenis Karatzas, Seiichi Uchida, Masakazu Iwamura:
Preface. Int. J. Document Anal. Recognit. 18(2): 109-110 (2015) - [j11]Christophe Rigaud, Clément Guérin, Dimosthenis Karatzas, Jean-Christophe Burie, Jean-Marc Ogier:
Knowledge-driven understanding of images in comic books. Int. J. Document Anal. Recognit. 18(3): 199-221 (2015) - [c49]Lluís Gómez i Bigorda, Dimosthenis Karatzas:
Object proposals for text extraction in the wild. ICDAR 2015: 206-210 - [c48]Hongxing Gao, Marçal Rusiñol, Dimosthenis Karatzas, Josep Lladós, Rajiv Jain, David S. Doermann:
Novel line verification for multiple instance focused retrieval in document collections. ICDAR 2015: 481-485 - [c47]Anguelos Nicolaou, Andrew D. Bagdanov, Marcus Liwicki, Dimosthenis Karatzas:
Sparse radial sampling LBP for writer identification. ICDAR 2015: 716-720 - [c46]Dimosthenis Karatzas, Lluis Gomez-Bigorda, Anguelos Nicolaou, Suman K. Ghosh, Andrew D. Bagdanov, Masakazu Iwamura, Jiri Matas, Lukás Neumann, Vijay Ramaseshan Chandrasekhar, Shijian Lu, Faisal Shafait, Seiichi Uchida, Ernest Valveny:
ICDAR 2015 competition on Robust Reading. ICDAR 2015: 1156-1160 - [c45]Suman K. Ghosh, Lluís Gómez i Bigorda, Dimosthenis Karatzas, Ernest Valveny:
Efficient indexing for Query By String text retrieval. ICDAR 2015: 1236-1240 - [c44]Jochen Kuhn, Alexander Nussbaumer, Johanna Pirker, Dimosthenis Karatzas, Alain Pagani, Owen Conlan, Martin Memmel, Christina M. Steiner, Christian Gütl, Dietrich Albert, Andreas Dengel:
Advancing Physics Learning Through Traversing a Multi-Modal Experimentation Space. Intelligent Environments (Workshops) 2015: 373-380 - [c43]Marçal Rusiñol, Dimosthenis Karatzas, Josep Lladós:
Automatic Verification of Properly Signed Multi-page Document Images. ISVC (2) 2015: 327-336 - [i3]Anguelos Nicolaou, Andrew D. Bagdanov, Marcus Liwicki, Dimosthenis Karatzas:
Sparse Radial Sampling LBP for Writer Identification. CoRR abs/1504.06133 (2015) - [i2]Lluís Gómez i Bigorda, Dimosthenis Karatzas:
Object Proposals for Text Extraction in the Wild. CoRR abs/1509.02317 (2015) - 2014
- [j10]Antonio Clavelli, Dimosthenis Karatzas, Josep Lladós, Mario Ferraro, Giuseppe Boccignone:
Modelling Task-Dependent Eye Guidance to Objects in Pictures. Cogn. Comput. 6(3): 558-584 (2014) - [j9]C. Alejandro Párraga, Jordi Roca-Vila, Dimosthenis Karatzas, Sophie M. Wuerger:
Limitations of visual gamma corrections in LCD displays. Displays 35(5): 227-239 (2014) - [j8]Marçal Rusiñol, Volkmar Frinken, Dimosthenis Karatzas, Andrew D. Bagdanov, Josep Lladós:
Multimodal page classification in administrative document image streams. Int. J. Document Anal. Recognit. 17(4): 331-341 (2014) - [c42]Lluís Gómez i Bigorda, Dimosthenis Karatzas:
Scene Text Recognition: No Country for Old Men? ACCV Workshops (2) 2014: 157-168 - [c41]Volkmar Frinken, Dimosthenis Karatzas, Andreas Fischer:
A Cache Language Model for Whole Document Handwriting Recognition. Document Analysis Systems 2014: 166-170 - [c40]Dimosthenis Karatzas, Sergi Robles Mestre, Lluis Gomez i Bigorda:
An On-line Platform for Ground Truthing and Performance Evaluation of Text Extraction Systems. Document Analysis Systems 2014: 242-246 - [c39]Christophe Rigaud, Dimosthenis Karatzas, Jean-Christophe Burie, Jean-Marc Ogier:
Color Descriptor for Content-Based Drawing Retrieval. Document Analysis Systems 2014: 267-271 - [c38]Hongxing Gao, Marçal Rusiñol, Dimosthenis Karatzas, Josep Lladós:
Fast structural matching for document image retrieval through spatial databases. DRR 2014: 90210N-90210N-10 - [c37]Hongxing Gao, Marçal Rusiñol, Dimosthenis Karatzas, Josep Lladós:
Embedding Document Structure to Bag-of-Words through Pair-wise Stable Key-Regions. ICPR 2014: 2903-2908 - [c36]Lluís Gómez i Bigorda, Dimosthenis Karatzas:
MSER-Based Real-Time Text Detection and Tracking. ICPR 2014: 3110-3115 - [p2]Anastasios L. Kesidis, Dimosthenis Karatzas:
Logo and Trademark Recognition. Handbook of Document Image Processing and Recognition 2014: 591-646 - [i1]Lluis Gomez i Bigorda, Dimosthenis Karatzas:
A Fast Hierarchical Method for Multi-script and Arbitrary Oriented Scene Text Extraction. CoRR abs/1407.7504 (2014) - 2013
- [c35]Marçal Rusiñol, Dimosthenis Karatzas, Josep Lladós:
Spotting Graphical Symbols in Camera-Acquired Documents in Real Time. GREC 2013: 3-10 - [c34]Christophe Rigaud, Dimosthenis Karatzas, Jean-Christophe Burie, Jean-Marc Ogier:
Adaptive Contour Classification of Comics Speech Balloons. GREC 2013: 53-62 - [c33]Antonio Clavelli, Dimosthenis Karatzas, Josep Lladós, Mario Ferraro, Giuseppe Boccignone:
Towards Modelling an Attention-Based Text Localization Process. IbPRIA 2013: 296-303 - [c32]Hongxing Gao, Marçal Rusiñol, Dimosthenis Karatzas, Josep Lladós, Tomokazu Sato, Masakazu Iwamura, Koichi Kise:
Key-Region Detection for Document Images - Application to Administrative Document Retrieval. ICDAR 2013: 230-234 - [c31]Lluis Gomez i Bigorda, Dimosthenis Karatzas:
Multi-script Text Extraction from Natural Scenes. ICDAR 2013: 467-471 - [c30]Albert Gordo, Marçal Rusiñol, Dimosthenis Karatzas, Andrew D. Bagdanov:
Document Classification and Page Stream Segmentation for Digital Mailroom Applications. ICDAR 2013: 621-625 - [c29]Christophe Rigaud, Jean-Christophe Burie, Jean-Marc Ogier, Dimosthenis Karatzas, Joost van de Weijer:
An Active Contour Model for Speech Balloon Detection in Comics. ICDAR 2013: 1240-1244 - [c28]Dimosthenis Karatzas, Faisal Shafait, Seiichi Uchida, Masakazu Iwamura, Lluis Gomez i Bigorda, Sergi Robles Mestre, Joan Mas, David Fernández Mota, Jon Almazán, Lluís-Pere de las Heras:
ICDAR 2013 Robust Reading Competition. ICDAR 2013: 1484-1493 - [c27]Rahat Khan, Joost van de Weijer, Dimosthenis Karatzas, Damien Muselet:
Towards multispectral data acquisition with hand-held devices. ICIP 2013: 2053-2057 - [c26]Hongxing Gao, Marçal Rusiñol, Dimosthenis Karatzas, Apostolos Antonacopoulos, Josep Lladós:
An Interactive Appearance-based Document Retrieval System for Historical Newspapers. VISAPP (2) 2013: 84-87 - [c25]Christophe Rigaud, Dimosthenis Karatzas, Joost van de Weijer, Jean-Christophe Burie, Jean-Marc Ogier:
Automatic Text Localisation in Scanned Comic Books. VISAPP (1) 2013: 814-819 - 2012
- [c24]Marçal Rusiñol, Lluís-Pere de las Heras, Joan Mas, Oriol Ramos Terrades, Dimosthenis Karatzas, Anjan Dutta, Gemma Sánchez, Josep Lladós:
CVC-UAB's Participation in the Flowchart Recognition Task of CLEF-IP 2012. CLEF (Online Working Notes/Labs/Workshop) 2012 - [c23]Marçal Rusiñol, Dimosthenis Karatzas, Andrew D. Bagdanov, Josep Lladós:
Multipage document retrieval by textual and visual representations. ICPR 2012: 521-524 - 2011
- [j7]Miquel Ferrer, Dimosthenis Karatzas, Ernest Valveny, Itziar Bardají, Horst Bunke:
A generic framework for median graph computation based on a recursive embedding approach. Comput. Vis. Image Underst. 115(7): 919-928 (2011) - [j6]Kaida Xiao, Chenyang Fu, Dimosthenis Karatzas, Sophie M. Wuerger:
Visual gamma correction for LCD displays. Displays 32(1): 17-23 (2011) - [j5]Simone Marinai, Dimosthenis Karatzas:
Report from the AND 2009 working group on noisy text datasets. Int. J. Document Anal. Recognit. 14(2): 113-116 (2011) - [c22]Marçal Rusiñol, David Aldavert, Dimosthenis Karatzas, Ricardo Toledo, Josep Lladós:
Interactive Trademark Image Retrieval by Fusing Semantic and Visual Content. ECIR 2011: 314-325 - [c21]Marçal Rusiñol, Vincent Poulain D'Andecy, Dimosthenis Karatzas, Josep Lladós:
Classification of Administrative Document Images by Logo Identification. GREC 2011: 49-58 - [c20]Dimosthenis Karatzas, Sergi Robles Mestre, Joan Mas, Farshad Nourbakhsh, Partha Pratim Roy:
ICDAR 2011 Robust Reading Competition - Challenge 1: Reading Text in Born-Digital Images (Web and Email). ICDAR 2011: 1485-1490 - [c19]Kaida Xiao, Dimitris S. Mylonas, Chenyang Fu, Dimosthenis Karatzas, Sophie M. Wuerger:
Locating Unique Hues under mixed illumination conditions in CIECAM02. CIC 2011: 94-97 - 2010
- [j4]Mathieu Delalandre, Ernest Valveny, Tony P. Pridmore, Dimosthenis Karatzas:
Generation of synthetic documents for performance evaluation of symbol recognition & spotting systems. Int. J. Document Anal. Recognit. 13(3): 187-207 (2010) - [j3]Alicia Fornés, Josep Lladós, Gemma Sánchez, Dimosthenis Karatzas:
Rotation invariant hand-drawn symbol recognition based on a dynamic time warping model. Int. J. Document Anal. Recognit. 13(3): 229-241 (2010) - [c18]Antonio Clavelli, Dimosthenis Karatzas, Josep Lladós:
A framework for the assessment of text extraction algorithms on complex colour images. Document Analysis Systems 2010: 19-26 - [c17]Farshad Nourbakhsh, Dimosthenis Karatzas, Ernest Valveny:
A polar-based logo representation based on topological and colour features. Document Analysis Systems 2010: 341-348 - [c16]Marçal Rusiñol, Farshad Nourbakhsh, Dimosthenis Karatzas, Ernest Valveny, Josep Lladós:
Perceptual Image Retrieval by Adding Color Information to the Shape Context Descriptor. ICPR 2010: 1594-1597
2000 – 2009
- 2009
- [c15]Miquel Ferrer, Dimosthenis Karatzas, Ernest Valveny, Horst Bunke:
A Recursive Embedding Approach to Median Graph Computation. GbRPR 2009: 113-123 - [c14]Antonio Clavelli, Dimosthenis Karatzas:
Text Segmentation in Colour Posters from the Spanish Civil War Era. ICDAR 2009: 181-185 - 2008
- [j2]Josep Lladós, Dimosthenis Karatzas, Joan Mas, Gemma Sánchez:
A Generic Architecture for the Conversion of Document Collections into Semantically Annotated Digital Archives. J. Univers. Comput. Sci. 14(18): 2912-2935 (2008) - [c13]Dimosthenis Karatzas:
Detecting Gradients in Text Images Using the Hough Transform. Document Analysis Systems 2008: 245-252 - [c12]Joan Mas, José A. Rodríguez, Dimosthenis Karatzas, Gemma Sánchez, Josep Lladós:
HistoSketch: A Semi-Automatic Annotation Tool for Archival Documents. Document Analysis Systems 2008: 517-524 - [c11]Dimosthenis Karatzas, Marçal Rusiñol, Jacobus Antens, Miquel Ferrer:
Segmentation robust to the vignette effect for machine vision systems. ICPR 2008: 1-4 - 2007
- [j1]Dimosthenis Karatzas, Apostolos Antonacopoulos:
Colour text segmentation in web images based on human perception. Image Vis. Comput. 25(5): 564-577 (2007) - 2006
- [c10]Apostolos Antonacopoulos, Dimosthenis Karatzas, David Bridson:
Ground Truth for Layout Analysis Performance Evaluation. Document Analysis Systems 2006: 302-311 - 2005
- [c9]Sophie M. Wuerger, Dimosthenis Karatzas, Georg F. Meyer:
A display calibration technique based on invariant human colour mechanisms. APGV 2005: 171 - [c8]Apostolos Antonacopoulos, Dimosthenis Karatzas:
Semantics-Based Content Extraction in Typewritten Historical Documents. ICDAR 2005: 48-53 - 2004
- [c7]Apostolos Antonacopoulos, Dimosthenis Karatzas:
A Complete Approach to the Conversion of Typewritten Historical Documents for Digital Archives. Document Analysis Systems 2004: 90-101 - [c6]Apostolos Antonacopoulos, Dimosthenis Karatzas:
Document Image Analysis for World War II Personal Records. DIAL 2004: 336-341 - [c5]Apostolos Antonacopoulos, Dimosthenis Karatzas, Henryk Krawczyk, Bogdan Wiszniewski:
The lifecycle of a digital historical document: structure and content. ACM Symposium on Document Engineering 2004: 147-154 - [c4]Dimosthenis Karatzas, Apostolos Antonacopoulos:
Text Extraction from Web Images Based on A Split-and-Merge Segmentation Method Using Colour Perception. ICPR (2) 2004: 634-637 - 2003
- [b1]Dimosthenis A. Karatzas:
Text segmentation in web images using colour perception and topological features. University of Liverpool, UK, 2003 - [c3]Dimosthenis Karatzas, Apostolos Antonacopoulos:
Two Approaches for Text Segmentation in Web Images. ICDAR 2003: 131-136 - [c2]Apostolos Antonacopoulos, Basilios Gatos, Dimosthenis Karatzas:
ICDAR 2003 Page Segmentation Competition. ICDAR 2003: 688- - [p1]Apostolos Antonacopoulos, Dimosthenis Karatzas:
A fuzzy Approach to Text Segmentation in Web Images based on Human Colour perception. Web Document Analysis 2003: 203-221 - 2002
- [c1]Apostolos Antonacopoulos, Dimosthenis Karatzas:
Fuzzy Segmentation of Characters in Web Images Based on Human Colour Perception. Document Analysis Systems 2002: 295-306
last updated on 2024-10-30 21:34 CET by the dblp team
all metadata released as open data under CC0 1.0 license