Publications


2025

Lluis Gomez (2025).
“Measuring Text-Image Retrieval Fairness with Synthetic Data”
Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval.

Lluis Gomez (2025).
“Over the Top-1: Uncertainty-Aware Cross-Modal Retrieval with CLIP”
Proceedings of the 41st Conference on Uncertainty in Artificial Intelligence (UAI).

Sonia Ruiz and Lluis Gomez (2025).
“Charting Pathways: An Intersectional Impact Assessment for Vision and Language Foundation Models”
Women, Technology, and Power - Unmasking (and dealing with) digital disparities in the times of the platforms. Link

Mustapha El Aichouni and Lluis Gomez and Lei Kang (2025).
“Mitigating Distribution Bias in Multimodal Datasets via Clustering-Based Curation”
Iberian Conference on Pattern Recognition and Image Analysis (IbPRIA).

Net, Francesc and Folia, Marc and Casals, Pep and Bagdanov, Andrew D. and Gómez, Lluis (2025).
“EUFCC-340K: A faceted hierarchical dataset for metadata annotation in GLAM collections”
Multimedia Tools and Applications. Link

Kang, Lei and Fu, Xuanshuo and Gomez, Lluis and Fornés, Alicia and Valveny, Ernest and Karatzas, Dimosthenis (2025).
“Preserving Privacy Without Compromising Accuracy: Machine Unlearning for Handwritten Text Recognition”
arXiv preprint arXiv:2504.08616. Link


2024

Net, Francesc and Gomez, Lluis (2024).
“EUFCC-CIR: A Composed Image Retrieval Dataset for GLAM Collections”
Computer Vision – ECCV 2024 Workshops. Link

Net, Francesc and Hernández, Núria and Molina, Adriá and Gómez, Lluis (2024).
“A Transformer-Based Object-Centric Approach for Date Estimation of Historical Photographs”
European Conference on Information Retrieval. Link

Kang, Lei and Souibgui, Mohamed Ali and Yang, Fei and Gomez, Lluis and Valveny, Ernest and Karatzas, Dimosthenis (2024).
“Machine Unlearning for Document Classification”
Int. Conference on Document Analysis and Recognition - ICDAR 2024. Link

Kang, Lei and Yang, Fei and Wang, Kai and Souibgui, Mohamed Ali and Gomez, Lluis and Fornés, Alicia and Valveny, Ernest and Karatzas, Dimosthenis (2024).
“GRIF-DM: Generation of Rich Impression Fonts using Diffusion Models”
European Conference on Artificial Intelligence 2024.


2023

Nguyen, Khanh and Biten, Ali Furkan and Mafla, Andres and Gomez, Lluis and Karatzas, Dimosthenis (2023).
“Show, Interpret and Tell: Entity-Aware Contextualised Image Captioning in Wikipedia”
Proceedings of the AAAI Conference on Artificial Intelligence. Link

Souibgui, Mohamed Ali and Biswas, Sanket and Mafla, Andres and Biten, Ali Furkan and Fornés, Alicia and Kessentini, Yousri and Lladós, Josep and Gomez, Lluis and Karatzas, Dimosthenis (2023).
“Text-DIAE: A Self-Supervised Degradation Invariant Autoencoder for Text Recognition and Document Enhancement”
Proceedings of the AAAI Conference on Artificial Intelligence. Link

Vivoli, Emanuele and Biten, Ali Furkan and Mafla, Andres and Karatzas, Dimosthenis and Gomez, Lluis (2023).
“MUST-VQA: MUltilingual Scene-Text VQA”
European Conference on Computer Vision – ECCV 2022 Workshops. Link

Biten, Ali Furkan and Rubèn Tito and Lluis Gomez and Ernest Valveny and Dimosthenis Karatzas (2023).
“OCR-IDL: OCR Annotations for Industry Document Library Dataset”
European Conference on Computer Vision – ECCV 2022 Workshops, Proceedings.

Net, Francesc and Folia, Marc and Casals, Pep and Gómez, Lluis (2023).
“Transductive Learning for Near-Duplicate Image Detection in Scanned Photo Collections”
Int. Conference on Document Analysis and Recognition - ICDAR 2023. Link


2022

Adrià Molina and Lluis Gomez and Ramos Terrades, Oriol and Josep Lladós (2022).
“A Generic Image Retrieval Method for Date Estimation of Historical Document Collections”
Document Analysis Systems - 15th IAPR International Workshop, DAS 2022, Proceedings. Link

Brugués i Pujolràs, Josep and Gómez i Bigordà, Lluis and Dimosthenis Karatzas (2022).
“A Multilingual Approach to Scene Text Visual Question Answering”
Document Analysis Systems - 15th IAPR International Workshop, DAS 2022, Proceedings. Link

Biten, Ali Furkan and Andres Mafla and Lluis Gomez and Dimosthenis Karatzas (2022).
“Is An Image Worth Five Sentences? A New Look into Semantics for Image-Text Matching”
Proceedings - 2022 IEEE/CVF Winter Conference on Applications of Computer Vision, WACV 2022. Link

Biten, Ali Furkan and Lluis Gomez and Dimosthenis Karatzas (2022).
“Let there be a clock on the beach: Reducing Object Hallucination in Image Captioning”
Proceedings - 2022 IEEE/CVF Winter Conference on Applications of Computer Vision, WACV 2022. Link

Souibgui, Mohamed Ali and Biten, Ali Furkan and Sounak Dey and Alicia Fornes and Yousri Kessentini and Lluis Gomez and Dimosthenis Karatzas and Josep Llados (2022).
“One-shot Compositional Data Generation for Low Resource Handwritten Text Recognition”
Proceedings - 2022 IEEE/CVF Winter Conference on Applications of Computer Vision, WACV 2022. Link


2021

Lluís Gómez and Biten, Ali Furkan and Tito, Rubèn Pérez and Andrés Mafla and Mar\c cal Rusi~nol and Ernest Valveny and Dimosthenis Karatzas (2021).
“Multimodal grid features and cell pointers for scene text visual question answering”
Pattern Recognition Letters. Link

Minesh Mathew and Lluis Gomez and Dimosthenis Karatzas and Jawahar, C. V. (2021).
“Asking questions on handwritten document collections”
International journal on document analysis and recognition. Link

Andrés Mafla and Rubèn Tito and Sounak Dey and Lluis Gomez and Marçal Rusiñol and Ernest Valveny and Dimosthenis Karatzas (2021).
“Real-time Lexicon-free Scene Text Retrieval”
Pattern Recognition. Link

Andres Mafla and Rezende, Rafael S. and Lluis Gomez and Diane Larlus and Dimosthenis Karatzas (2021).
“StacMR: Scene-text aware cross-modal retrieval”
IEEE/CVF Winter Conference on Applications of Computer Vision. Link

Molina, Adrià and Riba, Pau and Gomez, Lluis and Ramos-Terrades, Oriol and Lladós, Josep (2021).
“Date Estimation in the Wild of Scanned Historical Photos: An Image Retrieval Approach”
Document Analysis and Recognition – ICDAR 2021. Link

Pau Riba and Adrià Molina and Lluis Gomez and Oriol Ramos-Terrades and Josep Lladós (2021).
“Learning to Rank Words: Optimizing Ranking Metrics for Word Spotting”
Int. Conference on Document Analysis and Recognition – ICDAR 2021.


2020

Sangeeth Reddy and Minesh Mathew and Lluis Gomez and Marcal Rusinol and Dimosthenis Karatzas and Jawahar, C. V. (2020).
“RoadText-1K: Text Detection Recognition Dataset for Driving Videos”
2020 IEEE International Conference on Robotics and Automation, ICRA 2020.

Raul Gomez and Jaume Gibert and Lluis Gomez and Dimosthenis Karatzas (2020).
“Exploring hate speech detection in multimodal publications”
Proceedings - 2020 IEEE Winter Conference on Applications of Computer Vision, WACV 2020.

Andres Mafla and Sounak Dey and Biten, Ali Furkan and Lluis Gomez and Dimosthenis Karatzas (2020).
“Fine-grained image classification and retrieval by combining visual and locally pooled textual features”
Proceedings - 2020 IEEE Winter Conference on Applications of Computer Vision, WACV 2020.

Raul Gomez and Jaume Gibert and Lluis Gomez and Dimosthenis Karatzas (2020).
“Location Sensitive Image Retrieval and Tagging”
Computer Vision – ECCV 2020 - 16th European Conference, Proceedings.

Klara Janouskova and Jiri Matas and Lluis Gomez and Dimosthenis Karatzas (2020).
“Text recognition - Real world data and where to find them”
International Conference on Pattern Recognition (ICPR).


2019

Biten, Ali Furkan and Ruben Tito and Andres Mafla and Lluis Gomez and Marcal Rusinol and Jawahar, C. V. and Ernest Valveny and DImosthenis Karatzas (2019).
“Scene text visual question answering”
Proceedings - 2019 International Conference on Computer Vision, ICCV 2019.

Furkan Biten, Ali and Ruben Tito and Andres Mafla and Lluis Gomez and Marcal Rusinol and Minesh Mathew and Jawahar, C. V. and Ernest Valveny and Dimosthenis Karatzas (2019).
“ICDAR 2019 competition on scene text visual question answering”
Proceedings - 15th IAPR International Conference on Document Analysis and Recognition, ICDAR 2019.

Raul Gomez and Biten, Ali Furkan and Lluis Gomez and Jaume Gibert and Dimosthenis Karatzas and Marcal Rusinol (2019).
“Selective style transfer for text”
Proceedings - 15th IAPR International Conference on Document Analysis and Recognition, ICDAR 2019.

Biten, Ali Furkan and Lluis Gomez and Marcal Rusinol and DImosthenis Karatzas (2019).
“Good news, everyone! context driven entity-aware captioning for news images”
Proceedings - 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2019.

Bazazian, D. and Gómez, R. and Nicolaou, A. and Gómez, L. and Karatzas, D. and Bagdanov, A.D. (2019).
“FAST: Facilitated and Accurate Scene Text Proposals through FCN Guided Pruning”
Pattern Recognition Letters.

Biten, Ali Furkan and Gomez, Lluis and Rusinol, Marcal and Karatzas, Dimosthenis (2019).
“Good News, Everyone! Context Driven Entity-Aware Captioning for News Images”
The IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

Gomez, Raul and Gomez, Lluis and Gibert, Jaume and Karatzas, Dimosthenis (2019).
“Learning from #Barcelona Instagram Data What Locals and Tourists Post About Its Neighbourhoods”
Computer Vision – ECCV 2018 Workshops. Link

Gomez, Raul and Gomez, Lluis and Gibert, Jaume and Karatzas, Dimosthenis (2019).
“Learning to Learn from Web Data Through Deep Semantic Embeddings”
Computer Vision – ECCV 2018 Workshops. Link

Biten, Ali Furkan and Tito, Ruben and Mafla, Andres and Gomez, Lluis and Rusinol, Marcal and Valveny, Ernest and Jawahar, C.V. and Karatzas, Dimosthenis (2019).
“Scene Text Visual Question Answering”
The IEEE International Conference on Computer Vision (ICCV).

Gomez, Raul and Gomez, Lluis and Gibert, Jaume and Karatzas, Dimosthenis (2019).
“Self-Supervised Learning from Web Data for Multimodal Retrieval”
Multimodal Scene Understanding. Link

Yash Patel and Lluis Gomez and Mar\ccal Rusi~nol and Dimosthenis Karatzas and C.V. Jawahar (2019).
“Self-Supervised Visual Representations for Cross-Modal Retrieval”
Proceedings of the 2019 on International Conference on Multimedia Retrieval - {ICMR} {\textquotesingle}19. Link


2018

Luis Gomez and Marcal Rusinol and Dimosthenis Karatzas (2018).
“Cutting Sayre\textquotesingles Knot: Reading Scene Text without Segmentation. Application to Utility Meters”
2018 13th {IAPR} International Workshop on Document Analysis Systems ({DAS}). Link

Dimosthenis Karatzas and Luis Gomez and Anguelos Nicolaou and Marcal Rusinol (2018).
“The Robust Reading Competition Annotation and Evaluation Platform”
2018 13th {IAPR} International Workshop on Document Analysis Systems ({DAS}). Link

Gómez, Lluís and Mafla, Andrés and Rusiñol, Marçal and Karatzas, Dimosthenis (2018).
“Single Shot Scene Text Retrieval”
Computer Vision – ECCV 2018. Link


2017

Raul Gomez and Baoguang Shi and Lluis Gomez and Lukas Numann and Andreas Veit and Jiri Matas and Serge Belongie and Dismosthenis Karatzas (2017).
“ICDAR2017 Robust Reading Challenge on COCO-Text”
2017 14th {IAPR} International Conference on Document Analysis and Recognition ({ICDAR}). Link

Masakazu Iwamura and Naoyuki Morimoto and Keishi Tainaka and Dena Bazazian and Lluis Gomez and Dimosthenis Karatzas (2017).
“ICDAR2017 Robust Reading Challenge on Omnidirectional Video”
2017 14th {IAPR} International Conference on Document Analysis and Recognition ({ICDAR}). Link

Lluis Gomez and Marcal Rusinol and Dimosthenis Karatzas (2017).
“LSDE: Levenshtein Space Deep Embedding for Query-by-String Word Spotting”
2017 14th {IAPR} International Conference on Document Analysis and Recognition ({ICDAR}). Link

Lluis Gomez and Dimosthenis Karatzas (2017).
“TextProposals: A text-specific selective search algorithm for word spotting in the wild”
Pattern Recognition. Link

Lluis Gomez and Yash Patel and Marcal Rusinol and Dimosthenis Karatzas and C. V. Jawahar (2017).
“Self-Supervised Learning of Visual Features through Embedding Images into Text Topic Spaces”
2017 {IEEE} Conference on Computer Vision and Pattern Recognition ({CVPR}). Link

Gomez, L. and Nicolaou, A. and Karatzas, D. (2017).
“Improving patch-based scene text script identification with ensembles of conjoined networks”
Pattern Recognition.


2016

Lluis Gomez (2016).
“Exploiting similarity hierarchies for multi-script scene text understanding”
PhD Thesis, UAB.

Lluis Gomez and Dimosthenis Karatzas (2016).
“A Fine-Grained Approach to Scene Text Script Identification”
2016 12th {IAPR} Workshop on Document Analysis Systems ({DAS}). Link

Anguelos Nicolaou and Andrew D. Bagdanov and Lluis Gomez and Dimosthenis Karatzas (2016).
“Visual Script and Language Identification”
2016 12th {IAPR} Workshop on Document Analysis Systems ({DAS}). Link

Gomez, L. and Karatzas, D. (2016).
“A fast hierarchical method for multi-script and arbitrary oriented scene text extraction”
International Journal on Document Analysis and Recognition.

Patel, Y. and Gomez, L. and Rusi~nol, M. and Karatzas, D. (2016).
“Dynamic lexicon generation for natural scene images”
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics).


2015

Suman K. Ghosh and Lluis Gomez and Dimosthenis Karatzas and Ernest Valveny (2015).
“Efficient indexing for Query By String text retrieval”
2015 13th International Conference on Document Analysis and Recognition ({ICDAR}). Link

Lluis Gomez and Dimosthenis Karatzas (2015).
“Object proposals for text extraction in the wild”
2015 13th International Conference on Document Analysis and Recognition ({ICDAR}). Link

Karatzas, D. and Gomez-Bigorda, L. and Nicolaou, A. and Ghosh, S. and Bagdanov, A. and Iwamura, M. and Matas, J. and Neumann, L. and Chandrasekhar, V.R. and Lu, S. and Shafait, F. and Uchida, S. and Valveny, E. (2015).
“ICDAR 2015 competition on Robust Reading”
Proceedings of the International Conference on Document Analysis and Recognition, ICDAR.

Lluis Gomez and Dimosthenis Karatzas (2015).
“Scene Text Recognition: No Country for Old~Men?”
Computer Vision - {ACCV} 2014 Workshops. Link


2014

Karatzas, D. and Robles, S. and Gomez, L. (2014).
“An on-line platform for ground truthing and performance evaluation of text extraction systems”
Proceedings - 11th IAPR International Workshop on Document Analysis Systems, DAS 2014.

Gomez, L. and Karatzas, D. (2014).
“MSER-based real-time text detection and tracking”
Proceedings - International Conference on Pattern Recognition.


2013

Dimosthenis Karatzas and Faisal Shafait and Seiichi Uchida and Masakazu Iwamura and Lluis Gomez i Bigorda and Sergi Robles Mestre and Joan Mas and David Fernandez Mota and Jon Almazan Almazan and Lluis Pere de las Heras (2013).
“ICDAR 2013 Robust Reading Competition”
2013 12th International Conference on Document Analysis and Recognition. Link

Lluis Gomez and Dimosthenis Karatzas (2013).
“Multi-script Text Extraction from Natural Scenes”
2013 12th International Conference on Document Analysis and Recognition. Link


2012

Lluis Gomez (2012).
“Web semántica y gestión de archivos: una introducción”
Gestión de la innovación y nuevas estrategias de investigación y difusión del fondo documental artístico.