Search Results for author: Mohamed Ali Souibgui

Found 11 papers, 6 papers with code

Privacy-Aware Document Visual Question Answering

no code implementations15 Dec 2023 Rubèn Tito, Khanh Nguyen, Marlon Tobaben, Raouf Kerkouche, Mohamed Ali Souibgui, Kangsoo Jung, Lei Kang, Ernest Valveny, Antti Honkela, Mario Fritz, Dimosthenis Karatzas

We employ a federated learning scheme, that reflects the real-life distribution of documents in different businesses, and we explore the use case where the ID of the invoice issuer is the sensitive information to be protected.

document understanding Federated Learning +3

CSSL-MHTR: Continual Self-Supervised Learning for Scalable Multi-script Handwritten Text Recognition

no code implementations16 Mar 2023 Marwa Dhiaf, Mohamed Ali Souibgui, Kai Wang, Yuyang Liu, Yousri Kessentini, Alicia Fornés, Ahmed Cheikh Rouhou

In this paper, we explore the potential of continual self-supervised learning to alleviate the catastrophic forgetting problem in handwritten text recognition, as an example of sequence recognition.

Handwritten Text Recognition Self-Supervised Learning

Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement

1 code implementation9 Mar 2022 Mohamed Ali Souibgui, Sanket Biswas, Andres Mafla, Ali Furkan Biten, Alicia Fornés, Yousri Kessentini, Josep Lladós, Lluis Gomez, Dimosthenis Karatzas

In this paper, we propose a Text-Degradation Invariant Auto Encoder (Text-DIAE), a self-supervised model designed to tackle two tasks, text recognition (handwritten or scene-text) and document image enhancement.

Document Enhancement Scene Text Recognition

Few Shots Are All You Need: A Progressive Few Shot Learning Approach for Low Resource Handwritten Text Recognition

1 code implementation21 Jul 2021 Mohamed Ali Souibgui, Alicia Fornés, Yousri Kessentini, Beáta Megyesi

Since this retraining would require annotation of thousands of handwritten symbols together with their bounding boxes, we propose to avoid such human effort through an unsupervised progressive learning approach that automatically assigns pseudo-labels to the non-annotated data.

Few-Shot Learning Handwriting Recognition +1

Enhance to Read Better: A Multi-Task Adversarial Network for Handwritten Document Image Enhancement

1 code implementation26 May 2021 Sana Khamekhem Jemni, Mohamed Ali Souibgui, Yousri Kessentini, Alicia Fornés

Unlike the most well-known document binarization methods, which try to improve the visual quality of the degraded document, the proposed architecture integrates a handwritten text recognizer that promotes the generated document image to be more readable.

Binarization Handwritten Text Recognition +2

One-shot Compositional Data Generation for Low Resource Handwritten Text Recognition

no code implementations11 May 2021 Mohamed Ali Souibgui, Ali Furkan Biten, Sounak Dey, Alicia Fornés, Yousri Kessentini, Lluis Gomez, Dimosthenis Karatzas, Josep Lladós

Low resource Handwritten Text Recognition (HTR) is a hard problem due to the scarce annotated data and the very limited linguistic information (dictionaries and language models).

Handwritten Text Recognition HTR

DE-GAN: A Conditional Generative Adversarial Network for Document Enhancement

4 code implementations17 Oct 2020 Mohamed Ali Souibgui, Yousri Kessentini

Documents often exhibit various forms of degradation, which make it hard to be read and substantially deteriorate the performance of an OCR system.

Binarization Deblurring +3

Cannot find the paper you are looking for? You can Submit a new open access paper.