Search Results for author: Sanket Biswas

Found 21 papers, 12 papers with code

SketchGPT: Autoregressive Modeling for Sketch Generation and Recognition

no code implementations • 6 May 2024 • Adarsh Tiwari, Sanket Biswas, Josep Lladós

We present SketchGPT, a flexible framework that employs a sequence-to-sequence autoregressive model for sketch generation, and completion, and an interpretation case study for sketch recognition.

Paper
Add Code

GeoContrastNet: Contrastive Key-Value Edge Learning for Language-Agnostic Document Understanding

no code implementations • 6 May 2024 • Nil Biescas, Carlos Boned, Josep Lladós, Sanket Biswas

This paper presents GeoContrastNet, a language-agnostic framework to structured document understanding (DU) by integrating a contrastive learning objective with graph attention networks (GATs), emphasizing the significant role of geometric features.

Paper
Add Code

GraphKD: Exploring Knowledge Distillation Towards Document Object Detection with Structured Graph Creation

1 code implementation • 17 Feb 2024 • Ayan Banerjee, Sanket Biswas, Josep Lladós, Umapada Pal

Object detection in documents is a key step to automate the structural elements identification process in a digital or scanned document through understanding the hierarchical structure and relationships between different elements.

Knowledge Distillation object-detection +1

Paper
Code

Synthetic dataset of ID and Travel Document

1 code implementation • 3 Jan 2024 • Carlos Boned, Maxime Talarmain, Nabil Ghanmi, Guillaume Chiron, Sanket Biswas, Ahmad Montaser Awal, Oriol Ramos Terrades

This paper presents a new synthetic dataset of ID and travel documents, called SIDTD.

Paper
Code

The Common Optical Music Recognition Evaluation Framework

no code implementations • 20 Dec 2023 • Pau Torras, Sanket Biswas, Alicia Fornés

The quality of Optical Music Recognition (OMR) systems is a rather difficult magnitude to measure.

Paper
Add Code

Harnessing the Power of Multi-Lingual Datasets for Pre-training: Towards Enhancing Text Spotting Performance

1 code implementation • 2 Oct 2023 • Alloy Das, Sanket Biswas, Ayan Banerjee, Josep Lladós, Umapada Pal, Saumik Bhattacharya

The adaptation capability to a wide range of domains is crucial for scene text spotting models when deployed to real-world conditions.

Scene Text Detection Text Detection +1

Paper
Code

Diving into the Depths of Spotting Text in Multi-Domain Noisy Scenes

no code implementations • 1 Oct 2023 • Alloy Das, Sanket Biswas, Umapada Pal, Josep Lladós

When used in a real-world noisy environment, the capacity to generalize to multiple domains is essential for any autonomous scene text spotting system.

Super-Resolution Text Spotting

Paper
Add Code

TransferDoc: A Self-Supervised Transferable Document Representation Learning Model Unifying Vision and Language

no code implementations • 11 Sep 2023 • Souhail Bakkali, Sanket Biswas, Zuheng Ming, Mickael Coustaty, Marçal Rusiñol, Oriol Ramos Terrades, Josep Lladós

The field of visual document understanding has witnessed a rapid growth in emerging challenges and powerful multi-modal strategies.

Ranked #19 on Document Image Classification on RVL-CDIP

Document Image Classification document understanding +1

Paper
Add Code

Beyond Document Page Classification: Design, Datasets, and Challenges

1 code implementation • 24 Aug 2023 • Jordy Van Landeghem, Sanket Biswas, Matthew B. Blaschko, Marie-Francine Moens

This paper highlights the need to bring document classification benchmarking closer to real-world applications, both in the nature of data tested ($X$: multi-channel, multi-paged, multi-industry; $Y$: class distributions and label set variety) and in classification tasks considered ($f$: multi-page document, page stream, and document bundle classification, ...).

Benchmarking Classification +1

Paper
Code

SwinDocSegmenter: An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation

1 code implementation • 8 May 2023 • Ayan Banerjee, Sanket Biswas, Josep Lladós, Umapada Pal

Instance-level segmentation of documents consists in assigning a class-aware and instance-aware label to each pixel of the image.

Decoder Instance Segmentation +2

Paper
Code

SelfDocSeg: A Self-Supervised vision-based Approach towards Document Segmentation

1 code implementation • 1 May 2023 • Subhajit Maity, Sanket Biswas, Siladittya Manna, Ayan Banerjee, Josep Lladós, Saumik Bhattacharya, Umapada Pal

Document layout analysis is a known problem to the documents research community and has been vastly explored yielding a multitude of solutions ranging from text mining, and recognition to graph-based representation, visual feature extraction, etc.

Document Layout Analysis object-detection +1

Paper
Code

A Few Shot Multi-Representation Approach for N-gram Spotting in Historical Manuscripts

no code implementations • 21 Sep 2022 • Giuseppe De Gregorio, Sanket Biswas, Mohamed Ali Souibgui, Asma Bensalah, Josep Lladós, Alicia Fornés, Angelo Marcelli

Despite recent advances in automatic text recognition, the performance remains moderate when it comes to historical manuscripts.

Few-Shot Learning Handwritten Text Recognition +3

Paper
Add Code

Doc2Graph: a Task Agnostic Document Understanding Framework based on Graph Neural Networks

1 code implementation • 23 Aug 2022 • Andrea Gemelli, Sanket Biswas, Enrico Civitelli, Josep Lladós, Simone Marinai

Geometric Deep Learning has recently attracted significant interest in a wide range of machine learning fields, including document analysis.

Ranked #5 on Entity Linking on FUNSD

Document Layout Analysis document understanding +4

106

Paper
Code

Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement

1 code implementation • 9 Mar 2022 • Mohamed Ali Souibgui, Sanket Biswas, Andres Mafla, Ali Furkan Biten, Alicia Fornés, Yousri Kessentini, Josep Lladós, Lluis Gomez, Dimosthenis Karatzas

In this paper, we propose a Text-Degradation Invariant Auto Encoder (Text-DIAE), a self-supervised model designed to tackle two tasks, text recognition (handwritten or scene-text) and document image enhancement.

Document Enhancement Scene Text Recognition

Paper
Code

DocSegTr: An Instance-Level End-to-End Document Image Segmentation Transformer

1 code implementation • 27 Jan 2022 • Sanket Biswas, Ayan Banerjee, Josep Lladós, Umapada Pal

has emerged as an interesting problem for the document analysis and understanding community.

Decision Making Document Layout Analysis +4

Paper
Code

DocEnTr: An End-to-End Document Image Enhancement Transformer

1 code implementation • 25 Jan 2022 • Mohamed Ali Souibgui, Sanket Biswas, Sana Khamekhem Jemni, Yousri Kessentini, Alicia Fornés, Josep Lladós, Umapada Pal

Document images can be affected by many degradation scenarios, which cause recognition and processing difficulties.

Ranked #1 on Binarization on H-DIBCO 2011

Binarization Decoder +1

130

Paper
Code

Graph-based Deep Generative Modelling for Document Layout Generation

no code implementations • 9 Jul 2021 • Sanket Biswas, Pau Riba, Josep Lladós, Umapada Pal

One of the major prerequisites for any deep learning approach is the availability of large-scale training data.

Paper
Add Code

DocSynth: A Layout Guided Approach for Controllable Document Image Synthesis

1 code implementation • 6 Jul 2021 • Sanket Biswas, Pau Riba, Josep Lladós, Umapada Pal

The results highlight that our model can successfully generate realistic and diverse document images with multiple objects.

Document Layout Analysis Image Generation

Paper
Code

Ehrhart-Equivalence, Equidecomposability, and Unimodular Equivalence of Integral Polytopes

no code implementations • 21 Jan 2021 • Fiona Abney-McPeek, Sanket Biswas, Senjuti Dutta, Yongyuan Huang, Deyuan Li, Nancy Xu

In this paper, we establish a relationship between Ehrhart-equivalence and other forms of equivalence: the $\operatorname{GL}_n(\mathbb{Z})$-equidecomposability and unimodular equivalence of two integral $n$-polytopes in $\mathbb{R}^n$.

Combinatorics

Paper
Add Code

Fault Area Detection in Leaf Diseases using k-means Clustering

no code implementations • 24 Oct 2018 • Subhajit Maity, Sujan Sarkar, Avinaba Tapadar, Ayan Dutta, Sanket Biswas, Sayon Nayek, Pritam Saha

With increasing population the crisis of food is getting bigger day by day. In this time of crisis, the leaf disease of crops is the biggest problem in the food industry. In this paper, we have addressed that problem and proposed an efficient method to detect leaf disease. Leaf diseases can be detected from sample images of the leaf with the help of image processing and segmentation. Using k-means clustering and Otsu's method the faulty region in a leaf is detected which helps to determine proper course of action to be taken. Further the ratio of normal and faulty region if calculated would be able to predict if the leaf can be cured at all.

Clustering

Paper
Add Code

A Statistical Approach to Adult Census Income Level Prediction

1 code implementation • 23 Oct 2018 • Navoneel Chakrabarty, Sanket Biswas

The prominent inequality of wealth and income is a huge concern especially in the United States.

valid

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.