Search Results for author: Sanket Biswas

Found 21 papers, 12 papers with code

SketchGPT: Autoregressive Modeling for Sketch Generation and Recognition

no code implementations6 May 2024 Adarsh Tiwari, Sanket Biswas, Josep Lladós

We present SketchGPT, a flexible framework that employs a sequence-to-sequence autoregressive model for sketch generation, and completion, and an interpretation case study for sketch recognition.

GeoContrastNet: Contrastive Key-Value Edge Learning for Language-Agnostic Document Understanding

no code implementations6 May 2024 Nil Biescas, Carlos Boned, Josep Lladós, Sanket Biswas

This paper presents GeoContrastNet, a language-agnostic framework to structured document understanding (DU) by integrating a contrastive learning objective with graph attention networks (GATs), emphasizing the significant role of geometric features.

GraphKD: Exploring Knowledge Distillation Towards Document Object Detection with Structured Graph Creation

1 code implementation17 Feb 2024 Ayan Banerjee, Sanket Biswas, Josep Lladós, Umapada Pal

Object detection in documents is a key step to automate the structural elements identification process in a digital or scanned document through understanding the hierarchical structure and relationships between different elements.

Knowledge Distillation object-detection +1

The Common Optical Music Recognition Evaluation Framework

no code implementations20 Dec 2023 Pau Torras, Sanket Biswas, Alicia Fornés

The quality of Optical Music Recognition (OMR) systems is a rather difficult magnitude to measure.

Diving into the Depths of Spotting Text in Multi-Domain Noisy Scenes

no code implementations1 Oct 2023 Alloy Das, Sanket Biswas, Umapada Pal, Josep Lladós

When used in a real-world noisy environment, the capacity to generalize to multiple domains is essential for any autonomous scene text spotting system.

Super-Resolution Text Spotting

Beyond Document Page Classification: Design, Datasets, and Challenges

1 code implementation24 Aug 2023 Jordy Van Landeghem, Sanket Biswas, Matthew B. Blaschko, Marie-Francine Moens

This paper highlights the need to bring document classification benchmarking closer to real-world applications, both in the nature of data tested ($X$: multi-channel, multi-paged, multi-industry; $Y$: class distributions and label set variety) and in classification tasks considered ($f$: multi-page document, page stream, and document bundle classification, ...).

Benchmarking Classification +1

SwinDocSegmenter: An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation

1 code implementation8 May 2023 Ayan Banerjee, Sanket Biswas, Josep Lladós, Umapada Pal

Instance-level segmentation of documents consists in assigning a class-aware and instance-aware label to each pixel of the image.

Decoder Instance Segmentation +2

SelfDocSeg: A Self-Supervised vision-based Approach towards Document Segmentation

1 code implementation1 May 2023 Subhajit Maity, Sanket Biswas, Siladittya Manna, Ayan Banerjee, Josep Lladós, Saumik Bhattacharya, Umapada Pal

Document layout analysis is a known problem to the documents research community and has been vastly explored yielding a multitude of solutions ranging from text mining, and recognition to graph-based representation, visual feature extraction, etc.

Document Layout Analysis object-detection +1

Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement

1 code implementation9 Mar 2022 Mohamed Ali Souibgui, Sanket Biswas, Andres Mafla, Ali Furkan Biten, Alicia Fornés, Yousri Kessentini, Josep Lladós, Lluis Gomez, Dimosthenis Karatzas

In this paper, we propose a Text-Degradation Invariant Auto Encoder (Text-DIAE), a self-supervised model designed to tackle two tasks, text recognition (handwritten or scene-text) and document image enhancement.

Document Enhancement Scene Text Recognition

Graph-based Deep Generative Modelling for Document Layout Generation

no code implementations9 Jul 2021 Sanket Biswas, Pau Riba, Josep Lladós, Umapada Pal

One of the major prerequisites for any deep learning approach is the availability of large-scale training data.

DocSynth: A Layout Guided Approach for Controllable Document Image Synthesis

1 code implementation6 Jul 2021 Sanket Biswas, Pau Riba, Josep Lladós, Umapada Pal

The results highlight that our model can successfully generate realistic and diverse document images with multiple objects.

Document Layout Analysis Image Generation

Ehrhart-Equivalence, Equidecomposability, and Unimodular Equivalence of Integral Polytopes

no code implementations21 Jan 2021 Fiona Abney-McPeek, Sanket Biswas, Senjuti Dutta, Yongyuan Huang, Deyuan Li, Nancy Xu

In this paper, we establish a relationship between Ehrhart-equivalence and other forms of equivalence: the $\operatorname{GL}_n(\mathbb{Z})$-equidecomposability and unimodular equivalence of two integral $n$-polytopes in $\mathbb{R}^n$.

Combinatorics

Fault Area Detection in Leaf Diseases using k-means Clustering

no code implementations24 Oct 2018 Subhajit Maity, Sujan Sarkar, Avinaba Tapadar, Ayan Dutta, Sanket Biswas, Sayon Nayek, Pritam Saha

With increasing population the crisis of food is getting bigger day by day. In this time of crisis, the leaf disease of crops is the biggest problem in the food industry. In this paper, we have addressed that problem and proposed an efficient method to detect leaf disease. Leaf diseases can be detected from sample images of the leaf with the help of image processing and segmentation. Using k-means clustering and Otsu's method the faulty region in a leaf is detected which helps to determine proper course of action to be taken. Further the ratio of normal and faulty region if calculated would be able to predict if the leaf can be cured at all.

Clustering

A Statistical Approach to Adult Census Income Level Prediction

1 code implementation23 Oct 2018 Navoneel Chakrabarty, Sanket Biswas

The prominent inequality of wealth and income is a huge concern especially in the United States.

valid

Cannot find the paper you are looking for? You can Submit a new open access paper.