no code implementations • 11 Apr 2024 • Soumyabrata Chaudhuri, Saumik Bhattacharya
These spatial features then undergo intermediate temporal modeling facilitated by the Mamba block before progressing to the encoder section, which comprises vanilla upsampling Shift S-GCN blocks.
no code implementations • 18 Nov 2023 • Gargi Panda, Soumitra Kundu, Saumik Bhattacharya, Aurobinda Routray
Single image super-resolution (SISR) is an effective technique to improve the quality of low-resolution thermal images.
1 code implementation • 31 Oct 2023 • Vaibhav Khamankar, Sutanu Bera, Saumik Bhattacharya, Debashis Sen, Prabir Kumar Biswas
Style transfer-based data augmentation is an emerging technique that can be used to improve the generalizability of machine learning models for histopathological images.
1 code implementation • 2 Oct 2023 • Alloy Das, Sanket Biswas, Ayan Banerjee, Josep Lladós, Umapada Pal, Saumik Bhattacharya
The adaptation capability to a wide range of domains is crucial for scene text spotting models when deployed to real-world conditions.
no code implementations • 7 Aug 2023 • Soumyabrata Chaudhuri, Saumik Bhattacharya
However, the combination of pose, visual information, and text attributes has not been explored yet, though text and pose attributes independently have been proven to be effective in numerous computer vision tasks.
no code implementations • 5 Aug 2023 • Alloy Das, Prasun Roy, Saumik Bhattacharya, Subhankar Ghosh, Umapada Pal, Michael Blumenstein
However, most of the existing STE methods show inferior editing performance because of (1) complex image backgrounds, (2) various font styles, and (3) varying word lengths within the text.
1 code implementation • 2 Aug 2023 • Siladittya Manna, Soumitri Chattopadhyay, Rakesh Dey, Saumik Bhattacharya, Umapada Pal
In contemporary self-supervised contrastive algorithms like SimCLR, MoCo, etc., the task of balancing attraction between two semantically similar samples and repulsion between two samples of different classes is primarily affected by the presence of hard negative samples.
1 code implementation • 1 May 2023 • Subhajit Maity, Sanket Biswas, Siladittya Manna, Ayan Banerjee, Josep Lladós, Saumik Bhattacharya, Umapada Pal
Document layout analysis is a known problem to the documents research community and has been vastly explored yielding a multitude of solutions ranging from text mining, and recognition to graph-based representation, visual feature extraction, etc.
no code implementations • 24 Apr 2023 • Subhankar Ghosh, Saumik Bhattacharya, Prasun Roy, Umapada Pal, Michael Blumenstein
Handling various objects with different colors is a significant challenge for image colorization techniques.
no code implementations • 28 Feb 2023 • Prasun Roy, Saumik Bhattacharya, Subhankar Ghosh, Umapada Pal, Michael Blumenstein
The proposed strategy enables us to synthesize semantically coherent realistic persons that can blend into an existing scene without altering the global context.
no code implementations • 4 Aug 2022 • Subhankar Ghosh, Prasun Roy, Saumik Bhattacharya, Umapada Pal, Michael Blumenstein
Image colorization is a well-known problem in computer vision.
no code implementations • 24 Jul 2022 • Prasun Roy, Subhankar Ghosh, Saumik Bhattacharya, Umapada Pal, Michael Blumenstein
In computer vision, human pose synthesis and transfer deal with probabilistic image generation of a person in a previously unseen pose from an already available observation of that person.
no code implementations • 6 Jun 2022 • Prasun Roy, Subhankar Ghosh, Saumik Bhattacharya, Umapada Pal, Michael Blumenstein
Finally, the target image is generated from the refined skeleton using another generative network conditioned on a given image of the target person.
no code implementations • 26 Feb 2022 • Siladittya Manna, Soumitri Chattopadhyay, Saumik Bhattacharya, Umapada Pal
Writer independent offline signature verification is one of the most challenging tasks in pattern recognition as there is often a scarcity of training data.
1 code implementation • 14 Feb 2022 • Prasun Roy, Saumik Bhattacharya, Subhankar Ghosh, Umapada Pal
Pose transfer refers to the probabilistic image generation of a person with a previously unseen novel pose from another image of that person having a different pose.
1 code implementation • 25 Jan 2022 • Soumitri Chattopadhyay, Siladittya Manna, Saumik Bhattacharya, Umapada Pal
This results in robust discriminative learning of the embedding space.
no code implementations • 24 Nov 2021 • Siladittya Manna, Umapada Pal, Saumik Bhattacharya
After 200 epochs of pre-training with ResNet-18 as the backbone, the proposed model achieves an accuracy of 86. 2\%, 58. 18\%, 77. 49\%, and 30. 87\% on CIFAR-10, CIFAR-100, STL-10, and Tiny-ImageNet datasets, respectively, and surpasses the SOTA contrastive baseline by 1. 23\%, 3. 57\%, 2. 00\%, and 0. 33\%, respectively.
no code implementations • 17 Oct 2021 • Shikhar Mohan, Saumik Bhattacharya, Sayantari Ghosh
We propose Attention W-Net, a new U-Net based architecture for retinal vessel segmentation to address these problems.
no code implementations • NeurIPS Workshop ICBINB 2021 • Bhavya Vasudeva, Puneesh Deora, Saumik Bhattacharya, Umapada Pal, Sukalpa Chanda
Deep metric learning (ML) uses a carefully designed loss function to learn distance metrics for improving the discriminatory ability for tasks like clustering and retrieval.
1 code implementation • ICCV 2021 • Bhavya Vasudeva, Puneesh Deora, Saumik Bhattacharya, Umapada Pal, Sukalpa Chanda
Deep metric learning has been effectively used to learn distance metrics for different visual tasks like image retrieval, clustering, etc.
1 code implementation • 6 May 2021 • Dipayan Das, Saumik Bhattacharya, Umapada Pal, Sukalpa Chanda
Reservoir Computing (RC) offers a viable option to deploy AI algorithms on low-end embedded system platforms.
1 code implementation • 4 May 2021 • Rangan Das, Bikram Boote, Saumik Bhattacharya, Ujjwal Maulik
Recent research has focused on stacking multiple layers like in convolutional neural networks for the increased expressive power of graph convolution networks.
2 code implementations • 21 Apr 2021 • Siladittya Manna, Saumik Bhattacharya, Umapada Pal
The downstream task in our paper is a class imbalanced multi-label classification.
Ranked #2 on Multi-Label Classification on MRNet
1 code implementation • 23 Oct 2020 • Prasun Roy, Saumik Bhattacharya, Partha Pratim Roy, Umapada Pal
Sign language is a gesture-based symbolic communication medium among speech and hearing impaired people.
no code implementations • 27 Aug 2020 • Sayantari Ghosh, Saumik Bhattacharya
In this work, a probabilistic cellular automata based method has been employed to model the infection dynamics for a significant number of different countries.
1 code implementation • 15 Jul 2020 • Siladittya Manna, Saumik Bhattacharya, Umapada Pal
In this paper, we propose a self-supervised learning approach to learn transferable features from MR video clips by enforcing the model to learn anatomical features.
no code implementations • 24 Feb 2020 • Bhavya Vasudeva, Puneesh Deora, Saumik Bhattacharya, Pyari Mohan Pradhan
Although state-of-the-art deep learning based methods have been able to obtain fast, high-quality reconstruction of CS-MR images, their main drawback is that they treat complex-valued MRI data as real-valued entities.
1 code implementation • 14 Oct 2019 • Puneesh Deora, Bhavya Vasudeva, Saumik Bhattacharya, Pyari Mohan Pradhan
Compressive sensing magnetic resonance imaging (CS-MRI) accelerates the acquisition of MR images by breaking the Nyquist sampling limit.
1 code implementation • CVPR 2020 • Prasun Roy, Saumik Bhattacharya, Subhankar Ghosh, Umapada Pal
In this paper, we propose a method to modify text in an image at character-level.
2 code implementations • 26 Jul 2018 • Prasun Roy, Subhankar Ghosh, Saumik Bhattacharya, Umapada Pal
Deep convolutional neural networks (CNN) have massively influenced recent advances in large-scale image classification.