Search Results for author: Saumik Bhattacharya

Found 30 papers, 14 papers with code

Simba: Mamba augmented U-ShiftGCN for Skeletal Action Recognition in Videos

no code implementations11 Apr 2024 Soumyabrata Chaudhuri, Saumik Bhattacharya

These spatial features then undergo intermediate temporal modeling facilitated by the Mamba block before progressing to the encoder section, which comprises vanilla upsampling Shift S-GCN blocks.

Action Recognition In Videos

LATIS: Lambda Abstraction-based Thermal Image Super-resolution

no code implementations18 Nov 2023 Gargi Panda, Soumitra Kundu, Saumik Bhattacharya, Aurobinda Routray

Single image super-resolution (SISR) is an effective technique to improve the quality of low-resolution thermal images.

Image Super-Resolution

Histopathological Image Analysis with Style-Augmented Feature Domain Mixing for Improved Generalization

1 code implementation31 Oct 2023 Vaibhav Khamankar, Sutanu Bera, Saumik Bhattacharya, Debashis Sen, Prabir Kumar Biswas

Style transfer-based data augmentation is an emerging technique that can be used to improve the generalizability of machine learning models for histopathological images.

Data Augmentation Domain Generalization +2

ViLP: Knowledge Exploration using Vision, Language, and Pose Embeddings for Video Action Recognition

no code implementations7 Aug 2023 Soumyabrata Chaudhuri, Saumik Bhattacharya

However, the combination of pose, visual information, and text attributes has not been explored yet, though text and pose attributes independently have been proven to be effective in numerous computer vision tasks.

Action Recognition Language Modelling +1

FAST: Font-Agnostic Scene Text Editing

no code implementations5 Aug 2023 Alloy Das, Prasun Roy, Saumik Bhattacharya, Subhankar Ghosh, Umapada Pal, Michael Blumenstein

However, most of the existing STE methods show inferior editing performance because of (1) complex image backgrounds, (2) various font styles, and (3) varying word lengths within the text.

Scene Text Editing Style Transfer +1

DySTreSS: Dynamically Scaled Temperature in Self-Supervised Contrastive Learning

no code implementations2 Aug 2023 Siladittya Manna, Soumitri Chattopadhyay, Rakesh Dey, Saumik Bhattacharya, Umapada Pal

We propose a cosine similarity-dependent temperature scaling function to effectively optimize the distribution of the samples in the feature space.

Contrastive Learning

SelfDocSeg: A Self-Supervised vision-based Approach towards Document Segmentation

1 code implementation1 May 2023 Subhajit Maity, Sanket Biswas, Siladittya Manna, Ayan Banerjee, Josep Lladós, Saumik Bhattacharya, Umapada Pal

Document layout analysis is a known problem to the documents research community and has been vastly explored yielding a multitude of solutions ranging from text mining, and recognition to graph-based representation, visual feature extraction, etc.

Document Layout Analysis object-detection +1

Global Context-Aware Person Image Generation

no code implementations28 Feb 2023 Prasun Roy, Saumik Bhattacharya, Subhankar Ghosh, Umapada Pal, Michael Blumenstein

The proposed strategy enables us to synthesize semantically coherent realistic persons that can blend into an existing scene without altering the global context.

Image Generation

TIPS: Text-Induced Pose Synthesis

no code implementations24 Jul 2022 Prasun Roy, Subhankar Ghosh, Saumik Bhattacharya, Umapada Pal, Michael Blumenstein

In computer vision, human pose synthesis and transfer deal with probabilistic image generation of a person in a previously unseen pose from an already available observation of that person.

Descriptive Pose Transfer

Scene Aware Person Image Generation through Global Contextual Conditioning

no code implementations6 Jun 2022 Prasun Roy, Subhankar Ghosh, Saumik Bhattacharya, Umapada Pal, Michael Blumenstein

Finally, the target image is generated from the refined skeleton using another generative network conditioned on a given image of the target person.

Generative Adversarial Network Image Generation

SWIS: Self-Supervised Representation Learning For Writer Independent Offline Signature Verification

no code implementations26 Feb 2022 Siladittya Manna, Soumitri Chattopadhyay, Saumik Bhattacharya, Umapada Pal

Writer independent offline signature verification is one of the most challenging tasks in pattern recognition as there is often a scarcity of training data.

Representation Learning Self-Supervised Learning

Multi-scale Attention Guided Pose Transfer

1 code implementation14 Feb 2022 Prasun Roy, Saumik Bhattacharya, Subhankar Ghosh, Umapada Pal

Pose transfer refers to the probabilistic image generation of a person with a previously unseen novel pose from another image of that person having a different pose.

Pose Transfer

MIO : Mutual Information Optimization using Self-Supervised Binary Contrastive Learning

no code implementations24 Nov 2021 Siladittya Manna, Umapada Pal, Saumik Bhattacharya

After 200 epochs of pre-training with ResNet-18 as the backbone, the proposed model achieves an accuracy of 86. 2\%, 58. 18\%, 77. 49\%, and 30. 87\% on CIFAR-10, CIFAR-100, STL-10, and Tiny-ImageNet datasets, respectively, and surpasses the SOTA contrastive baseline by 1. 23\%, 3. 57\%, 2. 00\%, and 0. 33\%, respectively.

Binary Classification Contrastive Learning

Attention W-Net: Improved Skip Connections for better Representations

no code implementations17 Oct 2021 Shikhar Mohan, Saumik Bhattacharya, Sayantari Ghosh

We propose Attention W-Net, a new U-Net based architecture for retinal vessel segmentation to address these problems.

Image Augmentation Retinal Vessel Segmentation +1

GradML: A Gradient-based Loss for Deep Metric Learning

no code implementations NeurIPS Workshop ICBINB 2021 Bhavya Vasudeva, Puneesh Deora, Saumik Bhattacharya, Umapada Pal, Sukalpa Chanda

Deep metric learning (ML) uses a carefully designed loss function to learn distance metrics for improving the discriminatory ability for tasks like clustering and retrieval.

Metric Learning Retrieval

LoOp: Looking for Optimal Hard Negative Embeddings for Deep Metric Learning

1 code implementation ICCV 2021 Bhavya Vasudeva, Puneesh Deora, Saumik Bhattacharya, Umapada Pal, Sukalpa Chanda

Deep metric learning has been effectively used to learn distance metrics for different visual tasks like image retrieval, clustering, etc.

Image Retrieval Metric Learning +1

PLSM: A Parallelized Liquid State Machine for Unintentional Action Detection

1 code implementation6 May 2021 Dipayan Das, Saumik Bhattacharya, Umapada Pal, Sukalpa Chanda

Reservoir Computing (RC) offers a viable option to deploy AI algorithms on low-end embedded system platforms.

Action Detection

Multipath Graph Convolutional Neural Networks

1 code implementation4 May 2021 Rangan Das, Bikram Boote, Saumik Bhattacharya, Ujjwal Maulik

Recent research has focused on stacking multiple layers like in convolutional neural networks for the increased expressive power of graph convolution networks.

Node Property Prediction Property Prediction +1

A Data-driven Understanding of COVID-19 Dynamics Using Sequential Genetic Algorithm Based Probabilistic Cellular Automata

no code implementations27 Aug 2020 Sayantari Ghosh, Saumik Bhattacharya

In this work, a probabilistic cellular automata based method has been employed to model the infection dynamics for a significant number of different countries.

Self-Supervised Representation Learning for Detection of ACL Tear Injury in Knee MR Videos

1 code implementation15 Jul 2020 Siladittya Manna, Saumik Bhattacharya, Umapada Pal

In this paper, we propose a self-supervised learning approach to learn transferable features from MR video clips by enforcing the model to learn anatomical features.

Representation Learning Self-Supervised Learning

Co-VeGAN: Complex-Valued Generative Adversarial Network for Compressive Sensing MR Image Reconstruction

no code implementations24 Feb 2020 Bhavya Vasudeva, Puneesh Deora, Saumik Bhattacharya, Pyari Mohan Pradhan

Although state-of-the-art deep learning based methods have been able to obtain fast, high-quality reconstruction of CS-MR images, their main drawback is that they treat complex-valued MRI data as real-valued entities.

Compressive Sensing Generative Adversarial Network +1

Effects of Degradations on Deep Neural Network Architectures

2 code implementations26 Jul 2018 Prasun Roy, Subhankar Ghosh, Saumik Bhattacharya, Umapada Pal

Deep convolutional neural networks (CNN) have massively influenced recent advances in large-scale image classification.

General Classification Image Classification

Cannot find the paper you are looking for? You can Submit a new open access paper.