Search Results for author: Abhishek Aich

Found 13 papers, 4 papers with code

Efficient Transformer Encoders for Mask2Former-style models

no code implementations23 Apr 2024 Manyi Yao, Abhishek Aich, Yumin Suh, Amit Roy-Chowdhury, Christian Shelton, Manmohan Chandraker

The third step is to use the aforementioned derived dataset to train a gating network that predicts the number of encoder layers to be used, conditioned on the input image.

Computational Efficiency Image Segmentation +3

Progressive Token Length Scaling in Transformer Encoders for Efficient Universal Segmentation

no code implementations23 Apr 2024 Abhishek Aich, Yumin Suh, Samuel Schulter, Manmohan Chandraker

With efficiency being a high priority for scaling such models, we observed that the state-of-the-art method Mask2Former uses ~50% of its compute only on the transformer encoder.

Universal Segmentation

Efficient Controllable Multi-Task Architectures

no code implementations ICCV 2023 Abhishek Aich, Samuel Schulter, Amit K. Roy-Chowdhury, Manmohan Chandraker, Yumin Suh

Further, we present a simple but effective search algorithm that translates user constraints to runtime width configurations of both the shared encoder and task decoders, for sampling the sub-architectures.

Knowledge Distillation

Cross-Domain Video Anomaly Detection without Target Domain Adaptation

no code implementations14 Dec 2022 Abhishek Aich, Kuan-Chuan Peng, Amit K. Roy-Chowdhury

Most cross-domain unsupervised Video Anomaly Detection (VAD) works assume that at least few task-relevant target domain training data are available for adaptation from the source to the target domain.

Anomaly Detection Domain Adaptation +1

Leveraging Local Patch Differences in Multi-Object Scenes for Generative Adversarial Attacks

no code implementations20 Sep 2022 Abhishek Aich, Shasha Li, Chengyu Song, M. Salman Asif, Srikanth V. Krishnamurthy, Amit K. Roy-Chowdhury

Our goal is to design an attack strategy that can learn from such natural scenes by leveraging the local patch differences that occur inherently in such images (e. g. difference between the local patch on the object `person' and the object `bike' in a traffic scene).

Object

GAMA: Generative Adversarial Multi-Object Scene Attacks

no code implementations20 Sep 2022 Abhishek Aich, Calvin-Khang Ta, Akash Gupta, Chengyu Song, Srikanth V. Krishnamurthy, M. Salman Asif, Amit K. Roy-Chowdhury

Using the joint image-text features to train the generator, we show that GAMA can craft potent transferable perturbations in order to fool victim classifiers in various attack settings.

Language Modelling Object

Poisson2Sparse: Self-Supervised Poisson Denoising From a Single Image

1 code implementation4 Jun 2022 Calvin-Khang Ta, Abhishek Aich, Akash Gupta, Amit K. Roy-Chowdhury

In this work, we explore a sparsity and dictionary learning-based approach and present a novel self-supervised learning method for single-image denoising where the noise is approximated as a Poisson process, requiring no clean ground-truth data.

Dictionary Learning Image Denoising +3

Deep Quantized Representation for Enhanced Reconstruction

1 code implementation29 Jul 2021 Akash Gupta, Abhishek Aich, Kevin Rodriguez, G. Venugopala Reddy, Amit K. Roy-Chowdhury

In this paper, we propose a data-driven Deep Quantized Latent Representation (DQLR) methodology for high-quality image reconstruction in the Shoot Apical Meristem (SAM) of Arabidopsis thaliana.

Image Reconstruction

Spatio-Temporal Representation Factorization for Video-based Person Re-Identification

no code implementations ICCV 2021 Abhishek Aich, Meng Zheng, Srikrishna Karanam, Terrence Chen, Amit K. Roy-Chowdhury, Ziyan Wu

To alleviate these problems, we propose Spatio-Temporal Representation Factorization (STRF), a flexible new computational unit that can be used in conjunction with most existing 3D convolutional neural network architectures for re-ID.

Video-Based Person Re-Identification

Elastic Weight Consolidation (EWC): Nuts and Bolts

no code implementations10 May 2021 Abhishek Aich

In this report, we present a theoretical support of the continual learning method \textbf{Elastic Weight Consolidation}, introduced in paper titled `Overcoming catastrophic forgetting in neural networks'.

Continual Learning

ALANET: Adaptive Latent Attention Network forJoint Video Deblurring and Interpolation

no code implementations31 Aug 2020 Akash Gupta, Abhishek Aich, Amit K. Roy-Chowdhury

Different from these works, we address a more realistic problem of high frame-rate sharp video synthesis with no prior assumption that input is always blurry.

Deblurring

Non-Adversarial Video Synthesis with Learned Priors

1 code implementation CVPR 2020 Abhishek Aich, Akash Gupta, Rameswar Panda, Rakib Hyder, M. Salman Asif, Amit K. Roy-Chowdhury

Different from these methods, we focus on the problem of generating videos from latent noise vectors, without any reference input frames.

Cannot find the paper you are looking for? You can Submit a new open access paper.