Search Results for author: Vishal M. Patel

Found 178 papers, 86 papers with code

Multiple Class Novelty Detection Under Data Distribution Shift

no code implementations • ECCV 2020 • Poojan Oza, Hien V. Nguyen, Vishal M. Patel

To this end, we consider the problem of multiple class novelty detection under dataset distribution shift to improve the novelty detection performance.

Domain Adaptation Novelty Detection +1

Paper
Add Code

Utilizing Patch-level Category Activation Patterns for Multiple Class Novelty Detection

no code implementations • ECCV 2020 • Poojan Oza, Vishal M. Patel

For any recognition system, the ability to identify novel class samples during inference is an important aspect of the system’s robustness.

Novelty Detection

Paper
Add Code

Diffscaler: Enhancing the Generative Prowess of Diffusion Transformers

no code implementations • 15 Apr 2024 • Nithin Gopalakrishnan Nair, Jeya Maria Jose Valanarasu, Vishal M. Patel

As these parameters are independent, a single diffusion model with these task-specific parameters can be used to perform multiple tasks simultaneously.

Image Generation Unconditional Image Generation

Paper
Add Code

Bigger is not Always Better: Scaling Properties of Latent Diffusion Models

no code implementations • 1 Apr 2024 • Kangfu Mei, Zhengzhong Tu, Mauricio Delbracio, Hossein Talebi, Vishal M. Patel, Peyman Milanfar

We study the scaling properties of latent diffusion models (LDMs) with an emphasis on their sampling efficiency.

Paper
Add Code

Frame by Familiar Frame: Understanding Replication in Video Diffusion Models

no code implementations • 28 Mar 2024 • Aimon Rahman, Malsha V. Perera, Vishal M. Patel

In our paper, we present a systematic investigation into the phenomenon of sample replication in video diffusion models.

Image Generation Video Generation

Paper
Add Code

View-decoupled Transformer for Person Re-identification under Aerial-ground Camera Network

2 code implementations • 21 Mar 2024 • Quan Zhang, Lei Wang, Vishal M. Patel, Xiaohua Xie, JianHuang Lai

Experiments on two datasets show that VDT is a feasible and effective solution for AGPReID, surpassing the previous method on mAP/Rank1 by up to 5. 0%/2. 7% on CARGO and 3. 7%/5. 2% on AG-ReID, keeping the same magnitude of computational complexity.

Person Re-Identification

131

Paper
Code

Leveraging Thermal Modality to Enhance Reconstruction in Low-Light Conditions

no code implementations • 21 Mar 2024 • Jiacong Xu, Mingqian Liao, K Ram Prabhakar, Vishal M. Patel

To address these issues, we present Thermal-NeRF, which takes thermal and visible raw images as inputs, considering the thermal camera is robust to the illumination variation and raw images preserve any possible clues in the dark, to accomplish visible and thermal view synthesis simultaneously.

3D Reconstruction Novel View Synthesis

Paper
Add Code

FaceXFormer: A Unified Transformer for Facial Analysis

1 code implementation • 19 Mar 2024 • Kartik Narayan, Vibashan VS, Rama Chellappa, Vishal M. Patel

Unlike these conventional methods, our FaceXformer leverages a transformer-based encoder-decoder architecture where each task is treated as a learnable token, enabling the integration of multiple tasks within a single framework.

Age and Gender Estimation Age Estimation +4

137

Paper
Code

Holo-Relighting: Controllable Volumetric Portrait Relighting from a Single Image

no code implementations • 14 Mar 2024 • Yiqun Mei, Yu Zeng, He Zhang, Zhixin Shu, Xuaner Zhang, Sai Bi, Jianming Zhang, HyunJoon Jung, Vishal M. Patel

At the core of portrait photography is the search for ideal lighting and viewpoint.

Paper
Add Code

Attention Prompt Tuning: Parameter-efficient Adaptation of Pre-trained Models for Spatiotemporal Modeling

1 code implementation • 11 Mar 2024 • Wele Gedara Chaminda Bandara, Vishal M. Patel

This approach greatly reduces the number of learnable parameters compared to full tuning.

Action Recognition

Paper
Code

Deployment Prior Injection for Run-time Calibratable Object Detection

no code implementations • 27 Feb 2024 • Mo Zhou, Yiding Yang, Haoxiang Li, Vishal M. Patel, Gang Hua

With a strong alignment between the training and test distributions, object relation as a context prior facilitates object detection.

Object object-detection +1

Paper
Add Code

MixedNUTS: Training-Free Accuracy-Robustness Balance via Nonlinearly Mixed Classifiers

1 code implementation • 3 Feb 2024 • Yatong Bai, Mo Zhou, Vishal M. Patel, Somayeh Sojoudi

Adversarial robustness often comes at the cost of degraded accuracy, impeding the real-life application of robust classification models.

Adversarial Robustness Robust classification

Paper
Code

Entropic Open-set Active Learning

1 code implementation • 21 Dec 2023 • Bardia Safaei, Vibashan VS, Celso M. de Melo, Vishal M. Patel

Active Learning (AL) aims to enhance the performance of deep models by selecting the most informative samples for annotation from a pool of unlabeled data.

Active Learning

Paper
Code

Guarding Barlow Twins Against Overfitting with Mixed Samples

1 code implementation • 4 Dec 2023 • Wele Gedara Chaminda Bandara, Celso M. de Melo, Vishal M. Patel

Self-supervised Learning (SSL) aims to learn transferable feature representations for downstream applications without relying on labeled data.

Ranked #1 on Self-Supervised Learning on STL-10

Contrastive Learning Self-Supervised Learning

Paper
Code

Latent Feature-Guided Diffusion Models for Shadow Removal

no code implementations • 4 Dec 2023 • Kangfu Mei, Luis Figueroa, Zhe Lin, Zhihong Ding, Scott Cohen, Vishal M. Patel

Recovering textures under shadows has remained a challenging problem due to the difficulty of inferring shadow-free scenes from shadow images.

Shadow Removal

Paper
Add Code

CoDi: Conditional Diffusion Distillation for Higher-Fidelity and Faster Image Generation

1 code implementation • 2 Oct 2023 • Kangfu Mei, Mauricio Delbracio, Hossein Talebi, Zhengzhong Tu, Vishal M. Patel, Peyman Milanfar

Our conditional-task learning and distillation approach outperforms previous distillation methods, achieving a new state-of-the-art in producing high-quality images with very few steps (e. g., 1-4) across multiple tasks, including super-resolution, text-guided image editing, and depth-to-image generation.

Image Enhancement Super-Resolution +1

Paper
Code

Steered Diffusion: A Generalized Framework for Plug-and-Play Conditional Image Synthesis

no code implementations • ICCV 2023 • Nithin Gopalakrishnan Nair, Anoop Cherian, Suhas Lohit, Ye Wang, Toshiaki Koike-Akino, Vishal M. Patel, Tim K. Marks

To this end, and capitalizing on the powerful fine-grained generative control offered by the recent diffusion-based generative models, we introduce Steered Diffusion, a generalized framework for photorealistic zero-shot conditional image generation using a diffusion model trained for unconditional generation.

Colorization Conditional Image Generation +2

Paper
Add Code

Towards Federated Learning Under Resource Constraints via Layer-wise Training and Depth Dropout

no code implementations • 11 Sep 2023 • Pengfei Guo, Warren Richard Morningstar, Raviteja Vemulapalli, Karan Singhal, Vishal M. Patel, Philip Andrew Mansfield

To mitigate this issue and facilitate training of large models on edge devices, we introduce a simple yet effective strategy, Federated Layer-wise Learning, to simultaneously reduce per-client memory, computation, and communication costs.

Federated Learning Representation Learning +1

Paper
Add Code

AdaptiveSAM: Towards Efficient Tuning of SAM for Surgical Scene Segmentation

1 code implementation • 7 Aug 2023 • Jay N. Paranjape, Nithin Gopalakrishnan Nair, Shameema Sikder, S. Swaroop Vedula, Vishal M. Patel

However, SAM does not generalize well to the medical domain as is without utilizing a large amount of compute resources for fine-tuning and using task-specific prompts.

Scene Segmentation Segmentation

Paper
Code

Cross-Dataset Adaptation for Instrument Classification in Cataract Surgery Videos

1 code implementation • 31 Jul 2023 • Jay N. Paranjape, Shameema Sikder, Vishal M. Patel, S. Swaroop Vedula

In this paper, we highlight this domain shift in the commonly performed cataract surgery and propose a novel end-to-end Unsupervised Domain Adaptation (UDA) method called the Barlow Adaptor that addresses the problem of distribution shift without requiring any labels from another domain.

Unsupervised Domain Adaptation

Paper
Code

Disruptive Autoencoders: Leveraging Low-level features for 3D Medical Image Pre-training

no code implementations • 31 Jul 2023 • Jeya Maria Jose Valanarasu, Yucheng Tang, Dong Yang, Ziyue Xu, Can Zhao, Wenqi Li, Vishal M. Patel, Bennett Landman, Daguang Xu, Yufan He, Vishwesh Nath

We curate a large-scale dataset to enable pre-training of 3D medical radiology images (MRI and CT).

Organ Segmentation Representation Learning

Paper
Add Code

GLSFormer : Gated - Long, Short Sequence Transformer for Step Recognition in Surgical Videos

1 code implementation • 20 Jul 2023 • Nisarg A. Shah, Shameema Sikder, S. Swaroop Vedula, Vishal M. Patel

These results validate the suitability of our proposed approach for automated surgical step recognition.

Decision Making

Paper
Code

Self-Supervised MRI Reconstruction with Unrolled Diffusion Models

1 code implementation • 29 Jun 2023 • Yilmaz Korkmaz, Tolga Cukur, Vishal M. Patel

Magnetic Resonance Imaging (MRI) produces excellent soft tissue contrast, albeit it is an inherently slow imaging modality.

MRI Reconstruction

Paper
Code

Securing Deep Generative Models with Universal Adversarial Signature

1 code implementation • 25 May 2023 • Yu Zeng, Mo Zhou, Yuan Xue, Vishal M. Patel

Prior research attempted to mitigate these threats by detecting generated images, but the varying traces left by different generative models make it challenging to create a universal detector capable of generalizing to new, unseen generative models.

Paper
Code

T1: Scaling Diffusion Probabilistic Fields to High-Resolution on Unified Visual Modalities

no code implementations • 24 May 2023 • Kangfu Mei, Mo Zhou, Vishal M. Patel

The model can be scaled to generate high-resolution data while unifying multiple modalities.

Paper
Add Code

Analyzing Bias in Diffusion-based Face Generation Models

no code implementations • 10 May 2023 • Malsha V. Perera, Vishal M. Patel

Diffusion models are becoming increasingly popular in synthetic data generation and image editing applications.

Attribute Face Generation +2

Paper
Add Code

Mask-free OVIS: Open-Vocabulary Instance Segmentation without Manual Mask Annotations

no code implementations • CVPR 2023 • Vibashan VS, Ning Yu, Chen Xing, Can Qin, Mingfei Gao, Juan Carlos Niebles, Vishal M. Patel, ran Xu

In summary, an OV method learns task-specific information using strong supervision from base annotations and novel category information using weak supervision from image-captions pairs.

Image Captioning Instance Segmentation +2

Paper
Add Code

Spatio-Temporal Pixel-Level Contrastive Learning-based Source-Free Domain Adaptation for Video Semantic Segmentation

1 code implementation • CVPR 2023 • Shao-Yuan Lo, Poojan Oza, Sumanth Chennupati, Alejandro Galindo, Vishal M. Patel

Unsupervised Domain Adaptation (UDA) of semantic segmentation transfers labeled source knowledge to an unlabeled target domain by relying on accessing both the source and target data.

Contrastive Learning Semantic Segmentation +3

Paper
Code

ReBotNet: Fast Real-time Video Enhancement

no code implementations • 23 Mar 2023 • Jeya Maria Jose Valanarasu, Rahul Garg, Andeep Toor, Xin Tong, Weijuan Xi, Andreas Lugmayr, Vishal M. Patel, Anne Menini

The first branch learns spatio-temporal features by tokenizing the input frames along the spatial and temporal dimensions using a ConvNext-based encoder and processing these abstract tokens using a bottleneck mixer.

Video Enhancement Video Restoration

Paper
Add Code

LightPainter: Interactive Portrait Relighting with Freehand Scribble

no code implementations • CVPR 2023 • Yiqun Mei, He Zhang, Xuaner Zhang, Jianming Zhang, Zhixin Shu, Yilin Wang, Zijun Wei, Shi Yan, HyunJoon Jung, Vishal M. Patel

Recent portrait relighting methods have achieved realistic results of portrait lighting effects given a desired lighting representation such as an environment map.

Paper
Add Code

$CrowdDiff$: Multi-hypothesis Crowd Density Estimation using Diffusion Models

1 code implementation • 22 Mar 2023 • Yasiru Ranasinghe, Nithin Gopalakrishnan Nair, Wele Gedara Chaminda Bandara, Vishal M. Patel

Furthermore, as the intermediate time steps of the diffusion process are noisy, we incorporate a regression branch for direct crowd estimation only during training to improve the feature learning.

Contour Detection Crowd Counting +1

Paper
Code

CLIP goes 3D: Leveraging Prompt Tuning for Language Grounded 3D Recognition

1 code implementation • 20 Mar 2023 • Deepti Hegde, Jeya Maria Jose Valanarasu, Vishal M. Patel

Attempting to train the visual and text encoder to account for this shift results in catastrophic forgetting and a notable decrease in performance.

Retrieval Scene Understanding

202

Paper
Code

Deep Metric Learning for Unsupervised Remote Sensing Change Detection

1 code implementation • 16 Mar 2023 • Wele Gedara Chaminda Bandara, Vishal M. Patel

This loss is motivated by the principle of metric learning where we simultaneously maximize the distance between change pair-wise pixels while minimizing the distance between no-change pair-wise pixels in bi-temporal image domain and their deep feature domain.

Change Detection Disaster Response +2

Paper
Code

Deep Learning for Cross-Domain Few-Shot Visual Recognition: A Survey

no code implementations • 15 Mar 2023 • Huali Xu, Shuaifeng Zhi, Shuzhou Sun, Vishal M. Patel, Li Liu

Deep learning has been highly successful in computer vision with large amounts of labeled data, but struggles with limited labeled training data.

cross-domain few-shot learning

Paper
Add Code

Bi-Noising Diffusion: Towards Conditional Diffusion Models with Generative Restoration Priors

no code implementations • 14 Dec 2022 • Kangfu Mei, Nithin Gopalakrishnan Nair, Vishal M. Patel

The improvements obtained by our method suggest that the priors can be incorporated as a general plugin for improving conditional diffusion models.

Colorization Rain Removal +1

Paper
Add Code

Unite and Conquer: Plug & Play Multi-Modal Synthesis using Diffusion Models

1 code implementation • CVPR 2023 • Nithin Gopalakrishnan Nair, Wele Gedara Chaminda Bandara, Vishal M. Patel

We also introduce a novel reliability parameter that allows using different off-the-shelf diffusion models trained across various datasets during sampling time alone to guide it to the desired outcome satisfying multiple constraints.

Ranked #1 on Face Sketch Synthesis on Multi-Modal CelebA-HQ

Face Generation Face Sketch Synthesis +4

Paper
Code

VIDM: Video Implicit Diffusion Models

1 code implementation • 1 Dec 2022 • Kangfu Mei, Vishal M. Patel

Diffusion models have emerged as a powerful generative method for synthesizing high-quality and diverse set of images.

Ranked #11 on Video Generation on UCF-101

Generative Adversarial Network Video Generation

Paper
Code

SceneComposer: Any-Level Semantic Image Synthesis

no code implementations • CVPR 2023 • Yu Zeng, Zhe Lin, Jianming Zhang, Qing Liu, John Collomosse, Jason Kuen, Vishal M. Patel

We propose a new framework for conditional image synthesis from semantic layouts of any precision levels, ranging from pure text to a 2D semantic canvas with precise shapes.

Image Generation

Paper
Add Code

AdaMAE: Adaptive Masking for Efficient Spatiotemporal Learning with Masked Autoencoders

2 code implementations • CVPR 2023 • Wele Gedara Chaminda Bandara, Naman Patel, Ali Gholami, Mehdi Nikkhah, Motilal Agrawal, Vishal M. Patel

Our adaptive masking strategy samples visible tokens based on the semantic context using an auxiliary sampling network.

Ranked #1 on Action Classification on Something-Something V2

Action Classification Representation Learning

Paper
Code

Open-Set Automatic Target Recognition

1 code implementation • 10 Nov 2022 • Bardia Safaei, Vibashan VS, Celso M. de Melo, Shuowen Hu, Vishal M. Patel

Automatic Target Recognition (ATR) is a category of computer vision algorithms which attempts to recognize targets on data obtained from different sensors.

open-set classification Open Set Learning

Paper
Code

NBD-GAP: Non-Blind Image Deblurring Without Clean Target Images

no code implementations • 20 Sep 2022 • Nithin Gopalakrishnan Nair, Rajeev Yasarla, Vishal M. Patel

This results in a pair of images with colored noise.

Blind Image Deblurring Denoising +1

Paper
Add Code

T2V-DDPM: Thermal to Visible Face Translation using Denoising Diffusion Probabilistic Models

1 code implementation • 19 Sep 2022 • Nithin Gopalakrishnan Nair, Vishal M. Patel

In this paper, we propose a Denoising Diffusion Probabilistic Model (DDPM) based solution for T2V translation specifically for facial images.

Face Verification Person Recognition +2

Paper
Code

AT-DDPM: Restoring Faces degraded by Atmospheric Turbulence using Denoising Diffusion Probabilistic Models

1 code implementation • 24 Aug 2022 • Nithin Gopalakrishnan Nair, Kangfu Mei, Vishal M. Patel

In recent years, various deep learning-based single image atmospheric turbulence mitigation methods, including CNN-based and GAN inversion-based, have been proposed in the literature which attempt to remove the distortion in the image.

Image Restoration Image Super-Resolution +1

Paper
Code

Learning Feature Decomposition for Domain Adaptive Monocular Depth Estimation

no code implementations • 30 Jul 2022 • Shao-Yuan Lo, Wei Wang, Jim Thomas, Jingjing Zheng, Vishal M. Patel, Cheng-Hao Kuo

In this paper, we propose a novel UDA method for MDE, referred to as Learning Feature Decomposition for Adaptation (LFDA), which learns to decompose the feature space into content and style components.

Monocular Depth Estimation Unsupervised Domain Adaptation

Paper
Add Code

Deep Semantic Statistics Matching (D2SM) Denoising Network

1 code implementation • 19 Jul 2022 • Kangfu Mei, Vishal M. Patel, Rui Huang

The ultimate aim of image restoration like denoising is to find an exact correlation between the noisy and clear image domains.

Denoising Image Restoration +2

Paper
Code

Learning to restore images degraded by atmospheric turbulence using uncertainty

1 code implementation • 7 Jul 2022 • Rajeev Yasarla, Vishal M. Patel

Atmospheric turbulence can significantly degrade the quality of images acquired by long-range imaging systems by causing spatially and temporally random fluctuations in the index of refraction of the atmosphere.

Paper
Code

DDPM-CD: Denoising Diffusion Probabilistic Models as Feature Extractors for Change Detection

1 code implementation • 23 Jun 2022 • Wele Gedara Chaminda Bandara, Nithin Gopalakrishnan Nair, Vishal M. Patel

However, in this work, our focus is not on image synthesis but on utilizing it as a pre-trained feature extractor for the downstream application of change detection.

Ranked #1 on Change Detection on WHU-CD

Change Detection Decision Making +2

230

Paper
Code

SAR Despeckling using a Denoising Diffusion Probabilistic Model

1 code implementation • 9 Jun 2022 • Malsha V. Perera, Nithin Gopalakrishnan Nair, Wele Gedara Chaminda Bandara, Vishal M. Patel

The despeckled image is recovered by a reverse process which iteratively predicts the added noise using a noise predictor which is conditioned on the speckled image.

Change Detection Denoising

Paper
Code

SAR Despeckling Using Overcomplete Convolutional Networks

1 code implementation • 31 May 2022 • Malsha V. Perera, Wele Gedara Chaminda Bandara, Jeya Maria Jose Valanarasu, Vishal M. Patel

We show that the proposed network improves despeckling performance compared to recent despeckling methods on synthetic and real SAR images.

Paper
Code

On Trace of PGD-Like Adversarial Attacks

no code implementations • 19 May 2022 • Mo Zhou, Vishal M. Patel

Adversarial attacks pose safety and security concerns to deep learning applications, but their characteristics are under-explored.

Paper
Add Code

Deep-learning-enabled Brain Hemodynamic Mapping Using Resting-state fMRI

1 code implementation • 25 Apr 2022 • Xirui Hou, Pengfei Guo, Puyang Wang, Peiying Liu, Doris D. M. Lin, Hongli Fan, Yang Li, Zhiliang Wei, Zixuan Lin, Dengrong Jiang, Jin Jin, Catherine Kelly, Jay J. Pillai, Judy Huang, Marco C. Pinho, Binu P. Thomas, Babu G. Welch, Denise C. Park, Vishal M. Patel, Argye E. Hillis, Hanzhang Lu

Deep-learning resting-state vascular imaging has the potential to become a useful tool in clinical cerebrovascular imaging.

Management

Paper
Code

Unsupervised Restoration of Weather-affected Images using Deep Gaussian Process-based CycleGAN

no code implementations • 23 Apr 2022 • Rajeev Yasarla, Vishwanath A. Sindagi, Vishal M. Patel

Existing approaches for restoring weather-degraded images follow a fully-supervised paradigm and they require paired data for training.

Gaussian Processes

Paper
Add Code

A comparison of different atmospheric turbulence simulation methods for image restoration

no code implementations • 19 Apr 2022 • Nithin Gopalakrishnan Nair, Kangfu Mei, Vishal M. Patel

In this paper, we systematically evaluate the effectiveness of various turbulence simulation methods on image restoration.

Face Recognition Image Restoration

Paper
Add Code

Revisiting Consistency Regularization for Semi-supervised Change Detection in Remote Sensing Images

1 code implementation • 18 Apr 2022 • Wele Gedara Chaminda Bandara, Vishal M. Patel

The performance of existing deep supervised CD methods is attributed to the large amounts of annotated data used to train the networks.

Ranked #2 on Semi-supervised Change Detection on LEVIR-CD - 5% labeled data

Earth Observation Semi-supervised Change Detection

109

Paper
Code

Shape-guided Object Inpainting

no code implementations • 16 Apr 2022 • Yu Zeng, Zhe Lin, Vishal M. Patel

Therefore, we propose a new data preparation method and a novel Contextual Object Generator (CogNet) for the object inpainting task.

Image Inpainting Object

Paper
Add Code

Towards Online Domain Adaptive Object Detection

2 code implementations • 11 Apr 2022 • Vibashan VS, Poojan Oza, Vishal M. Patel

To the best of our knowledge, this is the first work to address online and offline adaptation settings for object detection.

Object object-detection +3

Paper
Code

Thermal to Visible Image Synthesis under Atmospheric Turbulence

no code implementations • 6 Apr 2022 • Kangfu Mei, Yiqun Mei, Vishal M. Patel

In this paper, we first investigate the problem with a turbulence simulation method on real-world thermal images.

Face Verification Image Generation +1

Paper
Add Code

Escaping Data Scarcity for High-Resolution Heterogeneous Face Hallucination

no code implementations • CVPR 2022 • Yiqun Mei, Pengfei Guo, Vishal M. Patel

In Heterogeneous Face Recognition (HFR), the objective is to match faces across two different domains such as visible and thermal.

Face Generation Face Hallucination +5

Paper
Add Code

Target and Task specific Source-Free Domain Adaptive Image Segmentation

1 code implementation • 29 Mar 2022 • Vibashan VS, Jeya Maria Jose Valanarasu, Vishal M. Patel

In task-specific adaptation, we exploit the enhanced pseudo-labels using a student-teacher framework to effectively learn segmentation on the target domain.

Denoising Image Segmentation +4

Paper
Code

Instance Relation Graph Guided Source-Free Domain Adaptive Object Detection

1 code implementation • CVPR 2023 • Vibashan VS, Poojan Oza, Vishal M. Patel

The Source-Free Domain Adaptation (SFDA) setting aims to alleviate these concerns by adapting a source-trained model for the target domain without requiring access to the source data.

Knowledge Distillation Object +6

Paper
Code

Interactive Portrait Harmonization

no code implementations • 15 Mar 2022 • Jeya Maria Jose Valanarasu, He Zhang, Jianming Zhang, Yilin Wang, Zhe Lin, Jose Echevarria, Yinglan Ma, Zijun Wei, Kalyan Sunkavalli, Vishal M. Patel

To enable flexible interaction between user and harmonization, we introduce interactive harmonization, a new setting where the harmonization is performed with respect to a selected \emph{region} in the reference image instead of the entire background.

Image Harmonization

Paper
Add Code

Auto-FedRL: Federated Hyperparameter Optimization for Multi-institutional Medical Image Segmentation

no code implementations • 12 Mar 2022 • Pengfei Guo, Dong Yang, Ali Hatamizadeh, An Xu, Ziyue Xu, Wenqi Li, Can Zhao, Daguang Xu, Stephanie Harmon, Evrim Turkbey, Baris Turkbey, Bradford Wood, Francesca Patella, Elvira Stellato, Gianpaolo Carrafiello, Vishal M. Patel, Holger R. Roth

Federated learning (FL) is a distributed machine learning technique that enables collaborative model training while avoiding explicit data sharing.

Federated Learning Hyperparameter Optimization +7

Paper
Add Code

On-the-Fly Test-time Adaptation for Medical Image Segmentation

1 code implementation • 10 Mar 2022 • Jeya Maria Jose Valanarasu, Pengfei Guo, Vibashan VS, Vishal M. Patel

During test-time, the model takes in just the new test image and generates a domain code to adapt the features of source model according to the test data.

Image Segmentation Medical Image Segmentation +2

Paper
Code

UNeXt: MLP-based Rapid Medical Image Segmentation Network

2 code implementations • 9 Mar 2022 • Jeya Maria Jose Valanarasu, Vishal M. Patel

Using tokenized MLPs in latent space reduces the number of parameters and computational complexity while being able to result in a better representation to help segmentation.

Ranked #3 on Medical Image Segmentation on ISIC 2018

Image Segmentation Medical Image Segmentation +2

771

Paper
Code

HyperTransformer: A Textural and Spectral Feature Fusion Transformer for Pansharpening

1 code implementation • CVPR 2022 • Wele Gedara Chaminda Bandara, Vishal M. Patel

Existing pansharpening approaches neglect using an attention mechanism to transfer HR texture features from PAN to LR-HSI features, resulting in spatial and spectral distortions.

Pansharpening Super-Resolution

116

Paper
Code

Enhancing Adversarial Robustness for Deep Metric Learning

2 code implementations • CVPR 2022 • Mo Zhou, Vishal M. Patel

Owing to security implications of adversarial vulnerability, adversarial robustness of deep metric learning models has to be improved.

Adversarial Robustness Metric Learning

Paper
Code

Exploring Adversarially Robust Training for Unsupervised Domain Adaptation

1 code implementation • 18 Feb 2022 • Shao-Yuan Lo, Vishal M. Patel

Adversarial Training (AT) has been considered to be the most successful adversarial defense approach.

Adversarial Defense Adversarial Robustness +1

Paper
Code

Open-set Adversarial Defense with Clean-Adversarial Mutual Learning

1 code implementation • 12 Feb 2022 • Rui Shao, Pramuditha Perera, Pong C. Yuen, Vishal M. Patel

This paper proposes an Open-Set Defense Network with Clean-Adversarial Mutual Learning (OSDN-CAML) as a solution to the OSAD problem.

Adversarial Defense Denoising +2

Paper
Code

ReconFormer: Accelerated MRI Reconstruction Using Recurrent Transformer

1 code implementation • 23 Jan 2022 • Pengfei Guo, Yiqun Mei, Jinyuan Zhou, Shanshan Jiang, Vishal M. Patel

Accelerating magnetic resonance image (MRI) reconstruction process is a challenging ill-posed inverse problem due to the excessive under-sampling operation in k-space.

Feature Correlation MRI Reconstruction

Paper
Code

Transformer-based SAR Image Despeckling

1 code implementation • 23 Jan 2022 • Malsha V. Perera, Wele Gedara Chaminda Bandara, Jeya Maria Jose Valanarasu, Vishal M. Patel

Synthetic Aperture Radar (SAR) images are usually degraded by a multiplicative noise known as speckle which makes processing and interpretation of SAR images difficult.

Sar Image Despeckling

Paper
Code

A Transformer-Based Siamese Network for Change Detection

3 code implementations • 4 Jan 2022 • Wele Gedara Chaminda Bandara, Vishal M. Patel

This paper presents a transformer-based Siamese network architecture (abbreviated by ChangeFormer) for Change Detection (CD) from a pair of co-registered remote sensing images.

Ranked #14 on Change Detection on LEVIR-CD

Change Detection

382

Paper
Code

LTT-GAN: Looking Through Turbulence by Inverting GANs

no code implementations • 4 Dec 2021 • Kangfu Mei, Vishal M. Patel

To mitigate the turbulence effect, in this paper, we propose the first turbulence mitigation method that makes use of visual priors encapsulated by a well-trained GAN.

Face Verification

Paper
Add Code

Attentive Prototypes for Source-free Unsupervised Domain Adaptive 3D Object Detection

1 code implementation • 30 Nov 2021 • Deepti Hegde, Vishal M. Patel

We demonstrate our approach on two recent object detectors and achieve results that out-perform the other domain adaptation works.

3D Object Detection object-detection +2

Paper
Code

SketchEdit: Mask-Free Local Image Manipulation with Partial Sketches

no code implementations • CVPR 2022 • Yu Zeng, Zhe Lin, Vishal M. Patel

Our model can be trained in a self-supervised fashion by learning the reconstruction of an image region from the style vector and sketch.

Image Manipulation

Paper
Add Code

TransWeather: Transformer-based Restoration of Images Degraded by Adverse Weather Conditions

1 code implementation • CVPR 2022 • Jeya Maria Jose Valanarasu, Rajeev Yasarla, Vishal M. Patel

We also introduce a transformer decoder with learnable weather type embeddings to adjust to the weather degradation at hand.

Ranked #1 on Single Image Deraining on Raindrop

Neural Architecture Search Single Image Dehazing +2

144

Paper
Code

Reference-based Magnetic Resonance Image Reconstruction Using Texture Transformer

no code implementations • 18 Nov 2021 • Pengfei Guo, Vishal M. Patel

Deep Learning (DL) based methods for magnetic resonance (MR) image reconstruction have been shown to produce superior performance in recent years.

MRI Reconstruction

Paper
Add Code

Federated Test-Time Adaptive Face Presentation Attack Detection with Dual-Phase Privacy Preservation

no code implementations • 25 Oct 2021 • Rui Shao, Bochao Zhang, Pong C. Yuen, Vishal M. Patel

The generalization ability of face presentation attack detection models to unseen attacks has become a key issue for real-world deployment, which can be improved when models are trained with face images from different input distributions and different types of spoof attacks.

Face Presentation Attack Detection Face Recognition +2

Paper
Add Code

Multimodal Learning using Optimal Transport for Sarcasm and Humor Detection

no code implementations • 21 Oct 2021 • Shraman Pramanick, Aniket Roy, Vishal M. Patel

Multimodal learning is an emerging yet challenging research area.

Humor Detection Sarcasm Detection

Paper
Add Code

Meta-UDA: Unsupervised Domain Adaptive Thermal Object Detection using Meta-Learning

no code implementations • 7 Oct 2021 • Vibashan VS, Domenick Poster, Suya You, Shuowen Hu, Vishal M. Patel

Though thermal cameras are widely used for military applications and increasingly for commercial applications, there is a lack of robust algorithms to robustly exploit the thermal imagery due to the limited availability of labeled thermal data.

Meta-Learning object-detection +2

Paper
Add Code

Fine-Context Shadow Detection using Shadow Removal

no code implementations • 20 Sep 2021 • Jeya Maria Jose Valanarasu, Vishal M. Patel

First, we propose a Fine Context-aware Shadow Detection Network (FCSD-Net), where we constraint the receptive field size and focus on low-level features to learn fine context features better.

Shadow Detection And Removal Shadow Removal

Paper
Add Code

SPIN Road Mapper: Extracting Roads from Aerial Images via Spatial and Interaction Space Graph Reasoning for Autonomous Driving

1 code implementation • 16 Sep 2021 • Wele Gedara Chaminda Bandara, Jeya Maria Jose Valanarasu, Vishal M. Patel

Using just convolution neural networks (ConvNets) for this problem is not effective as it is inefficient at capturing distant dependencies between road segments in the image which is essential to extract road connectivity.

Ranked #1 on Road Segmentation on DeepGlobe

Autonomous Driving Autonomous Navigation +1

Paper
Code

Adversarially Robust One-class Novelty Detection

1 code implementation • 25 Aug 2021 • Shao-Yuan Lo, Poojan Oza, Vishal M. Patel

To this end, we propose a defense strategy that manipulates the latent space of novelty detectors to improve the robustness against adversarial examples.

Adversarial Robustness Novelty Detection

Paper
Code

A Synthesis-Based Approach for Thermal-to-Visible Face Verification

no code implementations • 21 Aug 2021 • Neehar Peri, Joshua Gleason, Carlos D. Castillo, Thirimachos Bourlai, Vishal M. Patel, Rama Chellappa

Lastly, we show that our end-to-end thermal-to-visible face verification system provides strong performance on the MILAB-VTF(B) dataset.

Face Alignment Face Generation +1

Paper
Add Code

Image Fusion Transformer

1 code implementation • 19 Jul 2021 • Vibashan VS, Jeya Maria Jose Valanarasu, Poojan Oza, Vishal M. Patel

Furthermore, we show the effectiveness of the proposed ST fusion strategy with an ablation analysis.

102

Paper
Code

Heterogeneous Face Frontalization via Domain Agnostic Learning

no code implementations • 17 Jul 2021 • Xing Di, Shuowen Hu, Vishal M. Patel

We propose a domain agnostic learning-based generative adversarial network (DAL-GAN) which can synthesize frontal views in the visible domain from thermal faces with pose variations.

Face Generation Generative Adversarial Network

Paper
Add Code

Lidar Light Scattering Augmentation (LISA): Physics-based Simulation of Adverse Weather Conditions for 3D Object Detection

1 code implementation • 14 Jul 2021 • Velat Kilic, Deepti Hegde, Vishwanath Sindagi, A. Brinton Cooper, Mark A. Foster, Vishal M. Patel

Lidar-based object detectors are critical parts of the 3D perception pipeline in autonomous navigation systems such as self-driving cars.

3D Object Detection Autonomous Navigation +2

Paper
Code

Hyperspectral Pansharpening Based on Improved Deep Image Prior and Residual Reconstruction

1 code implementation • 6 Jul 2021 • Wele Gedara Chaminda Bandara, Jeya Maria Jose Valanarasu, Vishal M. Patel

To estimate the PAN image of the up-sampled HSI, we also propose a learnable spectral response function (SRF).

Ranked #1 on Image Super-Resolution on Chikusei Dataset

Image Super-Resolution Pansharpening

Paper
Code

Over-and-Under Complete Convolutional RNN for MRI Reconstruction

no code implementations • 16 Jun 2021 • Pengfei Guo, Jeya Maria Jose Valanarasu, Puyang Wang, Jinyuan Zhou, Shanshan Jiang, Vishal M. Patel

Reconstructing magnetic resonance (MR) images from undersampled data is a challenging problem due to various artifacts introduced by the under-sampling operation.

MRI Reconstruction

Paper
Add Code

Unsupervised Domain Adaptation of Object Detectors: A Survey

no code implementations • 27 May 2021 • Poojan Oza, Vishwanath A. Sindagi, Vibashan VS, Vishal M. Patel

Recent advances in deep learning have led to the development of accurate and efficient models for various computer vision applications such as classification, segmentation, and detection.

Autonomous Navigation Object +3

Paper
Add Code

Federated Generalized Face Presentation Attack Detection

no code implementations • 14 Apr 2021 • Rui Shao, Pramuditha Perera, Pong C. Yuen, Vishal M. Patel

A face presentation attack detection model with good generalization can be obtained when it is trained with face images from different input distributions and different types of spoof attacks.

Disentanglement Face Presentation Attack Detection +2

Paper
Add Code

Federated Learning-based Active Authentication on Mobile Devices

no code implementations • 14 Apr 2021 • Poojan Oza, Vishal M. Patel

Using FL/SL frameworks, we can alleviate the lack of negative data problem by training a user authentication model over multiple user data distributed across devices.

Federated Learning One-Class Classification

Paper
Add Code

Simultaneous Face Hallucination and Translation for Thermal to Visible Face Verification using Axial-GAN

1 code implementation • 13 Apr 2021 • Rakhil Immidisetti, Shuowen Hu, Vishal M. Patel

Existing thermal-to-visible face verification approaches expect the thermal and visible face images to be of similar resolution.

Face Hallucination Face Verification +3

Paper
Code

Multimodal Face Synthesis from Visual Attributes

1 code implementation • 9 Apr 2021 • Xing Di, Vishal M. Patel

Extensive experiments and comparisons with several state-of-the-art methods are performed to verify the effectiveness of the proposed attribute-based multimodal synthesis method.

Attribute Face Generation +1

Paper
Code

MeGA-CDA: Memory Guided Attention for Category-Aware Unsupervised Domain Adaptive Object Detection

no code implementations • CVPR 2021 • Vibashan VS, Vikram Gupta, Poojan Oza, Vishwanath A. Sindagi, Vishal M. Patel

Existing approaches for unsupervised domain adaptive object detection perform feature alignment via adversarial training.

Domain Adaptation object-detection +1

Paper
Add Code

Multi-institutional Collaborations for Improving Deep Learning-based Magnetic Resonance Image Reconstruction Using Federated Learning

1 code implementation • CVPR 2021 • Pengfei Guo, Puyang Wang, Jinyuan Zhou, Shanshan Jiang, Vishal M. Patel

However, the generalizability of models trained with the FL setting can still be suboptimal due to domain shift, which results from the data collected at multiple institutions with different sensors, disease types, and acquisition protocols, etc.

Federated Learning Image Reconstruction

Paper
Code

Medical Transformer: Gated Axial-Attention for Medical Image Segmentation

2 code implementations • 21 Feb 2021 • Jeya Maria Jose Valanarasu, Poojan Oza, Ilker Hacihaliloglu, Vishal M. Patel

The proposed Medical Transformer (MedT) is evaluated on three different medical image segmentation datasets and it is shown that it achieves better performance than the convolutional and other related transformer-based architectures.

Ranked #1 on Medical Image Segmentation on Brain US

Image Segmentation Medical Image Segmentation +2

771

Paper
Code

Error Diffusion Halftoning Against Adversarial Examples

1 code implementation • 23 Jan 2021 • Shao-Yuan Lo, Vishal M. Patel

In this paper, we propose a new image transformation defense based on error diffusion halftoning, and combine it with adversarial training to defend against adversarial examples.

Adversarial Robustness Quantization

Paper
Code

One-Class Classification: A Survey

no code implementations • 8 Jan 2021 • Pramuditha Perera, Poojan Oza, Vishal M. Patel

One-Class Classification (OCC) is a special case of multi-class classification, where data observed during training is from a single positive class.

Classification General Classification +2

Paper
Add Code

A Large-Scale, Time-Synchronized Visible and Thermal Face Dataset

no code implementations • 7 Jan 2021 • Domenick Poster, Matthew Thielke, Robert Nguyen, Srinivasan Rajaraman, Xing Di, Cedric Nimpa Fondje, Vishal M. Patel, Nathaniel J. Short, Benjamin S. Riggan, Nasser M. Nasrabadi, Shuowen Hu

Thermal face imagery, which captures the naturally emitted heat from the face, is limited in availability compared to face imagery in the visible spectrum.

Face Verification

Paper
Add Code

CR-Fill: Generative Image Inpainting With Auxiliary Contextual Reconstruction

1 code implementation • ICCV 2021 • Yu Zeng, Zhe Lin, Huchuan Lu, Vishal M. Patel

The auxiliary branch (i. e. CR loss) is required only during training, and only the inpainting generator is required during the inference.

Ranked #8 on Image Inpainting on Places2

Image Inpainting

219

Paper
Code

Overcomplete Representations Against Adversarial Videos

1 code implementation • 8 Dec 2020 • Shao-Yuan Lo, Jeya Maria Jose Valanarasu, Vishal M. Patel

Adversarial robustness of deep neural networks is an extensively studied problem in the literature and various methods have been proposed to defend against adversarial images.

Adversarial Robustness Video Recognition

Paper
Code

CR-Fill: Generative Image Inpainting with Auxiliary Contexutal Reconstruction

1 code implementation • 25 Nov 2020 • Yu Zeng, Zhe Lin, Huchuan Lu, Vishal M. Patel

Due to the lack of supervision signals for the correspondence between missing regions and known regions, it may fail to find proper reference features, which often leads to artifacts in the results.

Image Inpainting

219

Paper
Code

Overcomplete Deep Subspace Clustering Networks

1 code implementation • 16 Nov 2020 • Jeya Maria Jose Valanarasu, Vishal M. Patel

This method uses undercomplete representations of the input data which makes it not so robust and more dependent on pre-training.

Clustering

Paper
Code

Deep Image Compositing

no code implementations • 4 Nov 2020 • He Zhang, Jianming Zhang, Federico Perazzi, Zhe Lin, Vishal M. Patel

In this paper, we propose a new method which can automatically generate high-quality image compositing without any user input.

Image Matting

Paper
Add Code

Exploring Overcomplete Representations for Single Image Deraining using CNNs

1 code implementation • 20 Oct 2020 • Rajeev Yasarla, Jeya Maria Jose Valanarasu, Vishal M. Patel

Removal of rain streaks from a single image is an extremely challenging problem since the rainy images often contain rain streaks of different size, shape, direction and density.

Single Image Deraining

Paper
Code

KiU-Net: Overcomplete Convolutional Architectures for Biomedical Image and Volumetric Segmentation

1 code implementation • 4 Oct 2020 • Jeya Maria Jose Valanarasu, Vishwanath A. Sindagi, Ilker Hacihaliloglu, Vishal M. Patel

To overcome this issue, we propose using an overcomplete convolutional architecture where we project our input image into a higher dimension such that we constrain the receptive field from increasing in the deep layers of the network.

Ranked #1 on Medical Image Segmentation on RITE

3D Medical Imaging Segmentation Brain Tumor Segmentation +6

340

Paper
Code

MultAV: Multiplicative Adversarial Videos

no code implementations • 17 Sep 2020 • Shao-Yuan Lo, Vishal M. Patel

In this paper, we propose a novel attack method against video recognition models, Multiplicative Adversarial Videos (MultAV), which imposes perturbation on video data by multiplication.

Adversarial Attack Video Recognition

Paper
Add Code

Completely Self-Supervised Crowd Counting via Distribution Matching

1 code implementation • 14 Sep 2020 • Deepak Babu Sam, Abhinav Agarwalla, Jimmy Joseph, Vishwanath A. Sindagi, R. Venkatesh Babu, Vishal M. Patel

Dense crowd counting is a challenging task that demands millions of head annotations for training models.

Crowd Counting Density Estimation

Paper
Code

Defending Against Multiple and Unforeseen Adversarial Videos

no code implementations • 11 Sep 2020 • Shao-Yuan Lo, Vishal M. Patel

With a multiple BN structure, each BN brach is responsible for learning the distribution of a single perturbation type and thus provides more precise distribution estimations.

Adversarial Robustness General Classification +2

Paper
Add Code

Open-set Adversarial Defense

1 code implementation • ECCV 2020 • Rui Shao, Pramuditha Perera, Pong C. Yuen, Vishal M. Patel

In this paper, we show that open-set recognition systems are vulnerable to adversarial attacks.

Adversarial Defense Denoising +1

Paper
Code

Confidence-guided Lesion Mask-based Simultaneous Synthesis of Anatomic and Molecular MR Images in Patients with Post-treatment Malignant Gliomas

1 code implementation • 6 Aug 2020 • Pengfei Guo, Puyang Wang, Rajeev Yasarla, Jinyuan Zhou, Vishal M. Patel, Shanshan Jiang

Data-driven automatic approaches have demonstrated their great potential in resolving various clinical diagnostic dilemmas in neuro-oncology, especially with the help of standard anatomic and advanced molecular MR images.

Paper
Code

Learning to Restore a Single Face Image Degraded by Atmospheric Turbulence using CNNs

no code implementations • 16 Jul 2020 • Rajeev Yasarla, Vishal M. Patel

Atmospheric turbulence significantly affects imaging systems which use light that has propagated through long atmospheric paths.

Paper
Add Code

Anomaly Detection-Based Unknown Face Presentation Attack Detection

1 code implementation • 11 Jul 2020 • Yashasvi Baweja, Poojan Oza, Pramuditha Perera, Vishal M. Patel

Anomaly detection-based spoof attack detection is a recent development in face Presentation Attack Detection (fPAD), where a spoof detector is learned using only non-attacked images of users.

Anomaly Detection Face Presentation Attack Detection +1

Paper
Code

Learning to Count in the Crowd from Limited Labeled Data

no code implementations • ECCV 2020 • Vishwanath A. Sindagi, Rajeev Yasarla, Deepak Sam Babu, R. Venkatesh Babu, Vishal M. Patel

In this work, we focus on reducing the annotation efforts by learning to count in the crowd from limited number of labeled samples while leveraging a large pool of unlabeled data.

Crowd Counting

Paper
Add Code

Lesion Mask-based Simultaneous Synthesis of Anatomic and MolecularMR Images using a GAN

1 code implementation • 26 Jun 2020 • Pengfei Guo, Puyang Wang, Jinyuan Zhou, Vishal M. Patel, Shanshan Jiang

Data-driven automatic approaches have demonstrated their great potential in resolving various clinical diagnostic dilemmas for patients with malignant gliomas in neuro-oncology with the help of conventional and advanced molecular MR images.

Data Augmentation

Paper
Code

Quickest Intruder Detection for Multiple User Active Authentication

no code implementations • 21 Jun 2020 • Pramuditha Perera, Julian Fierrez, Vishal M. Patel

In this paper, we investigate how to detect intruders with low latency for Active Authentication (AA) systems with multiple-users.

Change Detection

Paper
Add Code

KiU-Net: Towards Accurate Segmentation of Biomedical Images using Over-complete Representations

3 code implementations • 8 Jun 2020 • Jeya Maria Jose, Vishwanath Sindagi, Ilker Hacihaliloglu, Vishal M. Patel

Due to its excellent performance, U-Net is the most widely used backbone architecture for biomedical image segmentation in the recent years.

Anatomy Image Segmentation +2

8,226

Paper
Code

Federated Face Presentation Attack Detection

no code implementations • 29 May 2020 • Rui Shao, Pramuditha Perera, Pong C. Yuen, Vishal M. Patel

A face presentation attack detection model with good generalization can be obtained when it is trained with face images from different input distributions and different types of spoof attacks.

Face Anti-Spoofing Face Presentation Attack Detection +2

Paper
Add Code

Multi-Scale Thermal to Visible Face Verification via Attribute Guided Synthesis

no code implementations • 20 Apr 2020 • Xing Di, Benjamin S. Riggan, Shuowen Hu, Nathaniel J. Short, Vishal M. Patel

Finally, a pre-trained VGG-Face network is leveraged to extract features from the synthesized image and the input visible image for verification.

Attribute Face Verification

Paper
Add Code

JHU-CROWD++: Large-Scale Crowd Counting Dataset and A Benchmark Method

no code implementations • 7 Apr 2020 • Vishwanath A. Sindagi, Rajeev Yasarla, Vishal M. Patel

The proposed Confidence Guided Deep Residual Counting Network (CG-DRCN) is evaluated on recent complex datasets, and it achieves significant improvements in errors.

Crowd Counting

Paper
Add Code

Learning to Segment Brain Anatomy from 2D Ultrasound with Less Data

no code implementations • 18 Dec 2019 • Jeya Maria Jose V., Rajeev Yasarla, Puyang Wang, Ilker Hacihaliloglu, Vishal M. Patel

We show that our method can synthesize high-quality US images for every manipulated segmentation label with qualitative and quantitative improvements over the recent state-of-the-art synthesis methods.

Anatomy Image Generation +2

Paper
Add Code

Facial Synthesis from Visual Attributes via Sketch using Multi-Scale Generators

no code implementations • 17 Dec 2019 • Xing Di, Vishal M. Patel

In this paper, we take a different approach, where we formulate the original problem as a stage-wise learning problem.

Attribute Face Generation

Paper
Add Code

Prior-based Domain Adaptive Object Detection for Hazy and Rainy Conditions

no code implementations • ECCV 2020 • Vishwanath A. Sindagi, Poojan Oza, Rajeev Yasarla, Vishal M. Patel

Adverse weather conditions such as haze and rain corrupt the quality of captured images, which cause detection networks trained on clean images to perform poorly on these images.

object-detection Object Detection

Paper
Add Code

Pushing the Frontiers of Unconstrained Crowd Counting: New Dataset and Benchmark Method

no code implementations • ICCV 2019 • Vishwanath A. Sindagi, Rajeev Yasarla, Vishal M. Patel

The proposed Confidence Guided Deep Residual Counting Network (CG-DRCN) is evaluated on recent complex datasets, and it achieves significant improvements in errors.

Crowd Counting

Paper
Add Code

Confidence Measure Guided Single Image De-raining

no code implementations • 10 Sep 2019 • Rajeev Yasarla, Vishal M. Patel

Single image de-raining is an extremely challenging problem since the rainy images contain rain streaks which often vary in size, direction and density.

Single Image Deraining

Paper
Add Code

Multi-Level Bottom-Top and Top-Bottom Feature Fusion for Crowd Counting

no code implementations • ICCV 2019 • Vishwanath A. Sindagi, Vishal M. Patel

These issues are further exacerbated in highly congested scenes.

Crowd Counting

Paper
Add Code

Deblurring Face Images using Uncertainty Guided Multi-Stream Semantic Networks

1 code implementation • 30 Jul 2019 • Rajeev Yasarla, Federico Perazzi, Vishal M. Patel

We propose a novel multi-stream architecture and training methodology that exploits semantic labels for facial image deblurring.

Deblurring Image Deblurring

114

Paper
Code

HA-CCN: Hierarchical Attention-based Crowd Counting Network

no code implementations • 24 Jul 2019 • Vishwanath A. Sindagi, Vishal M. Patel

The proposed method, which is based on the VGG16 network, consists of a spatial attention module (SAM) and a set of global attention modules (GAM).

Crowd Counting

Paper
Add Code

Inverse Attention Guided Deep Crowd Counting Network

no code implementations • 2 Jul 2019 • Vishwanath A. Sindagi, Vishal M. Patel

In this paper, we address the challenging problem of crowd counting in congested scenes.

Crowd Counting Segmentation

Paper
Add Code

Uncertainty Guided Multi-Scale Residual Learning-using a Cycle Spinning CNN for Single Image De-Raining

1 code implementation • CVPR 2019 • Rajeev Yasarla, Vishal M. Patel

Previous approaches have attempted to address this problem by leveraging some prior information to remove rain streaks from a single image.

Ranked #8 on Single Image Deraining on Test2800

Single Image Deraining

Paper
Code

Deep Sparse Representation-based Classification

1 code implementation • 24 Apr 2019 • Mahdi Abavisani, Vishal M. Patel

The proposed network consists of a convolutional autoencoder along with a fully-connected layer.

Ranked #1 on Sparse Representation-based Classification on SVHN

Classification General Classification +2

Paper
Code

Polarimetric Thermal to Visible Face Verification via Self-Attention Guided Synthesis

no code implementations • 15 Apr 2019 • Xing Di, Benjamin S. Riggan, Shuowen Hu, Nathaniel J. Short, Vishal M. Patel

Polarimetric thermal to visible face verification entails matching two images that contain significant domain differences.

Face Verification Generative Adversarial Network +1

Paper
Add Code

C2AE: Class Conditioned Auto-Encoder for Open-set Recognition

no code implementations • CVPR 2019 • Poojan Oza, Vishal M. Patel

It refers to the problem of identifying the unknown classes during testing, while maintaining performance on the known classes.

Classification General Classification +2

Paper
Add Code

Deep CNN-based Multi-task Learning for Open-Set Recognition

no code implementations • 7 Mar 2019 • Poojan Oza, Vishal M. Patel

We propose a novel deep convolutional neural network (CNN) based multi-task learning approach for open-set visual recognition.

General Classification Image Classification +2

Paper
Add Code

Deep Transfer Learning for Multiple Class Novelty Detection

1 code implementation • CVPR 2019 • Pramuditha Perera, Vishal M. Patel

We show that thresholding the maximal activation of the proposed network can be used to identify novel objects effectively.

Novelty Detection Transfer Learning

Paper
Code

Active Authentication using an Autoencoder regularized CNN-based One-Class Classifier

no code implementations • 4 Mar 2019 • Poojan Oza, Vishal M. Patel

Generally, an active authentication problem is modelled as a one class classification problem due to the unavailability of data from the impostor users.

Classification General Classification +2

Paper
Add Code

One-Class Convolutional Neural Network

4 code implementations • 24 Jan 2019 • Poojan Oza, Vishal M. Patel

We present a novel Convolutional Neural Network (CNN) based approach for one class classification.

General Classification Novelty Detection +1

Paper
Code

DAFE-FD: Density Aware Feature Enrichment for Face Detection

no code implementations • 16 Jan 2019 • Vishwanath A. Sindagi, Vishal M. Patel

In this work, we approach the problem of small face detection with the motivation of enriching the feature maps using a density map estimation module.

Crowd Counting Density Estimation +1

Paper
Add Code

Polarimetric Thermal to Visible Face Verification via Attribute Preserved Synthesis

no code implementations • 3 Jan 2019 • Xing Di, He Zhang, Vishal M. Patel

A pre-trained VGG-Face network is used to extract the attributes from the visible image.

Attribute Face Verification +1

Paper
Add Code

Improving the Performance of Unimodal Dynamic Hand-Gesture Recognition with Multimodal Training

1 code implementation • CVPR 2019 • Mahdi Abavisani, Hamid Reza Vaezi Joze, Vishal M. Patel

We present an efficient approach for leveraging the knowledge from multiple modalities in training unimodal 3D convolutional neural networks (3D-CNNs) for the task of dynamic hand gesture recognition.

Ranked #1 on Hand Gesture Recognition on VIVA Hand Gestures Dataset

Action Recognition Hand Gesture Recognition +2

Paper
Code

Synthesis of High-Quality Visible Faces from Polarimetric Thermal Faces using Generative Adversarial Networks

no code implementations • 12 Dec 2018 • He Zhang, Benjamin S. Riggan, Shuowen Hu, Nathaniel J. Short, Vishal M. Patel

Previous approaches utilize either a two-step procedure (visible feature estimation and visible image reconstruction) or an input-level fusion technique, where different Stokes images are concatenated and used as a multi-channel input to synthesize the visible image given the corresponding polarimetric signatures.

Face Generation Face Verification +1

Paper
Add Code

Disentangled Variational Representation for Heterogeneous Face Recognition

no code implementations • 6 Sep 2018 • Xiang Wu, Huaibo Huang, Vishal M. Patel, Ran He, Zhenan Sun

Visible (VIS) to near infrared (NIR) face matching is a challenging problem due to the significant domain discrepancy between the domains and a lack of sufficient data for training cross-modal matching algorithms.

Ranked #2 on Face Verification on BUAA-VisNir

Face Recognition Heterogeneous Face Recognition

Paper
Add Code

Simultaneous Segmentation and Classification of Bone Surfaces from Ultrasound Using a Multi-feature Guided CNN

no code implementations • 26 Jun 2018 • Puyang Wang, Vishal M. Patel, Ilker Hacihaliloglu

Various imaging artifacts, low signal-to-noise ratio, and bone surfaces appearing several millimeters in thickness have hindered the success of ultrasound (US) guided computer assisted orthopedic surgery procedures.

General Classification Segmentation

Paper
Add Code

Pushing the Limits of Unconstrained Face Detection: a Challenge Dataset and Baseline Results

no code implementations • 26 Apr 2018 • Hajime Nada, Vishwanath A. Sindagi, He Zhang, Vishal M. Patel

In this work, we identify the next set of challenges that requires attention from the research community and collect a new dataset of face images that involve these issues such as weather-based degradations, motion blur, focus blur and several others.

Face Detection Robust Face Recognition

Paper
Add Code

Deep Multimodal Subspace Clustering Networks

1 code implementation • 17 Apr 2018 • Mahdi Abavisani, Vishal M. Patel

In addition to various spatial fusion-based methods, an affinity fusion-based network is also proposed in which the self-expressive layer corresponding to different modalities is enforced to be the same.

Ranked #1 on Image Clustering on Extended Yale-B

Clustering Multi-modal Subspace Clustering +2

Paper
Code

Densely Connected Pyramid Dehazing Network

1 code implementation • CVPR 2018 • He Zhang, Vishal M. Patel

We propose a new end-to-end single image dehazing method, called Densely Connected Pyramid Dehazing Network (DCPDN), which can jointly learn the transmission map, atmospheric light and dehazing all together.

Ranked #6 on Image Dehazing on RESIDE-6K

Generative Adversarial Network Image Dehazing +1

399

Paper
Code

Generating High Quality Visible Images from SAR Images Using CNNs

no code implementations • 27 Feb 2018 • Puyang Wang, Vishal M. Patel

We propose a novel approach for generating high quality visible-like images from Synthetic Aperture Radar (SAR) images using Deep Convolutional Generative Adversarial Network (GAN) architectures.

Colorization Generative Adversarial Network +2

Paper
Add Code

Density-aware Single Image De-raining using a Multi-stream Dense Network

1 code implementation • CVPR 2018 • He Zhang, Vishal M. Patel

In addition, an ablation study is performed to demonstrate the improvements obtained by different modules in the proposed method.

Ranked #6 on Single Image Deraining on RainCityscapes

Density Estimation Single Image Deraining

237

Paper
Code

Learning Deep Features for One-Class Classification

5 code implementations • 16 Jan 2018 • Pramuditha Perera, Vishal M. Patel

We propose a deep learning-based solution for the problem of feature learning in one-class classification.

Descriptive General Classification +3

185

Paper
Code

Face Synthesis from Visual Attributes via Sketch using Conditional VAEs and GANs

1 code implementation • 30 Dec 2017 • Xing Di, Vishal M. Patel

In this paper, we take a different approach, where we formulate the original problem as a stage-wise learning problem.

Attribute Face Generation

Paper
Code

In2I : Unsupervised Multi-Image-to-Image Translation Using Generative Adversarial Networks

1 code implementation • 26 Nov 2017 • Pramuditha Perera, Mahdi Abavisani, Vishal M. Patel

In unsupervised image-to-image translation, the goal is to learn the mapping between an input image and an output image using a set of unpaired training images.

Ranked #1 on Multimodal Unsupervised Image-To-Image Translation on EPFL NIR-VIS

Generative Adversarial Network Multimodal Unsupervised Image-To-Image Translation +2

Paper
Code

High-Quality Facial Photo-Sketch Synthesis Using Multi-Adversarial Networks

1 code implementation • 27 Oct 2017 • Lidan Wang, Vishwanath A. Sindagi, Vishal M. Patel

To this end, we propose a novel synthesis framework called Photo-Sketch Synthesis using Multi-Adversarial Networks, (PS2-MAN) that iteratively generates low resolution to high resolution images in an adversarial way.

Ranked #2 on Face Sketch Synthesis on CUHK

Face Sketch Synthesis Image Quality Assessment +3

Paper
Code

GP-GAN: Gender Preserving GAN for Synthesizing Faces from Landmarks

2 code implementations • 3 Oct 2017 • Xing Di, Vishwanath A. Sindagi, Vishal M. Patel

The primary aim of this work is to demonstrate that information preserved by landmarks (gender in particular) can be further accentuated by leveraging generative models to synthesize corresponding faces.

Face Generation Generative Adversarial Network

Paper
Code

Generative Adversarial Network-based Synthesis of Visible Faces from Polarimetric Thermal Faces

no code implementations • 8 Aug 2017 • He Zhang, Vishal M. Patel, Benjamin S. Riggan, Shuowen Hu

Previous approaches utilize a two-step procedure (visible feature estimation and visible image reconstruction) to synthesize the visible image given the corresponding polarimetric thermal image.

Face Generation Face Recognition +3

Paper
Add Code

Joint Transmission Map Estimation and Dehazing using Deep Networks

no code implementations • 2 Aug 2017 • He Zhang, Vishwanath Sindagi, Vishal M. Patel

Single image haze removal is an extremely challenging problem due to its inherent ill-posed nature.

Image Dehazing Single Image Dehazing +1

Paper
Add Code

Generating High-Quality Crowd Density Maps using Contextual Pyramid CNNs

no code implementations • ICCV 2017 • Vishwanath A. Sindagi, Vishal M. Patel

DME is a multi-column architecture-based CNN that aims to generate high-dimensional feature maps from the input image which are fused with the contextual information estimated by GCE and LCE using F-CNN.

Ranked #8 on Crowd Counting on WorldExpo’10

Crowd Counting Vocal Bursts Intensity Prediction

Paper
Add Code

CNN-based Cascaded Multi-task Learning of High-level Prior and Density Estimation for Crowd Counting

1 code implementation • 30 Jul 2017 • Vishwanath A. Sindagi, Vishal M. Patel

Estimating crowd count in densely crowded scenes is an extremely challenging task due to non-uniform scale variations.

Ranked #16 on Crowd Counting on UCF-QNRF

Crowd Counting Density Estimation +2

193

Paper
Code

Synthesis-based Robust Low Resolution Face Recognition

no code implementations • 10 Jul 2017 • Sumit Shekhar, Vishal M. Patel, Rama Chellappa

Recognition of low resolution face images is a challenging problem in many practical face recognition systems.

Dictionary Learning Face Recognition

Paper
Add Code

A Survey of Recent Advances in CNN-based Single Image Crowd Counting and Density Estimation

1 code implementation • 5 Jul 2017 • Vishwanath A. Sindagi, Vishal M. Patel

Nevertheless, over the last few years, crowd count analysis has evolved from earlier methods that are often limited to small variations in crowd density and scales to the current state-of-the-art methods that have developed the ability to perform successfully on a wide range of scenarios.

Crowd Counting Density Estimation

Paper
Code

Hierarchical Multimodal Metric Learning for Multimodal Classification

no code implementations • CVPR 2017 • Heng Zhang, Vishal M. Patel, Rama Chellappa

The learned metrics can improve multimodal classification accuracy and experimental results on four datasets show that the proposed algorithm outperforms existing learning algorithms based on multiple metrics as well as other approaches tested on these datasets.

Classification General Classification +4

Paper
Add Code

SAR Image Despeckling Using a Convolutional Neural Network

3 code implementations • 2 Jun 2017 • Puyang Wang, He Zhang, Vishal M. Patel

Synthetic Aperture Radar (SAR) images are often contaminated by a multiplicative noise known as speckle.

Sar Image Despeckling

Paper
Code

Sparse Representation-based Open Set Recognition

1 code implementation • 6 May 2017 • He Zhang, Vishal M. Patel

We propose a generalized Sparse Representation- based Classification (SRC) algorithm for open set recognition where not all classes presented during testing are known during training.

Classification General Classification +3

Paper
Code

Learning from Ambiguously Labeled Face Images

no code implementations • 15 Feb 2017 • Ching-Hui Chen, Vishal M. Patel, Rama Chellappa

To prevent the majority labels from dominating the result of MCar, we generalize MCar to a weighted MCar (WMCar) that handles label imbalance.

Matrix Completion

Paper
Add Code

Image De-raining Using a Conditional Generative Adversarial Network

7 code implementations • 21 Jan 2017 • He Zhang, Vishwanath Sindagi, Vishal M. Patel

Hence, it is important to solve the problem of single image de-raining/de-snowing.

Generative Adversarial Network Rain Removal

248

Paper
Code

Active User Authentication for Smartphones: A Challenge Data Set and Benchmark Results

no code implementations • 25 Oct 2016 • Upal Mahbub, Sayantan Sarkar, Vishal M. Patel, Rama Chellappa

In this paper, automated user verification techniques for smartphones are investigated.

Face Detection Face Verification

Paper
Add Code

Unconstrained Still/Video-Based Face Verification with Deep Convolutional Neural Networks

no code implementations • 9 May 2016 • Jun-Cheng Chen, Rajeev Ranjan, Swami Sankaranarayanan, Amit Kumar, Ching-Hui Chen, Vishal M. Patel, Carlos D. Castillo, Rama Chellappa

Over the last five years, methods based on Deep Convolutional Neural Networks (DCNNs) have shown impressive performance improvements for object detection and recognition problems.

Face Detection Face Recognition +3

Paper
Add Code

Partial Face Detection for Continuous Authentication

no code implementations • 30 Mar 2016 • Upal Mahbub, Vishal M. Patel, Deepak Chandra, Brandon Barbello, Rama Chellappa

In this paper, a part-based technique for real time detection of users' faces on mobile devices is proposed.

Face Detection

Paper
Add Code

HyperFace: A Deep Multi-task Learning Framework for Face Detection, Landmark Localization, Pose Estimation, and Gender Recognition

2 code implementations • 3 Mar 2016 • Rajeev Ranjan, Vishal M. Patel, Rama Chellappa

We present an algorithm for simultaneous face detection, landmarks localization, pose estimation and gender recognition using deep convolutional neural networks (CNN).

Ranked #2 on Face Detection on Annotated Faces in the Wild

Face Detection Multi-Task Learning +1

176

Paper
Code

Deep Feature-based Face Detection on Mobile Devices

no code implementations • 16 Feb 2016 • Sayantan Sarkar, Vishal M. Patel, Rama Chellappa

We propose a deep feature-based face detector for mobile devices to detect user's face acquired by the front facing camera.

Face Detection

Paper
Add Code

Optimized Kernel-based Projection Space of Riemannian Manifolds

no code implementations • 10 Feb 2016 • Azadeh Alavi, Vishal M. Patel, Rama Chellappa

Recently, it was shown that embedding such manifolds into a Random Projection Spaces (RPS), rather than RKHS or tangent space, leads to higher classification and clustering performance.

Classification Clustering +2

Paper
Add Code

Towards the Design of an End-to-End Automated System for Image and Video-based Recognition

no code implementations • 28 Jan 2016 • Rama Chellappa, Jun-Cheng Chen, Rajeev Ranjan, Swami Sankaranarayanan, Amit Kumar, Vishal M. Patel, Carlos D. Castillo

In this paper, we present a brief history of developments in computer vision and artificial neural networks over the last forty years for the problem of image-based recognition.

Face Verification Object +3

Paper
Add Code

Sequential Score Adaptation with Extreme Value Theory for Robust Railway Track Inspection

1 code implementation • 20 Oct 2015 • Xavier Gibert, Vishal M. Patel, Rama Chellappa

Periodic inspections are necessary to keep railroad tracks in state of good repair and prevent train accidents.

Defect Detection

Paper
Code

Deep Multi-task Learning for Railway Track Inspection

no code implementations • 17 Sep 2015 • Xavier Gibert, Vishal M. Patel, Rama Chellappa

Railroad tracks need to be periodically inspected and monitored to ensure safe transportation.

Multi-Task Learning

Paper
Add Code

A Deep Pyramid Deformable Part Model for Face Detection

no code implementations • 18 Aug 2015 • Rajeev Ranjan, Vishal M. Patel, Rama Chellappa

We present a face detection algorithm based on Deformable Part Models and deep pyramidal features.

Face Detection Robust Face Recognition

Paper
Add Code

Unconstrained Face Verification using Deep CNN Features

no code implementations • 7 Aug 2015 • Jun-Cheng Chen, Vishal M. Patel, Rama Chellappa

In this paper, we present an algorithm for unconstrained face verification based on deep convolutional features and evaluate it on the newly released IARPA Janus Benchmark A (IJB-A) dataset.

Ranked #13 on Face Verification on IJB-A

Face Verification

Paper
Add Code

Matrix Completion for Resolving Label Ambiguity

no code implementations • CVPR 2015 • Ching-Hui Chen, Vishal M. Patel, Rama Chellappa

News dataset.

Matrix Completion

Paper
Add Code

Generalized Domain-Adaptive Dictionaries

no code implementations • CVPR 2013 • Sumit Shekhar, Vishal M. Patel, Hien V. Nguyen, Rama Chellappa

Data-driven dictionaries have produced state-of-the-art results in various classification tasks.

General Classification

Paper
Add Code

Dictionary Learning from Ambiguously Labeled Data

no code implementations • CVPR 2013 • Yi-Chen Chen, Vishal M. Patel, Jaishanker K. Pillai, Rama Chellappa, P. J. Phillips

We propose a novel dictionary-based learning method for ambiguously labeled multiclass classification, where each training sample has multiple labels and only one of them is the correct label.

Dictionary Learning General Classification

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.