Search Results for author: Naveed Akhtar

Found 58 papers, 15 papers with code

Severity Controlled Text-to-Image Generative Model Bias Manipulation

no code implementations • 3 Apr 2024 • Jordan Vice, Naveed Akhtar, Richard Hartley, Ajmal Mian

Charting the susceptibility of T2I models to such manipulation, we first expose the new possibility of a dynamic and computationally efficient exploitation of model bias by targeting the embedded language models.

Backdoor Attack Prompt Engineering

Paper
Add Code

ARCNet: An Asymmetric Residual Wavelet Column Correction Network for Infrared Image Destriping

no code implementations • 28 Jan 2024 • Shuai Yuan, Hanlin Qin, Xiang Yan, Naveed Akhtar, Shiqi Yang, Shuowen Yang

Our neural model leverages a novel downsampler, residual haar discrete wavelet transform (RHDWT), stripe directional prior knowledge and data-driven learning to induce a model with enriched feature representation of stripe noise and background.

Feature Upsampling Image Reconstruction

Paper
Add Code

SCTransNet: Spatial-channel Cross Transformer Network for Infrared Small Target Detection

no code implementations • 28 Jan 2024 • Shuai Yuan, Hanlin Qin, Xiang Yan, Naveed Akhtar, Ajmal Mian

In the proposed SCTBs, the outputs of all encoders are interacted with cross transformer to generate mixed features, which are redistributed to all decoders to effectively reinforce semantic differences between the target and clutter at full scales.

Paper
Add Code

M3BUNet: Mobile Mean Max UNet for Pancreas Segmentation on CT-Scans

no code implementations • 18 Jan 2024 • Juwita juwita, Ghulam Mubashar Hassan, Naveed Akhtar, Amitava Datta

Segmenting organs in CT scan images is a necessary process for multiple downstream medical image analysis tasks.

Automated Pancreas Segmentation object-detection +3

Paper
Add Code

Quantifying Bias in Text-to-Image Generative Models

no code implementations • 20 Dec 2023 • Jordan Vice, Naveed Akhtar, Richard Hartley, Ajmal Mian

Bias in text-to-image (T2I) models can propagate unfair social representations and may be used to aggressively market ideas or push controversial agendas.

Hallucination Marketing

Paper
Add Code

Text-image guided Diffusion Model for generating Deepfake celebrity interactions

no code implementations • 26 Sep 2023 • Yunzhuo Chen, Nur Al Hasan Haldar, Naveed Akhtar, Ajmal Mian

To curb their exploitation for Deepfakes, it is imperative to first explore the extent to which diffusion models can be used to generate realistic content that is controllable with convenient prompts.

Face Swapping

Paper
Add Code

On quantifying and improving realism of images generated with diffusion

no code implementations • 26 Sep 2023 • Yunzhuo Chen, Naveed Akhtar, Nur Al Hasan Haldar, Ajmal Mian

Recent advances in diffusion models have led to a quantum leap in the quality of generative visual content.

Attribute Benchmarking

Paper
Add Code

PRAT: PRofiling Adversarial aTtacks

no code implementations • 20 Sep 2023 • Rahul Ambati, Naveed Akhtar, Ajmal Mian, Yogesh Singh Rawat

Inspired by this, we introduce a novel problem of PRofiling Adversarial aTtacks (PRAT).

Adversarial Attack

Paper
Add Code

MAiVAR-T: Multimodal Audio-image and Video Action Recognizer using Transformers

no code implementations • 1 Aug 2023 • Muhammad Bilal Shaikh, Douglas Chai, Syed Mohammed Shamsul Islam, Naveed Akhtar

This model employs an intuitive approach for the combination of audio-image and video modalities, with a primary aim to escalate the effectiveness of multimodal human action recognition (MHAR).

Action Recognition Temporal Action Localization

Paper
Add Code

BAGM: A Backdoor Attack for Manipulating Text-to-Image Generative Models

1 code implementation • 31 Jul 2023 • Jordan Vice, Naveed Akhtar, Richard Hartley, Ajmal Mian

Based on the penetration level, BAGM takes the form of a suite of attacks that are referred to as surface, shallow and deep attacks in this article.

Backdoor Attack Language Modelling +2

Paper
Code

A Comprehensive Overview of Large Language Models

1 code implementation • 12 Jul 2023 • Humza Naveed, Asad Ullah Khan, Shi Qiu, Muhammad Saqib, Saeed Anwar, Muhammad Usman, Naveed Akhtar, Nick Barnes, Ajmal Mian

Large Language Models (LLMs) have recently demonstrated remarkable capabilities in natural language processing tasks and beyond.

Benchmarking

Paper
Code

UnLoc: A Universal Localization Method for Autonomous Vehicles using LiDAR, Radar and/or Camera Input

no code implementations • 3 Jul 2023 • Muhammad Ibrahim, Naveed Akhtar, Saeed Anwar, Ajmal Mian

The results ascertain the efficacy of our technique.

Autonomous Navigation

Paper
Add Code

Towards credible visual model interpretation with path attribution

no code implementations • 23 May 2023 • Naveed Akhtar, Muhammad A. A. K. Jalwana

We also devise a scheme to preclude the conditions in which visual model interpretation can invalidate the axiomatic properties of path attribution.

Paper
Add Code

Rethinking interpretation: Input-agnostic saliency mapping of deep visual classifiers

no code implementations • 31 Mar 2023 • Naveed Akhtar, Mohammad A. A. K. Jalwana

Addressing the gap, we introduce a new perspective of input-agnostic saliency mapping that computationally estimates the high-level features attributed by the model to its outputs.

Paper
Add Code

A Survey of Explainable AI in Deep Visual Modeling: Methods and Metrics

no code implementations • 31 Jan 2023 • Naveed Akhtar

Deep visual models have widespread applications in high-stake domains.

Paper
Add Code

Pre-text Representation Transfer for Deep Learning with Limited Imbalanced Data : Application to CT-based COVID-19 Detection

no code implementations • 21 Jan 2023 • Fouzia Altaf, Syed M. S. Islam, Naeem K. Janjua, Naveed Akhtar

We fuse the output of this layer with the predictions of a model induced with the traditional transfer learning performed over our pre-text transferred model.

Transfer Learning

Paper
Add Code

Slice Transformer and Self-supervised Learning for 6DoF Localization in 3D Point Cloud Maps

no code implementations • 21 Jan 2023 • Muhammad Ibrahim, Naveed Akhtar, Saeed Anwar, Michael Wise, Ajmal Mian

We present a self-supervised learning method that employs Transformers for the first time for the task of outdoor localization using LiDAR data.

Autonomous Vehicles Outdoor Localization +1

Paper
Add Code

AShapeFormer: Semantics-Guided Object-Level Active Shape Encoding for 3D Object Detection via Transformers

1 code implementation • CVPR 2023 • Zechuan Li, Hongshan Yu, Zhengeng Yang, Tongjia Chen, Naveed Akhtar

In this work, we propose AShapeFormer, a semantics-guided object-level shape encoding module for 3D object detection.

3D Object Detection Object +1

Paper
Code

Query Efficient Cross-Dataset Transferable Black-Box Attack on Action Recognition

no code implementations • 23 Nov 2022 • Rohit Gupta, Naveed Akhtar, Gaurav Kumar Nayak, Ajmal Mian, Mubarak Shah

By using a nearly disjoint dataset to train the substitute model, our method removes the requirement that the substitute model be trained using the same dataset as the target model, and leverages queries to the target model to retain the fooling rate benefits provided by query-based methods.

Action Recognition

Paper
Add Code

Simultaneous Multiple Object Detection and Pose Estimation using 3D Model Infusion with Monocular Vision

2 code implementations • 21 Nov 2022 • Congliang Li, ShiJie Sun, XiangYu Song, HuanSheng Song, Naveed Akhtar, Ajmal Saeed Mian

Using the labeling method, we provide the KITTI-6DoF dataset with $\sim7. 5$K annotated frames.

Autonomous Driving Object +3

Paper
Code

Efficient Diffusion Models for Vision: A Survey

no code implementations • 7 Oct 2022 • Anwaar Ulhaq, Naveed Akhtar

In this review, we present the most recent advances in diffusion models for vision, specifically focusing on the important design aspects that affect the computational efficiency of DMs.

Computational Efficiency

Paper
Add Code

Vision Transformers for Action Recognition: A Survey

no code implementations • 13 Sep 2022 • Anwaar Ulhaq, Naveed Akhtar, Ganna Pogrebna, Ajmal Mian

Finally, it provides a discussion on the challenges, outlook, and future avenues for this research direction.

Action Recognition Dimensionality Reduction +1

Paper
Add Code

MAiVAR: Multimodal Audio-Image and Video Action Recognizer

no code implementations • IEEE International Conference on Visual Communications and Image Processing (VCIP) 2023 • Muhammad Bilal Shaikh, Douglas Chai, Syed Mohammed Shamsul Islam, Naveed Akhtar

Currently, action recognition is predominately performed on video data as processed by CNNs.

Action Recognition

Paper
Add Code

Contrastive Self-Supervised Learning Leads to Higher Adversarial Susceptibility

no code implementations • 22 Jul 2022 • Rohit Gupta, Naveed Akhtar, Ajmal Mian, Mubarak Shah

We establish that this is a result of the presence of false negative pairs in the training process, which increases model sensitivity to input perturbations.

Adversarial Robustness Self-Supervised Learning +1

Paper
Add Code

Deformation and Correspondence Aware Unsupervised Synthetic-to-Real Scene Flow Estimation for Point Clouds

1 code implementation • CVPR 2022 • Zhao Jin, Yinjie Lei, Naveed Akhtar, Haifeng Li, Munawar Hayat

With that, we develop a large-scale synthetic scene flow dataset GTA-SF.

Autonomous Driving Domain Adaptation +1

Paper
Code

A Survey of Neural Trojan Attacks and Defenses in Deep Learning

no code implementations • 15 Feb 2022 • Jie Wang, Ghulam Mubashar Hassan, Naveed Akhtar

It provides a comprehensible gateway to the broader community to understand the recent developments in Neural Trojans.

Paper
Add Code

Mesh Convolution with Continuous Filters for 3D Surface Parsing

2 code implementations • 3 Dec 2021 • Huan Lei, Naveed Akhtar, Mubarak Shah, Ajmal Mian

In this paper, we propose a series of modular operations for effective geometric feature learning from 3D triangle meshes.

Scene Parsing Scene Segmentation

122

Paper
Code

Resetting the baseline: CT-based COVID-19 diagnosis with Deep Transfer Learning is not as accurate as widely thought

no code implementations • 12 Aug 2021 • Fouzia Altaf, Syed M. S. Islam, Naveed Akhtar

We hope that our reproducible investigation will help in curbing hype-driven claims for the critical problem of COVID-19 diagnosis, and pave the way for a more transparent performance evaluation of techniques for CT-based COVID-19 detection.

Computed Tomography (CT) COVID-19 Diagnosis +1

Paper
Add Code

Advances in adversarial attacks and defenses in computer vision: A survey

no code implementations • 1 Aug 2021 • Naveed Akhtar, Ajmal Mian, Navid Kardan, Mubarak Shah

In [2], we reviewed the contributions made by the computer vision community in adversarial attacks on deep learning (and their defenses) until the advent of year 2018.

Paper
Add Code

Controlled Caption Generation for Images Through Adversarial Attacks

no code implementations • 7 Jul 2021 • Nayyer Aafaq, Naveed Akhtar, Wei Liu, Mubarak Shah, Ajmal Mian

In contrast, we propose a GAN-based algorithm for crafting adversarial examples for neural image captioning that mimics the internal representation of the CNN such that the resulting deep features of the input image enable a controlled incorrect caption generation through the recurrent network.

Caption Generation Image Captioning +1

Paper
Add Code

Attack to Fool and Explain Deep Networks

no code implementations • 20 Jun 2021 • Naveed Akhtar, Muhammad A. A. K. Jalwana, Mohammed Bennamoun, Ajmal Mian

Exploring this phenomenon further, we alter the `adversarial' objective of our attack to use it as a tool to `explain' deep visual representation.

Adversarial Attack Image Generation +1

Paper
Add Code

CAMERAS: Enhanced Resolution And Sanity preserving Class Activation Mapping for image saliency

1 code implementation • CVPR 2021 • Mohammad A. A. K. Jalwana, Naveed Akhtar, Mohammed Bennamoun, Ajmal Mian

Backpropagation image saliency aims at explaining model predictions by estimating model-centric importance of individual pixels in the input.

Paper
Code

Picasso: A CUDA-based Library for Deep Learning over 3D Meshes

2 code implementations • CVPR 2021 • Huan Lei, Naveed Akhtar, Ajmal Mian

We present Picasso, a CUDA-based library comprising novel modules for deep learning over complex real-world 3D meshes.

122

Paper
Code

Boosting Deep Transfer Learning for COVID-19 Classification

no code implementations • 16 Feb 2021 • Fouzia Altaf, Syed M. S. Islam, Naeem K. Janjua, Naveed Akhtar

COVID-19 classification using chest Computed Tomography (CT) has been found pragmatically useful by several studies.

Classification Computed Tomography (CT) +4

Paper
Add Code

Simultaneous Detection and Tracking with Motion Modelling for Multiple Object Tracking

3 code implementations • ECCV 2020 • Shi-Jie Sun, Naveed Akhtar, Xiang-Yu Song, HuanSheng Song, Ajmal Mian, Mubarak Shah

Deep learning-based Multiple Object Tracking (MOT) currently relies on off-the-shelf detectors for tracking-by-detection. This results in deep models that are detector biased and evaluations that are detector influenced.

Multiple Object Tracking Object

Paper
Code

Orthogonal Deep Models As Defense Against Black-Box Attacks

no code implementations • 26 Jun 2020 • Mohammad A. A. K. Jalwana, Naveed Akhtar, Mohammed Bennamoun, Ajmal Mian

On the other, deep learning has also been found vulnerable to adversarial attacks, which calls for new techniques to defend deep models against these attacks.

Paper
Add Code

Adversarial Perturbations Prevail in the Y-Channel of the YCbCr Color Space

1 code implementation • 25 Feb 2020 • Camilo Pestana, Naveed Akhtar, Wei Liu, David Glance, Ajmal Mian

Our results show that our approach achieves the best balance between defence against adversarial attacks such as FGSM, PGD and DDN and maintaining the original accuracies of VGG-16, ResNet50 and DenseNet121 on clean images.

Paper
Code

Empirical Autopsy of Deep Video Captioning Frameworks

no code implementations • 21 Nov 2019 • Nayyer Aafaq, Naveed Akhtar, Wei Liu, Ajmal Mian

We perform extensive experiments by varying the constituent components of the video captioning framework, and quantify the performance gains that are possible by mere component selection.

Language Modelling Video Captioning +1

Paper
Add Code

Spherical Kernel for Efficient Graph Convolution on 3D Point Clouds

3 code implementations • 20 Sep 2019 • Huan Lei, Naveed Akhtar, Ajmal Mian

We propose a spherical kernel for efficient graph convolution of 3D point clouds.

Ranked #5 on 3D Object Classification on ModelNet40

3D Instance Segmentation 3D Object Classification +3

168

Paper
Code

Adversarial Attack on Skeleton-based Human Action Recognition

no code implementations • 14 Sep 2019 • Jian Liu, Naveed Akhtar, Ajmal Mian

We also explore the possibility of semantically imperceptible localized attacks with CIASA, and succeed in fooling the state-of-the-art skeleton action recognition models with high confidence.

Action Recognition Adversarial Attack +2

Paper
Add Code

Temporally Coherent Full 3D Mesh Human Pose Recovery from Monocular Video

no code implementations • 1 Jun 2019 • Jian Liu, Naveed Akhtar, Ajmal Mian

A major challenge in this regard is the lack of appropriately annotated video data for learning the desired deep models.

Paper
Add Code

Label Universal Targeted Attack

1 code implementation • 27 May 2019 • Naveed Akhtar, Mohammad A. A. K. Jalwana, Mohammed Bennamoun, Ajmal Mian

We introduce Label Universal Targeted Attack (LUTA) that makes a deep model predict a label of attacker's choice for `any' sample of a given source class with high probability.

Paper
Code

Octree guided CNN with Spherical Kernels for 3D Point Clouds

no code implementations • CVPR 2019 • Huan Lei, Naveed Akhtar, Ajmal Mian

We propose an octree guided neural network architecture and spherical convolutional kernel for machine learning from arbitrary 3D point clouds.

Ranked #10 on 3D Part Segmentation on ShapeNet-Part

3D Object Classification 3D Part Segmentation +1

Paper
Add Code

Spatio-Temporal Dynamics and Semantic Attribute Enriched Visual Encoding for Video Captioning

no code implementations • CVPR 2019 • Nayyer Aafaq, Naveed Akhtar, Wei Liu, Syed Zulqarnain Gilani, Ajmal Mian

The final representation is projected to a compact space and fed to a language model.

Attribute Caption Generation +2

Paper
Add Code

Going Deep in Medical Image Analysis: Concepts, Methods, Challenges and Future Directions

no code implementations • 15 Feb 2019 • Fouzia Altaf, Syed M. S. Islam, Naveed Akhtar, Naeem K. Janjua

Unique to this study is the Computer Vision/Machine Learning perspective taken on the advances of Deep Learning in Medical Imaging.

Anatomy BIG-bench Machine Learning

Paper
Add Code

Deep Affinity Network for Multiple Object Tracking

1 code implementation • 28 Oct 2018 • Shi-Jie Sun, Naveed Akhtar, HuanSheng Song, Ajmal Mian, Mubarak Shah

In this paper, we harness the power of deep learning for data association in tracking by jointly modelling object appearances and their affinities between different frames in an end-to-end fashion.

Benchmarking Multiple Object Tracking +3

396

Paper
Code

Spherical Convolutional Neural Network for 3D Point Clouds

no code implementations • 21 May 2018 • Huan Lei, Naveed Akhtar, Ajmal Mian

We propose a neural network for 3D point cloud processing that exploits `spherical' convolution kernels and octree partitioning of space.

3D Object Classification General Classification +1

Paper
Add Code

Benchmark data and method for real-time people counting in cluttered scenes using depth sensors

1 code implementation • 12 Apr 2018 • Shi-Jie Sun, Naveed Akhtar, HuanSheng Song, Chaoyang Zhang, Jian-Xin Li, Ajmal Mian

A thorough evaluation on PCDS demonstrates that our technique is able to count people in cluttered scenes with high accuracy at 45 fps on a 1. 7 GHz processor, and hence it can be deployed for effective real-time people counting for intelligent transportation systems.

Benchmarking

Paper
Code

Hyperspectral recovery from RGB images using Gaussian Processes

no code implementations • 15 Jan 2018 • Naveed Akhtar, Ajmal Mian

We propose to recover spectral details from RGB images of known spectral quantization by modeling natural spectra under Gaussian Processes and combining them with the RGB images.

Gaussian Processes Quantization

Paper
Add Code

Threat of Adversarial Attacks on Deep Learning in Computer Vision: A Survey

3 code implementations • 2 Jan 2018 • Naveed Akhtar, Ajmal Mian

This article presents the first comprehensive survey on adversarial attacks on deep learning in Computer Vision.

Self-Driving Cars

Paper
Code

Defense against Universal Adversarial Perturbations

no code implementations • CVPR 2018 • Naveed Akhtar, Jian Liu, Ajmal Mian

A rigorous evaluation shows that our framework can defend the network classifiers against unseen adversarial perturbations in the real-world scenarios with up to 97. 5% success rate.

Paper
Add Code

Skepxels: Spatio-temporal Image Representation of Human Skeleton Joints for Action Recognition

no code implementations • 16 Nov 2017 • Jian Liu, Naveed Akhtar, Ajmal Mian

The proposed action recognition exploits the representation in a hierarchical manner by first capturing the micro-temporal relations between the skeleton joints with the Skepxels and then exploiting their macro-temporal relations by computing the Fourier Temporal Pyramids over the CNN features of the skeletal images.

Action Analysis Action Recognition +1

Paper
Add Code

Viewpoint Invariant Action Recognition using RGB-D Videos

no code implementations • 15 Sep 2017 • Jian Liu, Naveed Akhtar, Ajmal Mian

The proposed technique capitalizes on the spatio-temporal information available in the two data streams to the extract action features that are largely insensitive to the viewpoint variations.

Action Recognition Temporal Action Localization +1

Paper
Add Code

Learning Human Pose Models from Synthesized Data for Robust RGB-D Action Recognition

no code implementations • 4 Jul 2017 • Jian Liu, Naveed Akhtar, Ajmal Mian

We propose Human Pose Models that represent RGB and depth images of human poses independent of clothing textures, backgrounds, lighting conditions, body shapes and camera viewpoints.

Ranked #94 on Skeleton Based Action Recognition on NTU RGB+D

Action Recognition Skeleton Based Action Recognition +1

Paper
Add Code

Joint Discriminative Bayesian Dictionary and Classifier Learning

no code implementations • CVPR 2017 • Naveed Akhtar, Ajmal Mian, Fatih Porikli

To further encourage discrimination in the dictionary, our model uses separate (sets of) Bernoulli distributions to represent data from different classes.

Action Recognition Temporal Action Localization

Paper
Add Code

Sparseness helps: Sparsity Augmented Collaborative Representation for Classification

no code implementations • 29 Nov 2015 • Naveed Akhtar, Faisal Shafait, Ajmal Mian

Many classification approaches first represent a test sample using the training samples of all the classes.

Classification General Classification

Paper
Add Code

Bayesian Sparse Representation for Hyperspectral Image Super Resolution

no code implementations • CVPR 2015 • Naveed Akhtar, Faisal Shafait, Ajmal Mian

We propose a hyperspectral image super resolution approach that fuses a high resolution image with the low resolution hyperspectral image using non-parametric Bayesian sparse representation.

Hyperspectral Image Super-Resolution Image Super-Resolution

Paper
Add Code

Discriminative Bayesian Dictionary Learning for Classification

no code implementations • 27 Mar 2015 • Naveed Akhtar, Faisal Shafait, Ajmal Mian

We propose a Bayesian approach to learn discriminative dictionaries for sparse representation of data.

Action Recognition Classification +3

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.