Search Results for author: Arush Tagade

Found 4 papers, 2 papers with code

Scalable and Transferable Black-Box Jailbreaks for Language Models via Persona Modulation

no code implementations6 Nov 2023 Rusheb Shah, Quentin Feuillade--Montixi, Soroush Pour, Arush Tagade, Stephen Casper, Javier Rando

Despite efforts to align large language models to produce harmless responses, they are still vulnerable to jailbreak prompts that elicit unrestricted behaviour.

GPT-4 Language Modelling

Prototype Generation: Robust Feature Visualisation for Data Independent Interpretability

1 code implementation29 Sep 2023 Arush Tagade, Jessica Rumbelow

We introduce Prototype Generation, a stricter and more robust form of feature visualisation for model-agnostic, data-independent interpretability of image classification models.

Image Classification

Why do CNNs excel at feature extraction? A mathematical explanation

no code implementations3 Jul 2023 Vinoth Nandakumar, Arush Tagade, Tongliang Liu

Over the past decade deep learning has revolutionized the field of computer vision, with convolutional neural network models proving to be very effective for image classification benchmarks.

Classification Image Classification

Cannot find the paper you are looking for? You can Submit a new open access paper.