Search Results for author: Adib Hasan

Found 1 paper, 1 paper with code

Pruning for Protection: Increasing Jailbreak Resistance in Aligned LLMs Without Fine-Tuning

1 code implementation • 19 Jan 2024 • Adib Hasan, Ileana Rugina, Alex Wang

Large Language Models (LLMs) are susceptible to "jailbreaking" prompts, which can induce the generation of harmful content.
