Search Results for author: Sahar Abdelnabi

Found 10 papers, 6 papers with code

Can LLMs Separate Instructions From Data? And What Do We Even Mean By That?

1 code implementation • 11 Mar 2024 • Egor Zverev, Sahar Abdelnabi, Mario Fritz, Christoph H. Lampert

Instruction-tuned Large Language Models (LLMs) have achieved breakthrough results, opening countless new possibilities for many practical applications.

Paper
Code

Exploring Value Biases: How LLMs Deviate Towards the Ideal

no code implementations • 16 Feb 2024 • Sarath Sivaprasad, Pramod Kaushik, Sahar Abdelnabi, Mario Fritz

We study this sampling of LLMs in light of value bias and show that the sampling of LLMs tends to favour high-value options.

Paper
Add Code

LLM-Deliberation: Evaluating LLMs with Interactive Multi-Agent Negotiation Games

2 code implementations • 29 Sep 2023 • Sahar Abdelnabi, Amr Gomaa, Sarath Sivaprasad, Lea Schönherr, Mario Fritz

There is a growing interest in using Large Language Models (LLMs) as agents to tackle real-world tasks that may require assessing complex situations.

Decision Making

Paper
Code

Not what you've signed up for: Compromising Real-World LLM-Integrated Applications with Indirect Prompt Injection

2 code implementations • 23 Feb 2023 • Kai Greshake, Sahar Abdelnabi, Shailesh Mishra, Christoph Endres, Thorsten Holz, Mario Fritz

Large Language Models (LLMs) are increasingly being integrated into various applications.

Code Completion Computer Security +2

1,655

Paper
Code

Fact-Saboteurs: A Taxonomy of Evidence Manipulation Attacks against Fact-Verification Systems

1 code implementation • 7 Sep 2022 • Sahar Abdelnabi, Mario Fritz

In this work, we assume an adversary that automatically tampers with the online evidence in order to disrupt the fact-checking model via camouflaging the relevant evidence or planting a misleading one.

Fact Checking Fact Verification +1

Paper
Code

Open-Domain, Content-based, Multi-modal Fact-checking of Out-of-Context Images via Online Resources

no code implementations • CVPR 2022 • Sahar Abdelnabi, Rakibul Hasan, Mario Fritz

Misinformation is now a major problem due to its potential high risks to our core democratic and societal values and orders.

Fact Checking Misinformation

Paper
Add Code

"What's in the box?!": Deflecting Adversarial Attacks by Randomly Deploying Adversarially-Disjoint Models

no code implementations • 9 Feb 2021 • Sahar Abdelnabi, Mario Fritz

Machine learning models are now widely deployed in real-world applications.

Paper
Add Code

Adversarial Watermarking Transformer: Towards Tracing Text Provenance with Data Hiding

1 code implementation • 7 Sep 2020 • Sahar Abdelnabi, Mario Fritz

In this paper, we study natural language watermarking as a defense to help better mark and trace the provenance of text.

Denoising Text Generation

Paper
Code

Artificial Fingerprinting for Generative Models: Rooting Deepfake Attribution in Training Data

1 code implementation • ICCV 2021 • Ning Yu, Vladislav Skripniuk, Sahar Abdelnabi, Mario Fritz

Thus, we seek a proactive and sustainable solution on deepfake detection, that is agnostic to the evolution of generative models, by introducing artificial fingerprints into the models.

DeepFake Detection Face Swapping +2

Paper
Code

VisualPhishNet: Zero-Day Phishing Website Detection by Visual Similarity

no code implementations • 1 Sep 2019 • Sahar Abdelnabi, Katharina Krombholz, Mario Fritz

Phishing websites are still a major threat in today's Internet ecosystem.

Phishing Website Detection valid

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.