Search Results for author: Sahar Abdelnabi

Found 10 papers, 6 papers with code

Can LLMs Separate Instructions From Data? And What Do We Even Mean By That?

1 code implementation11 Mar 2024 Egor Zverev, Sahar Abdelnabi, Mario Fritz, Christoph H. Lampert

Instruction-tuned Large Language Models (LLMs) have achieved breakthrough results, opening countless new possibilities for many practical applications.

Exploring Value Biases: How LLMs Deviate Towards the Ideal

no code implementations16 Feb 2024 Sarath Sivaprasad, Pramod Kaushik, Sahar Abdelnabi, Mario Fritz

We study this sampling of LLMs in light of value bias and show that the sampling of LLMs tends to favour high-value options.

LLM-Deliberation: Evaluating LLMs with Interactive Multi-Agent Negotiation Games

2 code implementations29 Sep 2023 Sahar Abdelnabi, Amr Gomaa, Sarath Sivaprasad, Lea Schönherr, Mario Fritz

There is a growing interest in using Large Language Models (LLMs) as agents to tackle real-world tasks that may require assessing complex situations.

Decision Making

Fact-Saboteurs: A Taxonomy of Evidence Manipulation Attacks against Fact-Verification Systems

1 code implementation7 Sep 2022 Sahar Abdelnabi, Mario Fritz

In this work, we assume an adversary that automatically tampers with the online evidence in order to disrupt the fact-checking model via camouflaging the relevant evidence or planting a misleading one.

Fact Checking Fact Verification +1

Adversarial Watermarking Transformer: Towards Tracing Text Provenance with Data Hiding

1 code implementation7 Sep 2020 Sahar Abdelnabi, Mario Fritz

In this paper, we study natural language watermarking as a defense to help better mark and trace the provenance of text.

Denoising Text Generation

Artificial Fingerprinting for Generative Models: Rooting Deepfake Attribution in Training Data

1 code implementation ICCV 2021 Ning Yu, Vladislav Skripniuk, Sahar Abdelnabi, Mario Fritz

Thus, we seek a proactive and sustainable solution on deepfake detection, that is agnostic to the evolution of generative models, by introducing artificial fingerprints into the models.

DeepFake Detection Face Swapping +2

Cannot find the paper you are looking for? You can Submit a new open access paper.