Search Results for author: Yuji Wang

Found 1 papers, 1 papers with code

Studious Bob Fight Back Against Jailbreaking via Prompt Adversarial Tuning

1 code implementation9 Feb 2024 Yichuan Mo, Yuji Wang, Zeming Wei, Yisen Wang

To our knowledge, we are the first to implement defense from the perspective of prompt tuning.

Cannot find the paper you are looking for? You can Submit a new open access paper.