Search Results for author: Yukai Zhou

Found 1 paper, 0 papers with code

Don't Say No: Jailbreaking LLM by Suppressing Refusal

no code implementations • 25 Apr 2024 • Yukai Zhou, Wenjie Wang

However, GCG, the typical attack in this category, has a very limited attack success rate.
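The paper's title suggests augmenting the attack objective with a term that suppresses refusal. A minimal sketch of that idea, assuming the loss combines a standard affirmative-target term with a refusal-suppression term (function names and the `alpha` weight are hypothetical, not from the paper):

```python
import math

def token_nll(probs, token_ids):
    """Negative log-likelihood of a token sequence under a
    toy per-token probability table (token_id -> probability)."""
    return -sum(math.log(probs[t]) for t in token_ids)

def refusal_suppression_loss(probs, target_ids, refusal_ids, alpha=1.0):
    """Sketch: minimize NLL of the affirmative target while
    maximizing NLL of refusal tokens (i.e., suppressing refusal)."""
    return token_nll(probs, target_ids) - alpha * token_nll(probs, refusal_ids)

# Toy example: token 0 = affirmative, token 1 = refusal.
probs = {0: 0.7, 1: 0.2, 2: 0.1}
loss = refusal_suppression_loss(probs, target_ids=[0], refusal_ids=[1])
```

An optimizer such as GCG's greedy coordinate search could then minimize this combined loss over adversarial suffix tokens instead of the affirmative-target loss alone.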

Natural Language Inference