Language model pretraining has led to significant performance gains, but careful comparison between different approaches is challenging.
Transformers have the potential to learn longer-term dependencies, but are limited by a fixed-length context in the setting of language modeling.
SOTA for Language Modelling on Hutter Prize
Natural language processing tasks, such as question answering, machine translation, reading comprehension, and summarization, are typically approached with supervised learning on task-specific datasets.
SOTA for Language Modelling on Text8 (using extra training data)
We introduce a new language representation model called BERT, which stands for Bidirectional Encoder Representations from Transformers.
SOTA for Common Sense Reasoning on SWAG