Search Results for author: Do Duc Anh

Found 1 papers, 1 papers with code

Language Models are Homer Simpson! Safety Re-Alignment of Fine-tuned Language Models through Task Arithmetic

3 code implementations19 Feb 2024 Rishabh Bhardwaj, Do Duc Anh, Soujanya Poria

We demonstrate the effectiveness of RESTA in both parameter-efficient and full fine-tuning, covering a wide range of downstream tasks, including instruction following in Chinese, English, and Hindi, as well as problem-solving capabilities in Code and Math.

Instruction Following Math

Cannot find the paper you are looking for? You can Submit a new open access paper.