Model Editing

51 papers with code • 0 benchmarks • 1 dataset

Most implemented papers

Language Anisotropic Cross-Lingual Model Editing

franklear/lime 25 May 2022

On the newly defined cross-lingual model editing task, we empirically demonstrate the failure of monolingual baselines in propagating the edit to multiple languages and the effectiveness of the proposed language anisotropic model editing.

Memory-Based Model Editing at Scale

eric-mitchell/serac 13 Jun 2022

We find that only SERAC achieves high performance on all three problems, consistently outperforming existing approaches to model editing by a significant margin.

Aging with GRACE: Lifelong Model Editing with Discrete Key-Value Adaptors

thartvigsen/grace NeurIPS 2023

We propose GRACE, a lifelong model editing method, which implements spot-fixes on streaming errors of a deployed model, ensuring minimal impact on unrelated inputs.
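As a rough illustration of the key-value idea named in the title, the toy sketch below stores edits as (key, value) pairs and overrides a wrapped function's output only when an input falls within a fixed radius of a stored key, so inputs far from any edit are left untouched. The class name, the Euclidean distance, the `radius` parameter, and the stand-in `base_model` are all assumptions made for illustration; this is not the implementation in thartvigsen/grace.

```python
import math
from typing import Callable, List, Sequence, Tuple

Vector = Sequence[float]


class ToyKeyValueAdaptor:
    """Hypothetical wrapper: returns a stored value near a stored key, else defers to the base function."""

    def __init__(self, base: Callable[[Vector], str], radius: float) -> None:
        self.base = base          # the unmodified "deployed model" (a stand-in here)
        self.radius = radius      # assumed fixed radius around every key
        self.entries: List[Tuple[Tuple[float, ...], str]] = []

    def add_edit(self, key: Vector, value: str) -> None:
        # Record one spot-fix: inputs close to `key` should now yield `value`.
        self.entries.append((tuple(key), value))

    def __call__(self, x: Vector) -> str:
        # Find the nearest stored key by Euclidean distance.
        best_value, best_dist = None, math.inf
        for key, value in self.entries:
            dist = math.dist(key, x)
            if dist < best_dist:
                best_value, best_dist = value, dist
        # Override only inside the radius; otherwise the base output is unchanged.
        if best_value is not None and best_dist <= self.radius:
            return best_value
        return self.base(x)


def base_model(x: Vector) -> str:
    # Stand-in for a deployed model: classifies by the sign of the coordinate sum.
    return "positive" if sum(x) > 0 else "negative"


adaptor = ToyKeyValueAdaptor(base_model, radius=0.5)
adaptor.add_edit([1.0, 1.0], "negative")  # fix one hypothetical error near (1, 1)
```

With this setup, `adaptor([1.1, 0.9])` returns the edited value, while `adaptor([3.0, 3.0])` falls outside the radius and returns whatever `base_model` returns.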

Transformer-Patcher: One Mistake worth One Neuron

zeroyuhuang/transformer-patcher 24 Jan 2023

Our method outperforms previous fine-tuning and HyperNetwork-based methods and achieves state-of-the-art performance for Sequential Model Editing (SME).

Editing Implicit Assumptions in Text-to-Image Diffusion Models

bahjat-kawar/time-diffusion ICCV 2023

Our Text-to-Image Model Editing method, TIME for short, receives a pair of inputs: a "source" under-specified prompt for which the model makes an implicit assumption (e.g., "a pack of roses"), and a "destination" prompt that describes the same setting, but with a specified desired attribute (e.g., "a pack of blue roses").
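The input pair described above could be represented as a small record. The sketch below is only one possible shape: the class name `PromptEditRequest`, its field names, and the validation rules are hypothetical and are not taken from bahjat-kawar/time-diffusion.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PromptEditRequest:
    """Hypothetical container for one text-to-image edit (not part of the TIME codebase)."""

    source: str       # under-specified prompt, e.g. "a pack of roses"
    destination: str  # same setting with the desired attribute made explicit

    def __post_init__(self) -> None:
        # Assumed sanity checks: both prompts present, and the edit actually changes something.
        if not self.source.strip() or not self.destination.strip():
            raise ValueError("both prompts must be non-empty")
        if self.source.strip() == self.destination.strip():
            raise ValueError("destination must differ from source")


request = PromptEditRequest(
    source="a pack of roses",
    destination="a pack of blue roses",
)
```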

Detecting Edit Failures In Large Language Models: An Improved Specificity Benchmark

apartresearch/specificityplus 27 May 2023

We use this improved benchmark to evaluate recent model editing techniques and find that they suffer from low specificity.

Evaluating the Ripple Effects of Knowledge Editing in Language Models

edenbiran/rippleedits 24 Jul 2023

This has led to the development of various editing methods that allow updating facts encoded by the model.

PMET: Precise Model Editing in a Transformer

xpq-tech/pmet 17 Aug 2023

To achieve more precise model editing, we analyze hidden states of MHSA and FFN, finding that MHSA encodes certain general knowledge extraction patterns.

PatchBackdoor: Backdoor Attack against Deep Neural Networks without Model Modification

xaiveryuan/patchbackdoor 22 Aug 2023

However, most backdoor attacks have to modify the neural network models through training with poisoned data and/or direct model editing, which leads to a common but false belief that backdoor attacks can be easily avoided by properly protecting the model.