Search Results for author: Yeu-Tong Lau

Found 1 paper, 0 papers with code

An Adversarial Example for Direct Logit Attribution: Memory Management in gelu-4l

no code implementations • 11 Oct 2023 • James Dao, Yeu-Tong Lau, Can Rager, Jett Janiak

Memory management here means clearing residual stream directions set by earlier layers: a component reads in that information and writes out its negative.
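
The read-then-negate idea can be illustrated with a toy sketch. It assumes the information to clear lies along a single known direction and represents vectors as plain Python lists; the helper names `dot` and `erase_direction` are hypothetical and do not come from the paper or from any library.

```python
# Toy illustration only: plain lists stand in for residual stream vectors.

def dot(u, v):
    """Inner product of two equal-length lists."""
    return sum(a * b for a, b in zip(u, v))

def erase_direction(residual, direction):
    """Return the update a hypothetical 'erasing' component would write.

    The component reads how much of `direction` is present in `residual`
    and writes the negative of that amount along the same direction.
    """
    norm_sq = dot(direction, direction)
    coeff = dot(residual, direction) / norm_sq  # read step
    return [-coeff * d for d in direction]      # write step (negated)

# Assume an earlier layer wrote 3 units along `direction`.
direction = [1.0, 0.0, 0.0]
residual = [3.0, 2.0, -1.0]

update = erase_direction(residual, direction)
cleared = [r + u for r, u in zip(residual, update)]

# After the update, the residual has no component along `direction`,
# while the other coordinates are unchanged.
print(dot(cleared, direction))
```

In this sketch the component's own output points opposite to what the earlier layer wrote, so its contribution only makes sense when read together with the contribution it cancels.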

