Search Results for author: Adam Pearce

Found 5 papers, 2 papers with code

Patchscopes: A Unifying Framework for Inspecting Hidden Representations of Language Models

1 code implementation11 Jan 2024 Asma Ghandeharioun, Avi Caciularu, Adam Pearce, Lucas Dixon, Mor Geva

Inspecting the information encoded in hidden representations of large language models (LLMs) can explain models' behavior and verify their alignment with human values.

Acquisition of Chess Knowledge in AlphaZero

no code implementations17 Nov 2021 Thomas McGrath, Andrei Kapishnikov, Nenad Tomašev, Adam Pearce, Demis Hassabis, Been Kim, Ulrich Paquet, Vladimir Kramnik

In this work we provide evidence that human knowledge is acquired by the AlphaZero neural network as it trains on the game of chess.

Game of Chess

An Interpretability Illusion for BERT

no code implementations14 Apr 2021 Tolga Bolukbasi, Adam Pearce, Ann Yuan, Andy Coenen, Emily Reif, Fernanda Viégas, Martin Wattenberg

We describe an "interpretability illusion" that arises when analyzing the BERT model.

Cannot find the paper you are looking for? You can Submit a new open access paper.