Search Results for author: Wes Gurnee

Found 5 papers, 5 papers with code

Universal Neurons in GPT2 Language Models

1 code implementation • 22 Jan 2024 • Wes Gurnee, Theo Horsley, Zifan Carl Guo, Tara Rezaei Kheirkhah, Qinyi Sun, Will Hathaway, Neel Nanda, Dimitris Bertsimas

In other words, are neural mechanisms universal across different models?

Training Dynamics of Contextual N-Grams in Language Models

1 code implementation • 1 Nov 2023 • Lucia Quirke, Lovis Heindrich, Wes Gurnee, Neel Nanda

We show that this neuron exists within a broader contextual n-gram circuit: we find late layer neurons which recognize and continue n-grams common in German text, but which only activate if the German neuron is active.

Language Models Represent Space and Time

1 code implementation • 3 Oct 2023 • Wes Gurnee, Max Tegmark

The capabilities of large language models (LLMs) have sparked debate over whether such systems just learn an enormous collection of superficial statistics or a set of more coherent and grounded representations that reflect the real world.

Finding Neurons in a Haystack: Case Studies with Sparse Probing

2 code implementations • 2 May 2023 • Wes Gurnee, Neel Nanda, Matthew Pauly, Katherine Harvey, Dmitrii Troitskii, Dimitris Bertsimas

Despite rapid adoption and deployment of large language models (LLMs), the internal computations of these models remain opaque and poorly understood.

Learning Sparse Nonlinear Dynamics via Mixed-Integer Optimization

1 code implementation • 1 Jun 2022 • Dimitris Bertsimas, Wes Gurnee

Discovering governing equations of complex dynamical systems directly from data is a central problem in scientific machine learning.

Model Discovery • regression
