Search Results for author: Danny Halawi

Found 3 papers, 2 papers with code

Approaching Human-Level Forecasting with Language Models

no code implementations • 28 Feb 2024 • Danny Halawi, Fred Zhang, Chen Yueh-Han, Jacob Steinhardt

In this work, we study whether language models (LMs) can forecast at the level of competitive human forecasters.

Decision Making, Retrieval

Overthinking the Truth: Understanding how Language Models Process False Demonstrations

1 code implementation • 18 Jul 2023 • Danny Halawi, Jean-Stanislas Denain, Jacob Steinhardt

The first phenomenon, overthinking, appears when we decode predictions from intermediate layers, given correct vs. incorrect few-shot demonstrations.

Few-Shot Learning

Eliciting Latent Predictions from Transformers with the Tuned Lens

2 code implementations • 14 Mar 2023 • Nora Belrose, Zach Furman, Logan Smith, Danny Halawi, Igor Ostrovsky, Lev McKinney, Stella Biderman, Jacob Steinhardt

We analyze transformers from the perspective of iterative inference, seeking to understand how model predictions are refined layer by layer.

Language Modelling
