Search Results for author: Dan Valentine

Found 3 papers, 3 papers with code

Debating with More Persuasive LLMs Leads to More Truthful Answers

1 code implementation • 9 Feb 2024 • Akbir Khan, John Hughes, Dan Valentine, Laura Ruis, Kshitij Sachan, Ansh Radhakrishnan, Edward Grefenstette, Samuel R. Bowman, Tim Rocktäschel, Ethan Perez

In anticipation of this, we ask: can weaker models assess the correctness of stronger models?

Persuasiveness

Paper
Code

Structured World Representations in Maze-Solving Transformers

1 code implementation • 5 Dec 2023 • Michael Igorevich Ivanitskiy, Alex F. Spies, Tilman Räuker, Guillaume Corlouer, Chris Mathwin, Lucia Quirke, Can Rager, Rusheb Shah, Dan Valentine, Cecilia Diniz Behn, Katsumi Inoue, Samy Wu Fung

Transformer models underpin many recent advances in practical machine learning applications, yet understanding their internal behavior continues to elude researchers.

valid

Paper
Code

A Configurable Library for Generating and Manipulating Maze Datasets

1 code implementation • 19 Sep 2023 • Michael Igorevich Ivanitskiy, Rusheb Shah, Alex F. Spies, Tilman Räuker, Dan Valentine, Can Rager, Lucia Quirke, Chris Mathwin, Guillaume Corlouer, Cecilia Diniz Behn, Samy Wu Fung

Understanding how machine learning models respond to distributional shifts is a key research challenge.

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.