Search Results for author: Oliver Bentham

Found 1 paper, 0 papers with code

Chain-of-Thought Unfaithfulness as Disguised Accuracy

No code implementations · 22 Feb 2024 · Oliver Bentham, Nathan Stringham, Ana Marasović

Understanding the extent to which Chain-of-Thought (CoT) generations align with a large language model's (LLM) internal computations is critical for deciding whether to trust an LLM's output.
