Search Results for author: Sean Hendryx

Found 3 papers, 1 papers with code

A Careful Examination of Large Language Model Performance on Grade School Arithmetic

no code implementations • 1 May 2024 • Hugh Zhang, Jeff Da, Dean Lee, Vaughn Robinson, Catherine Wu, Will Song, Tiffany Zhao, Pranav Raja, Dylan Slack, Qin Lyu, Sean Hendryx, Russell Kaplan, Michele Lunati, Summer Yue

Large language models (LLMs) have achieved impressive success on many benchmarks for mathematical reasoning.

GSM8K Language Modelling +3

Paper
Add Code

Out-of-Distribution Detection & Applications With Ablated Learned Temperature Energy

1 code implementation • 22 Jan 2024 • Will LeVine, Benjamin Pikus, Jacob Phillips, Berk Norman, Fernando Amat Gil, Sean Hendryx

As deep neural networks become adopted in high-stakes domains, it is crucial to be able to identify when inference inputs are Out-of-Distribution (OOD) so that users can be alerted of likely drops in performance and calibration despite high confidence.

object-detection Object Detection +2

Paper
Code

A Baseline Analysis of Reward Models' Ability To Accurately Analyze Foundation Models Under Distribution Shift

no code implementations • 21 Nov 2023 • Will LeVine, Benjamin Pikus, Anthony Chen, Sean Hendryx

These reward models are additionally used at inference-time to estimate LLM responses' adherence to those desired behaviors.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.