1 code implementation • 27 Mar 2024 • Shawn Im, Yixuan Li
Our work makes an initial attempt to theoretically analyze the learning dynamics of human preference alignment.
no code implementations • 10 Dec 2023 • Shawn Im, Jacob Andreas, Yilun Zhou
One of the motivations for explainable AI is to allow humans to make better and more informed decisions regarding the use and deployment of AI models.