Search Results for author: Karthik Rao

Found 1 paper, 0 papers with code

Reckoning with the Disagreement Problem: Explanation Consensus as a Training Objective

no code implementations • 23 Mar 2023 • Avi Schwarzschild, Max Cembalest, Karthik Rao, Keegan Hines, John Dickerson

On three datasets, we observe that training a model with this loss term improves explanation consensus on unseen data, and also improves consensus between explainers other than those used in the loss term.
