Search Results for author: Max Cembalest

Found 2 papers, 0 papers with code

Reckoning with the Disagreement Problem: Explanation Consensus as a Training Objective

no code implementations23 Mar 2023 Avi Schwarzschild, Max Cembalest, Karthik Rao, Keegan Hines, John Dickerson

We observe on three datasets that we can train a model with this loss term to improve explanation consensus on unseen data, and see improved consensus between explainers other than those used in the loss term.

Tensions Between the Proxies of Human Values in AI

no code implementations14 Dec 2022 Teresa Datta, Daniel Nissani, Max Cembalest, Akash Khanna, Haley Massa, John P. Dickerson

Motivated by mitigating potentially harmful impacts of technologies, the AI community has formulated and accepted mathematical definitions for certain pillars of accountability: e. g. privacy, fairness, and model transparency.

Fairness

Cannot find the paper you are looking for? You can Submit a new open access paper.