
Don't Lie to Me: Avoiding Malicious Explanations with STEALTH

STEALTH is a method for using an AI-generated model without suffering from malicious attacks (i.e., lying) or the associated unfairness issues. After recursively bi-clustering the data, STEALTH asks the AI model a limited number of queries about class labels: one per data cluster. STEALTH asks so few queries that a malicious algorithm (a) cannot detect that it is being audited and (b) does not know when to lie.
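The query-frugal idea above can be sketched in a few lines. This is a minimal illustration, not the paper's implementation: the median-split bi-clustering, the `min_size` stopping rule, and the `query_model` callback are all assumptions standing in for STEALTH's actual clustering and querying machinery.

```python
def bi_cluster(rows, min_size=4):
    """Recursively split rows in two until clusters are small.
    Splits on the feature with the widest spread -- a simple
    stand-in (an assumption) for the paper's bi-clustering."""
    if len(rows) <= min_size:
        return [rows]
    dims = len(rows[0])
    spreads = [max(r[d] for r in rows) - min(r[d] for r in rows)
               for d in range(dims)]
    d = spreads.index(max(spreads))          # widest-spread feature
    rows = sorted(rows, key=lambda r: r[d])
    mid = len(rows) // 2
    return bi_cluster(rows[:mid], min_size) + bi_cluster(rows[mid:], min_size)

def stealth_labels(rows, query_model, min_size=4):
    """Ask the (possibly malicious) model ONE label per leaf cluster,
    then propagate that label to every row in the cluster."""
    labels, queries = {}, 0
    for cluster in bi_cluster(rows, min_size):
        rep = cluster[len(cluster) // 2]     # one representative per cluster
        label = query_model(rep)             # the only queries the model sees
        queries += 1
        for row in cluster:
            labels[tuple(row)] = label
    return labels, queries
```

With 32 rows and `min_size=4`, the recursion yields 8 leaf clusters, so the model is queried only 8 times instead of 32; too few, scattered probes make it hard for a malicious model to recognize an audit or choose when to lie.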
