no code implementations • NeurIPS 2023 • Roland S. Zimmermann, Thomas Klein, Wieland Brendel
We use a psychophysical paradigm to quantify one form of mechanistic interpretability for a diverse suite of nine models and find no scaling effect for interpretability - neither for model nor dataset size.