no code implementations • 30 Jan 2024 • Shrayani Mondal, Rishabh Garodia, Arbaaz Qureshi, Taesung Lee, Youngja Park
We leverage the potential of generative language models to discover human-interpretable descriptors present in a dataset and use an unsupervised approach to explain neurons with these descriptors.
1 code implementation • 2 May 2023 • Sunjae Kwon, Rishabh Garodia, Minhwa Lee, Zhichao Yang, Hong Yu
Specifically, we suggest employing Bayesian inference to incorporate the sense definitions when sense information of the answer is not provided.