no code implementations • 13 Aug 2023 • Aaditya Naik, Adam Stein, Yinjun Wu, Mayur Naik, Eric Wong
Finding errors in machine learning applications requires a thorough exploration of their behavior over data.
no code implementations • 1 Jun 2023 • Shreya Havaldar, Adam Stein, Eric Wong, Lyle Ungar
Meaningfully comparing language models is challenging with current explanation methods.
no code implementations • 25 May 2023 • Adam Stein, Yinjun Wu, Eric Wong, Mayur Naik
It is well-known that real-world changes constituting distribution shift adversely affect model performance.
1 code implementation • 9 Feb 2023 • Yinjun Wu, Adam Stein, Jacob Gardner, Mayur Naik
In this paper, we study how to learn to identify such a meta sample set from a large, imperfect training set, that is subsequently cleaned and used to optimize performance in the meta re-weighting setting.
1 code implementation • 31 Jan 2023 • Qing Lyu, Shreya Havaldar, Adam Stein, Li Zhang, Delip Rao, Eric Wong, Marianna Apidianaki, Chris Callison-Burch
While Chain-of-Thought (CoT) prompting boosts Language Models' (LM) performance on a gamut of complex reasoning tasks, the generated reasoning chain does not necessarily reflect how the model arrives at the answer (aka.