no code implementations • 4 Mar 2024 • Leon Weber-Genzel, Siyao Peng, Marie-Catherine de Marneffe, Barbara Plank
To fill this gap, we introduce a systematic methodology and a new dataset, VariErr (variation versus error), focusing on the NLI task in English.
1 code implementation • 22 Feb 2024 • Xinpeng Wang, Bolei Ma, Chengzhi Hu, Leon Weber-Genzel, Paul Röttger, Frauke Kreuter, Dirk Hovy, Barbara Plank
The open-ended nature of language generation makes the evaluation of autoregressive large language models (LLMs) challenging.
no code implementations • 19 Feb 2024 • Mario Sänger, Samuele Garda, Xing David Wang, Leon Weber-Genzel, Pia Droop, Benedikt Fuchs, Alan Akbik, Ulf Leser
Instead, they are applied in the wild, i.e., on application-dependent text collections different from those used for the tools' training, varying, e.g., in focus, genre, style, and text type.
1 code implementation • 4 Sep 2023 • Leon Weber-Genzel, Robert Litschko, Ekaterina Artemova, Barbara Plank
Our results show that choosing the right AED method and model size is indeed crucial, and we derive practical recommendations for how to use AED methods to clean instruction-tuning data.
1 code implementation • 22 Aug 2023 • Samuele Garda, Leon Weber-Genzel, Robert Martin, Ulf Leser
Biomedical entity linking (BEL) is the task of grounding entity mentions to a knowledge base.