1 code implementation • 27 Mar 2024 • Philip Kenneweg, Alexander Schulz, Sarah Schröder, Barbara Hammer
We combine the learning rate distributions thus found and show that they generalize to better performance with respect to the problem of catastrophic forgetting.
1 code implementation • 27 Mar 2024 • Philip Kenneweg, Sarah Schröder, Alexander Schulz, Barbara Hammer
It is problematic that most debiasing approaches are directly transferred from word embeddings, therefore these approaches fail to take into account the nonlinear nature of sentence embedders and the embeddings they produce.
1 code implementation • 27 Mar 2024 • Philip Kenneweg, Sarah Schröder, Barbara Hammer
Pre training of language models on large text corpora is common practice in Natural Language Processing.
1 code implementation • 27 Mar 2024 • Philip Kenneweg, Tristan Kenneweg, Barbara Hammer
In recent studies, line search methods have shown significant improvements in the performance of traditional stochastic gradient descent techniques, eliminating the need for a specific learning rate schedule.
1 code implementation • 27 Mar 2024 • Philip Kenneweg, Leonardo Galli, Tristan Kenneweg, Barbara Hammer
Recent works have shown that line search methods greatly increase performance of traditional stochastic gradient descent methods on a variety of datasets and architectures [1], [2].
1 code implementation • 26 Feb 2024 • Tristan Kenneweg, Philip Kenneweg, Barbara Hammer
We use a dataset created this way for the development and evaluation of a boolean agent RAG setup: A system in which a LLM can decide whether to query a vector database or not, thus saving tokens on questions that can be answered with internal knowledge.
1 code implementation • 21 Nov 2022 • Dominik Stallmann, Philip Kenneweg, Barbara Hammer
We make the data sets available at https://pub. uni-bielefeld. de/record/2960030.
no code implementations • 28 Mar 2022 • Sarah Schröder, Alexander Schulz, Philip Kenneweg, Robert Feldhans, Fabian Hinder, Barbara Hammer
Furthermore, we thoroughly investigate the existing cosine-based scores and their limitations in order to show why these scores fail to report biases in some situations.
no code implementations • 15 Nov 2021 • Sarah Schröder, Alexander Schulz, Philip Kenneweg, Robert Feldhans, Fabian Hinder, Barbara Hammer
However, lately some works have raised doubts about these metrics showing that even though such metrics report low biases, other tests still show biases.