Search Results for author: Philip Kenneweg

Found 9 papers, 7 papers with code

Intelligent Learning Rate Distribution to reduce Catastrophic Forgetting in Transformers

1 code implementation • 27 Mar 2024 • Philip Kenneweg, Alexander Schulz, Sarah Schröder, Barbara Hammer

We combine the learning rate distributions thus found and show that they generalize to better performance with respect to the problem of catastrophic forgetting.

Hyperparameter Optimization

Debiasing Sentence Embedders through Contrastive Word Pairs

1 code implementation • 27 Mar 2024 • Philip Kenneweg, Sarah Schröder, Alexander Schulz, Barbara Hammer

It is problematic that most debiasing approaches are directly transferred from word embeddings; as a result, these approaches fail to take into account the nonlinear nature of sentence embedders and the embeddings they produce.

Sentence, Sentence Embeddings, +1

Neural Architecture Search for Sentence Classification with BERT

1 code implementation • 27 Mar 2024 • Philip Kenneweg, Sarah Schröder, Barbara Hammer

Pre-training of language models on large text corpora is common practice in Natural Language Processing.

Classification, Neural Architecture Search, +2

Improving Line Search Methods for Large Scale Neural Network Training

1 code implementation • 27 Mar 2024 • Philip Kenneweg, Tristan Kenneweg, Barbara Hammer

In recent studies, line search methods have shown significant improvements in the performance of traditional stochastic gradient descent techniques, eliminating the need for a specific learning rate schedule.

Faster Convergence for Transformer Fine-tuning with Line Search Methods

1 code implementation • 27 Mar 2024 • Philip Kenneweg, Leonardo Galli, Tristan Kenneweg, Barbara Hammer

Recent works have shown that line search methods greatly increase the performance of traditional stochastic gradient descent methods on a variety of datasets and architectures [1], [2].

Retrieval Augmented Generation Systems: Automatic Dataset Creation, Evaluation and Boolean Agent Setup

1 code implementation • 26 Feb 2024 • Tristan Kenneweg, Philip Kenneweg, Barbara Hammer

We use a dataset created this way for the development and evaluation of a boolean agent RAG setup: a system in which an LLM can decide whether or not to query a vector database, thus saving tokens on questions that can be answered with internal knowledge.

Language Modelling, Large Language Model, +1

The SAME score: Improved cosine based bias score for word embeddings

no code implementations • 28 Mar 2022 • Sarah Schröder, Alexander Schulz, Philip Kenneweg, Robert Feldhans, Fabian Hinder, Barbara Hammer

Furthermore, we thoroughly investigate the existing cosine-based scores and their limitations in order to show why these scores fail to report biases in some situations.

Sentence, Sentence Embeddings, +1

Evaluating Metrics for Bias in Word Embeddings

no code implementations • 15 Nov 2021 • Sarah Schröder, Alexander Schulz, Philip Kenneweg, Robert Feldhans, Fabian Hinder, Barbara Hammer

However, some recent works have raised doubts about these metrics, showing that even when such metrics report low biases, other tests still reveal biases.

Sentence, Sentence Embeddings, +1
