Search Results for author: Alan Akbik

Found 40 papers, 11 papers with code

NoiseBench: Benchmarking the Impact of Real Label Noise on Named Entity Recognition

no code implementations • 13 May 2024 • Elena Merdjanovska, Ansar Aynetdinov, Alan Akbik

Available training data for named entity recognition (NER) often contains a significant percentage of incorrect labels for entity types and entity boundaries.

Benchmarking named-entity-recognition +2

Paper
Add Code

PECC: Problem Extraction and Coding Challenges

1 code implementation • 29 Apr 2024 • Patrick Haller, Jonas Golde, Alan Akbik

Recent advancements in large language models (LLMs) have showcased their exceptional abilities across various tasks, such as code generation, problem-solving and reasoning.

Ranked #1 on Code Generation on PECC

Code Generation Math +1

Paper
Code

BEAR: A Unified Framework for Evaluating Relational Knowledge in Causal and Masked Language Models

2 code implementations • 5 Apr 2024 • Jacek Wiland, Max Ploner, Alan Akbik

We release the BEAR datasets and an open-source framework that implements the probing approach to the research community to facilitate the evaluation and development of LMs.

Ranked #1 on Factual probe on BEAR-probe

Factual probe General Knowledge +3

254

Paper
Code

Fundus: A Simple-to-Use News Scraper Optimized for High Quality Extractions

2 code implementations • 22 Mar 2024 • Max Dallabetta, Conrad Dobberstein, Adrian Breiding, Alan Akbik

This paper introduces Fundus, a user-friendly news scraper that enables users to obtain millions of high-quality news articles with just a few lines of code.

119

Paper
Code

Large-Scale Label Interpretation Learning for Few-Shot Named Entity Recognition

no code implementations • 21 Mar 2024 • Jonas Golde, Felix Hamborg, Alan Akbik

In an initial label interpretation learning phase, the model learns to interpret such verbalized descriptions of entity types.

Entity Linking few-shot-ner +4

Paper
Add Code

HunFlair2 in a cross-corpus evaluation of biomedical named entity recognition and normalization tools

no code implementations • 19 Feb 2024 • Mario Sänger, Samuele Garda, Xing David Wang, Leon Weber-Genzel, Pia Droop, Benedikt Fuchs, Alan Akbik, Ulf Leser

Instead, they are applied in the wild, i. e., on application-dependent text collections different from those used for the tools' training, varying, e. g., in focus, genre, style, and text type.

Cross-corpus named-entity-recognition +1

Paper
Add Code

SemScore: Automated Evaluation of Instruction-Tuned LLMs based on Semantic Textual Similarity

no code implementations • 30 Jan 2024 • Ansar Aynetdinov, Alan Akbik

Instruction-tuned Large Language Models (LLMs) have recently showcased remarkable advancements in their ability to generate fitting responses to natural language instructions.

Semantic Textual Similarity STS +1

Paper
Add Code

CleanCoNLL: A Nearly Noise-Free Named Entity Recognition Dataset

1 code implementation • 24 Oct 2023 • Susanna Rücker, Alan Akbik

The CoNLL-03 corpus is arguably the most well-known and utilized benchmark dataset for named entity recognition (NER).

Entity Linking named-entity-recognition +2

Paper
Code

Fabricator: An Open Source Toolkit for Generating Labeled Training Data with Teacher LLMs

1 code implementation • 18 Sep 2023 • Jonas Golde, Patrick Haller, Felix Hamborg, Julian Risch, Alan Akbik

Here, a powerful LLM is prompted with a task description to generate labeled data that can be used to train a downstream NLP model.

Question Answering text-classification +2

Paper
Code

OpinionGPT: Modelling Explicit Biases in Instruction-Tuned LLMs

no code implementations • 7 Sep 2023 • Patrick Haller, Ansar Aynetdinov, Alan Akbik

The demo will answer this question using a model fine-tuned on text representing each of the selected biases, allowing side-by-side comparison.

Paper
Add Code

Task-Specific Embeddings for Ante-Hoc Explainable Text Classification

no code implementations • 30 Nov 2022 • Kishaloy Halder, Josip Krapac, Alan Akbik, Anthony Brew, Matti Lyra

In a series of experiments, we show that this yields a number of interesting benefits: (1) The resulting order induced by distances in the embedding space can be used to directly explain classification decisions.

Incremental Learning text-classification +1

Paper
Add Code

Medical Coding with Biomedical Transformer Ensembles and Zero/Few-shot Learning

no code implementations • NAACL (ACL) 2022 • Angelo Ziletti, Alan Akbik, Christoph Berns, Thomas Herold, Marion Legler, Martina Viell

Medical coding (MC) is an essential pre-requisite for reliable data retrieval and reporting.

Few-Shot Learning Retrieval

Paper
Add Code

Early Detection of Sexual Predators in Chats

1 code implementation • ACL 2021 • Matthias Vogt, Ulf Leser, Alan Akbik

We define and study the task of early sexual predator detection (eSPD) in chats, where the goal is to analyze a running chat from its beginning and predict grooming attempts as early and as accurately as possible.

Paper
Code

Task-Aware Representation of Sentences for Generic Text Classification

1 code implementation • COLING 2020 • Kishaloy Halder, Alan Akbik, Josip Krapac, Roland Vollgraf

State-of-the-art approaches for text classification leverage a transformer architecture with a linear layer on top that outputs a class distribution for a given prediction problem.

Binary Classification text-classification +2

13,627

Paper
Code

FLERT: Document-Level Features for Named Entity Recognition

1 code implementation • 13 Nov 2020 • Stefan Schweter, Alan Akbik

Current state-of-the-art approaches for named entity recognition (NER) typically consider text at the sentence-level and thus do not model information that crosses sentence boundaries.

Ranked #1 on Named Entity Recognition (NER) on CoNLL 2003 (German) Revised

named-entity-recognition Named Entity Recognition +2

13,627

Paper
Code

HunFlair: An Easy-to-Use Tool for State-of-the-Art Biomedical Named Entity Recognition

2 code implementations • 17 Aug 2020 • Leon Weber, Mario Sänger, Jannes Münchmeyer, Maryam Habibi, Ulf Leser, Alan Akbik

Summary: Named Entity Recognition (NER) is an important step in biomedical information extraction pipelines.

named-entity-recognition Named Entity Recognition +1

13,627

Paper
Code

FLAIR: An Easy-to-Use Framework for State-of-the-Art NLP

1 code implementation • NAACL 2019 • Alan Akbik, Tanja Bergmann, Duncan Blythe, Kashif Rasul, Stefan Schweter, Rol Vollgraf,

We present FLAIR, an NLP framework designed to facilitate training and distribution of state-of-the-art sequence labeling, text classification and language models.

Chunking Named Entity Recognition (NER) +2

13,626

Paper
Code

Pooled Contextualized Embeddings for Named Entity Recognition

no code implementations • NAACL 2019 • Alan Akbik, Tanja Bergmann, Rol Vollgraf,

We make all code and pre-trained models available to the research community for use and reproduction.

named-entity-recognition Named Entity Recognition +1

Paper
Add Code

Contextual String Embeddings for Sequence Labeling

1 code implementation • COLING 2018 • Alan Akbik, Duncan Blythe, Rol Vollgraf,

Recent advances in language modeling using recurrent neural networks have made it viable to model language as distributions over characters.

Ranked #2 on Chunking on Penn Treebank

Chunking Language Modelling +4

13,626

Paper
Code

ZAP: An Open-Source Multilingual Annotation Projection Framework

no code implementations • LREC 2018 • Alan Akbik, Rol Vollgraf,

Word Alignment

Paper
Add Code

FEIDEGGER: A Multi-modal Corpus of Fashion Images and Descriptions in German

no code implementations • LREC 2018 • Leonidas Lefakis, Alan Akbik, Rol Vollgraf,

Image Retrieval

Paper
Add Code

Syntax-Aware Language Modeling with Recurrent Neural Networks

no code implementations • 2 Mar 2018 • Duncan Blythe, Alan Akbik, Roland Vollgraf

Neural language models (LMs) are typically trained using only lexical features, such as surface forms of words.

Language Modelling

Paper
Add Code

CROWD-IN-THE-LOOP: A Hybrid Approach for Annotating Semantic Roles

no code implementations • EMNLP 2017 • Chenguang Wang, Alan Akbik, Laura Chiticariu, Yunyao Li, Fei Xia, Anbang Xu

Crowdsourcing has proven to be an effective method for generating labeled data for a range of NLP tasks.

Machine Translation Question Answering +1

Paper
Add Code

The Projector: An Interactive Annotation Projection Visualization Tool

no code implementations • EMNLP 2017 • Alan Akbik, Rol Vollgraf,

Previous works proposed annotation projection in parallel corpora to inexpensively generate treebanks or propbanks for new languages.

Paper
Add Code

Multilingual Information Extraction with PolyglotIE

no code implementations • COLING 2016 • Alan Akbik, Laura Chiticariu, Marina Danilevsky, Yonas Kbrom, Yunyao Li, Huaiyu Zhu

We present PolyglotIE, a web-based tool for developing extractors that perform Information Extraction (IE) over multilingual data.

Semantic Parsing

Paper
Add Code

Multilingual Aliasing for Auto-Generating Proposition Banks

no code implementations • COLING 2016 • Alan Akbik, Xinyu Guan, Yunyao Li

To address these issues, we propose to manually alias TL verbs to existing English frames.

Machine Translation Question Answering +1

Paper
Add Code

K-SRL: Instance-based Learning for Semantic Role Labeling

no code implementations • COLING 2016 • Alan Akbik, Yunyao Li

To overcome this challenge, we propose the use of instance-based learning that performs no explicit generalization, but rather extrapolates predictions from the most similar instances in the training data.

Machine Translation Question Answering +1

Paper
Add Code

Towards Semi-Automatic Generation of Proposition Banks for Low-Resource Languages

no code implementations • EMNLP 2016 • Alan Akbik, Vishwajeet Kumar, Yunyao Li

Paper
Add Code

POLYGLOT: Multilingual Semantic Role Labeling with Unified Labels

no code implementations • ACL 2016 • Alan Akbik, Yunyao Li

Question Answering Semantic Role Labeling

Paper
Add Code

SCHN\"APPER: A Web Toolkit for Exploratory Relation Extraction

no code implementations • IJCNLP 2015 • Thilo Michael, Alan Akbik

Relation Relation Extraction

Paper
Add Code

Generating High Quality Proposition Banks for Multilingual Semantic Role Labeling

no code implementations • IJCNLP 2015 • Alan Akbik, Laura Chiticariu, Marina Danilevsky, Yunyao Li, Shivakumar Vaithyanathan, Huaiyu Zhu

Question Answering Semantic Role Labeling +1

Paper
Add Code

Extracting a Repository of Events and Event References from News Clusters

no code implementations • WS 2014 • Silvia Julinda, Christoph Boden, Alan Akbik

Paper
Add Code

Nerdle: Topic-Specific Question Answering Using Wikia Seeds

no code implementations • COLING 2014 • Umar Maqsud, Sebastian Arnold, Michael H{\"u}lfenhaus, Alan Akbik

Question Answering Semantic Role Labeling

Paper
Add Code

Exploratory Relation Extraction in Large Text Corpora

no code implementations • COLING 2014 • Alan Akbik, Thilo Michael, Christoph Boden

Relation Relation Extraction

Paper
Add Code

Freepal: A Large Collection of Deep Lexico-Syntactic Patterns for Relation Extraction

no code implementations • LREC 2014 • Johannes Kirschnick, Alan Akbik, Holmer Hemsen

The increasing availability and maturity of both scalable computing architectures and deep syntactic parsers is opening up new possibilities for Relation Extraction (RE) on large corpora of natural language text.

Entity Linking Relation +1

Paper
Add Code

The Weltmodell: A Data-Driven Commonsense Knowledge Base

no code implementations • LREC 2014 • Alan Akbik, Thilo Michael

We present the Weltmodell, a commonsense knowledge base that was automatically generated from aggregated dependency parse fragments gathered from over 3. 5 million English language books.

Open Information Extraction