Search Results for author: Manuel Stoeckel

Found 7 papers, 2 papers with code

Syntactic Language Change in English and German: Metrics, Parsers, and Convergences

1 code implementation18 Feb 2024 Yanran Chen, Wei Zhao, Anne Breitbarth, Manuel Stoeckel, Alexander Mehler, Steffen Eger

Even though we have evidence that recent parsers trained on modern treebanks are not heavily affected by data 'noise' such as spelling changes and OCR errors in our historic data, we find that results of syntactic language change are sensitive to the parsers involved, which is a caution against using a single parser for evaluating syntactic language change as done in previous work.

Optical Character Recognition (OCR) Sentence

I still have Time(s): Extending HeidelTime for German Texts

1 code implementation LREC 2022 Andy Lücking, Manuel Stoeckel, Giuseppe Abrami, Alexander Mehler

HeidelTime is one of the most widespread and successful tools for detecting temporal expressions in texts.

Voting for POS tagging of Latin texts: Using the flair of FLAIR to better Ensemble Classifiers by Example of Latin

no code implementations LREC 2020 Manuel Stoeckel, Alex Henlein, Wahed Hemati, Alex Mehler, er

Since most of the available Latin word embeddings were trained on either few or inaccurate data, we trained several embeddings on better data in the first step.

Lemmatization Part-Of-Speech Tagging +3

TextAnnotator: A UIMA Based Tool for the Simultaneous and Collaborative Annotation of Texts

no code implementations LREC 2020 Giuseppe Abrami, Manuel Stoeckel, Alex Mehler, er

The annotation of texts and other material in the field of digital humanities and Natural Language Processing (NLP) is a common task of research projects.

BIOfid Dataset: Publishing a German Gold Standard for Named Entity Recognition in Historical Biodiversity Literature

no code implementations CONLL 2019 Sajawel Ahmed, Manuel Stoeckel, Christine Driller, Adrian Pachzelt, Alex Mehler, er

The Specialized Information Service Biodiversity Research (BIOfid) has been launched to mobilize valuable biological data from printed literature hidden in German libraries for over the past 250 years.

named-entity-recognition Named Entity Recognition +2

When Specialization Helps: Using Pooled Contextualized Embeddings to Detect Chemical and Biomedical Entities in Spanish

no code implementations WS 2019 Manuel Stoeckel, Wahed Hemati, Alexander Mehler

The recognition of pharmacological substances, compounds and proteins is an essential preliminary work for the recognition of relations between chemicals and other biomedically relevant units.

Word Embeddings

SenseFitting: Sense Level Semantic Specialization of Word Embeddings for Word Sense Disambiguation

no code implementations30 Jul 2019 Manuel Stoeckel, Sajawel Ahmed, Alexander Mehler

We outperform knowledge-based WSD methods by up to 25% F1-score and produce a new state-of-the-art on the German sense-annotated dataset WebCAGe.

LEMMA Word Embeddings +1

Cannot find the paper you are looking for? You can Submit a new open access paper.