SpEL: Structured Prediction for Entity Linking

23 Oct 2023  ·  Hassan S. Shavarani, Anoop Sarkar ·

Entity linking is a prominent thread of research focused on structured data creation by linking spans of text to an ontology or knowledge source. We revisit the use of structured prediction for entity linking which classifies each individual input token as an entity, and aggregates the token predictions. Our system, called SpEL (Structured prediction for Entity Linking) is a state-of-the-art entity linking system that uses some new ideas to apply structured prediction to the task of entity linking including: two refined fine-tuning steps; a context sensitive prediction aggregation strategy; reduction of the size of the model's output vocabulary, and; we address a common problem in entity-linking systems where there is a training vs. inference tokenization mismatch. Our experiments show that we can outperform the state-of-the-art on the commonly used AIDA benchmark dataset for entity linking to Wikipedia. Our method is also very compute efficient in terms of number of parameters and speed of inference.

PDF Abstract

Datasets


Introduced in the Paper:

AIDA/testc

Used in the Paper:

CoNLL AIDA CoNLL-YAGO

Results from the Paper


 Ranked #1 on Entity Linking on AIDA/testc (using extra training data)

     Get a GitHub badge
Task Dataset Model Metric Name Metric Value Global Rank Uses Extra
Training Data
Result Benchmark
Entity Linking AIDA-CoNLL SpEL-base (2023) Micro-F1 strong 88.1 # 2
Entity Linking AIDA-CoNLL SpEL-large (2023) Micro-F1 strong 88.6 # 1
Entity Linking AIDA/testc SpEL-large (2023) Micro-F1 strong 77.5 # 1
Entity Linking AIDA/testc SpEL-base (2023) Micro-F1 strong 73.7 # 2

Methods