Key Information Extraction

28 papers with code • 6 benchmarks • 10 datasets

Key Information Extraction (KIE) is aimed at extracting structured information (e.g. key-value pairs) from form-style documents (e.g. invoices), which makes an important step towards intelligent document understanding.

Benchmarks

Add a Result

These leaderboards are used to track progress in Key Information Extraction

Dataset	Best Model	Compare
CORD	GeoLayoutLM	See all
Kleister NDA	LayoutLMv2LARGE	See all
SROIE	LayoutLMv2LARGE (Excluding OCR mismatch)	See all
EPHOIE	LayoutLMv3	See all
ETD500	CRF-visual	See all
SIMARA	DAN	See all

Libraries

Use these libraries to find Key Information Extraction models and implementations

PaddlePaddle/PaddleOCR

5 papers

38,759

huggingface/transformers

4 papers

125,629

microsoft/unilm

2 papers

18,406

open-mmlab/mmocr

2 papers

4,097

See all 6 libraries.

Datasets

Most implemented papers

Most implemented Social Latest No code

Key Information Extraction From Documents: Evaluation And Generator

obensch/DocumentInformationExtraction • 9 Jun 2021

Therefore, natural language processing models have already been combined with computer vision models in the past, to benefit from e. g. positional information and to improve performance of these key information extraction models.

Paper
Code

BROS: A Pre-trained Language Model Focusing on Text and Layout for Better Key Information Extraction from Documents

clovaai/bros • • 10 Aug 2021

On the other hand, this paper tackles the problem by going back to the basic: effective combination of text and layout.

Paper
Code

MMOCR: A Comprehensive Toolbox for Text Detection, Recognition and Understanding

open-mmlab/mmocr • • 14 Aug 2021

We present MMOCR-an open-source toolbox which provides a comprehensive pipeline for text detection and recognition, as well as their downstream tasks such as named entity recognition and key information extraction.

Paper
Code

Doc2Graph: a Task Agnostic Document Understanding Framework based on Graph Neural Networks

andreagemelli/doc2graph • • 23 Aug 2022

Geometric Deep Learning has recently attracted significant interest in a wide range of machine learning fields, including document analysis.

Paper
Code

PP-StructureV2: A Stronger Document Analysis System

PaddlePaddle/PaddleOCR • • 11 Oct 2022

For Table Recognition model, we utilize PP-LCNet, CSP-PAN and SLAHead to optimize the backbone module, feature fusion module and decoding module, respectively, which improved the table structure accuracy by 6\% with comparable inference speed.

Paper
Code

DoSA : A System to Accelerate Annotations on Business Documents with Human-in-the-Loop

neeleshkshukla/dosa • 9 Nov 2022

An initial document-specific model can be trained and its inference can be used as feedback for generating more automated annotations.

Paper
Code

DocILE Benchmark for Document Information Localization and Extraction

rossumai/docile • • 11 Feb 2023

This paper introduces the DocILE benchmark with the largest dataset of business documents for the tasks of Key Information Localization and Extraction and Line Item Recognition.

Paper
Code

Form-NLU: Dataset for the Form Natural Language Understanding

adlnlp/form_nlu • 4 Apr 2023

Compared to general document analysis tasks, form document structure understanding and retrieval are challenging.

Paper
Code

GeoLayoutLM: Geometric Pre-training for Visual Information Extraction

alibabaresearch/advancedliteratemachinery • • CVPR 2023

Additionally, novel relation heads, which are pre-trained by the geometric pre-training tasks and fine-tuned for RE, are elaborately designed to enrich and enhance the feature representation.

Paper
Code

Information Redundancy and Biases in Public Document Information Extraction Benchmarks

seif-lat/bias-study-funsd-sroie • 28 Apr 2023

Advances in the Visually-rich Document Understanding (VrDU) field and particularly the Key-Information Extraction (KIE) task are marked with the emergence of efficient Transformer-based approaches such as the LayoutLM models.

Paper
Code

Key Information Extraction

Benchmarks Add a Result

Libraries

Datasets

Most implemented papers

Content

Benchmarks

Add a Result