Key Information Extraction

28 papers with code • 6 benchmarks • 10 datasets

Key Information Extraction (KIE) is aimed at extracting structured information (e.g. key-value pairs) from form-style documents (e.g. invoices), which makes an important step towards intelligent document understanding.

Libraries

Use these libraries to find Key Information Extraction models and implementations
5 papers
38,759
2 papers
18,406
2 papers
4,097
See all 6 libraries.

Most implemented papers

Key Information Extraction From Documents: Evaluation And Generator

obensch/DocumentInformationExtraction 9 Jun 2021

Therefore, natural language processing models have already been combined with computer vision models in the past, to benefit from e. g. positional information and to improve performance of these key information extraction models.

BROS: A Pre-trained Language Model Focusing on Text and Layout for Better Key Information Extraction from Documents

clovaai/bros 10 Aug 2021

On the other hand, this paper tackles the problem by going back to the basic: effective combination of text and layout.

MMOCR: A Comprehensive Toolbox for Text Detection, Recognition and Understanding

open-mmlab/mmocr 14 Aug 2021

We present MMOCR-an open-source toolbox which provides a comprehensive pipeline for text detection and recognition, as well as their downstream tasks such as named entity recognition and key information extraction.

Doc2Graph: a Task Agnostic Document Understanding Framework based on Graph Neural Networks

andreagemelli/doc2graph 23 Aug 2022

Geometric Deep Learning has recently attracted significant interest in a wide range of machine learning fields, including document analysis.

PP-StructureV2: A Stronger Document Analysis System

PaddlePaddle/PaddleOCR 11 Oct 2022

For Table Recognition model, we utilize PP-LCNet, CSP-PAN and SLAHead to optimize the backbone module, feature fusion module and decoding module, respectively, which improved the table structure accuracy by 6\% with comparable inference speed.

DoSA : A System to Accelerate Annotations on Business Documents with Human-in-the-Loop

neeleshkshukla/dosa 9 Nov 2022

An initial document-specific model can be trained and its inference can be used as feedback for generating more automated annotations.

DocILE Benchmark for Document Information Localization and Extraction

rossumai/docile 11 Feb 2023

This paper introduces the DocILE benchmark with the largest dataset of business documents for the tasks of Key Information Localization and Extraction and Line Item Recognition.

Form-NLU: Dataset for the Form Natural Language Understanding

adlnlp/form_nlu 4 Apr 2023

Compared to general document analysis tasks, form document structure understanding and retrieval are challenging.

GeoLayoutLM: Geometric Pre-training for Visual Information Extraction

alibabaresearch/advancedliteratemachinery CVPR 2023

Additionally, novel relation heads, which are pre-trained by the geometric pre-training tasks and fine-tuned for RE, are elaborately designed to enrich and enhance the feature representation.

Information Redundancy and Biases in Public Document Information Extraction Benchmarks

seif-lat/bias-study-funsd-sroie 28 Apr 2023

Advances in the Visually-rich Document Understanding (VrDU) field and particularly the Key-Information Extraction (KIE) task are marked with the emergence of efficient Transformer-based approaches such as the LayoutLM models.