Table Recognition
21 papers with code • 5 benchmarks • 5 datasets
Table recognition refers to the process of automatically identifying and extracting tabular structures from unstructured data sources such as text documents, images, or scanned documents. The goal of table recognition is to accurately detect the presence of tables within the data and extract their contents, including rows, columns, headers, and cell values.
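The extraction step above turns markup or detected cells into rows, columns, and cell values. As a minimal sketch of that output structure, the snippet below parses an HTML table string into a list of rows using only Python's standard library (the class name and input string are illustrative, not from any paper in this list):

```python
from html.parser import HTMLParser

class TableExtractor(HTMLParser):
    """Collects cell text from <tr>/<td>/<th> tags into a list of rows."""
    def __init__(self):
        super().__init__()
        self.rows = []        # list of rows, each a list of cell strings
        self._row = None      # row currently being filled
        self._in_cell = False

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th"):
            self._in_cell = True

    def handle_endtag(self, tag):
        if tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None
        elif tag in ("td", "th"):
            self._in_cell = False

    def handle_data(self, data):
        if self._in_cell and self._row is not None:
            self._row.append(data.strip())

html_table = ("<table><tr><th>Name</th><th>Score</th></tr>"
              "<tr><td>A</td><td>0.9</td></tr></table>")
parser = TableExtractor()
parser.feed(html_table)
print(parser.rows)  # [['Name', 'Score'], ['A', '0.9']]
```

Real systems face the harder front end of this pipeline: locating the table in an image or scan and recovering the cell grid before any such structured representation exists.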
Libraries
Use these libraries to find Table Recognition models and implementations.
Most implemented papers
Multi-Type-TD-TSR -- Extracting Tables from Document Images using a Multi-stage Pipeline for Table Detection and Table Structure Recognition: from OCR to Structured Table Representations
It utilizes state-of-the-art deep learning models for table detection and differentiates between three types of tables based on their borders.
Flexible Table Recognition and Semantic Interpretation System
Moreover, to incorporate the extraction of semantic information, we develop a graph-based table interpretation method.
TGRNet: A Table Graph Reconstruction Network for Table Structure Recognition
A table arranging data in rows and columns is a very effective data structure, which has been widely used in business and scientific research.
Detecting Layout Templates in Complex Multiregion Files
We present the Mondrian approach to automatically identify layout templates across multiple files and systematically extract the corresponding regions.
PP-StructureV2: A Stronger Document Analysis System
For the table recognition model, we utilize PP-LCNet, CSP-PAN and SLAHead to optimize the backbone module, feature fusion module and decoding module, respectively, which improved the table structure accuracy by 6% with comparable inference speed.
LORE: Logical Location Regression Network for Table Structure Recognition
Table structure recognition (TSR) aims at extracting tables in images into machine-understandable formats.
Rethinking Image-based Table Recognition Using Weakly Supervised Methods
In this paper, we propose a weakly supervised model named WSTabNet for table recognition that relies only on HTML (or LaTeX) code-level annotations of table images.
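Code-level supervision of this kind typically represents a table's ground truth as a flat sequence of HTML structure tokens rather than cell coordinates. A hedged sketch of such a serialization (the function name and token conventions are illustrative; datasets differ in the exact vocabulary):

```python
def table_to_html_tokens(rows, header_rows=1):
    """Serialize a table (list of rows of cell strings) into a flat
    HTML structure-token sequence, the kind of code-level annotation
    used as weak supervision for table recognition models."""
    tokens = ["<table>"]
    for i, row in enumerate(rows):
        cell = "th" if i < header_rows else "td"  # header vs. body cells
        tokens.append("<tr>")
        for text in row:
            tokens.extend([f"<{cell}>", text, f"</{cell}>"])
        tokens.append("</tr>")
    tokens.append("</table>")
    return tokens

print(table_to_html_tokens([["Name", "Score"], ["A", "0.9"]]))
```

Because such sequences can be generated automatically from source markup, they are far cheaper to obtain than manually drawn cell bounding boxes, which is what makes this form of supervision "weak".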
An End-to-End Multi-Task Learning Model for Image-based Table Recognition
Most previous methods take a non-end-to-end approach, dividing the problem into two separate sub-problems, table structure recognition and cell-content recognition, and then attempting to solve each sub-problem independently with a separate system.
A large-scale dataset for end-to-end table recognition in the wild
To this end, we propose a new large-scale dataset named Table Recognition Set (TabRecSet) with diverse table forms sourced from multiple scenarios in the wild, providing complete annotations dedicated to end-to-end TR research.
OmniParser: A Unified Framework for Text Spotting, Key Information Extraction and Table Recognition
Recently, visually-situated text parsing (VsTP) has experienced notable advancements, driven by the increasing demand for automated document understanding and the emergence of Generative Large Language Models (LLMs) capable of processing document-based questions.