Text Summarization
369 papers with code • 33 benchmarks • 87 datasets
Text Summarization is a natural language processing (NLP) task that condenses a lengthy text document into a shorter version while retaining the most important information and meaning. The goal is to produce a summary that accurately represents the content of the original text in a concise form.
There are two main approaches to text summarization: extractive methods, which identify and extract the most important sentences or phrases from the source text, and abstractive methods, which generate new text that conveys the content of the original.
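As a rough illustration of the two approaches, here is a minimal sketch: the extractive function is a toy frequency-based sentence scorer (not any specific published method), and the abstractive function assumes the Hugging Face transformers library with the public facebook/bart-large-cnn checkpoint.

```python
import re
from collections import Counter

def extractive_summary(text: str, num_sentences: int = 2) -> str:
    """Toy extractive method: score sentences by word frequency, keep the top ones."""
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    freq = Counter(re.findall(r"\w+", text.lower()))
    ranked = sorted(
        range(len(sentences)),
        key=lambda i: sum(freq[w] for w in re.findall(r"\w+", sentences[i].lower())),
        reverse=True,
    )
    keep = sorted(ranked[:num_sentences])  # restore original sentence order
    return " ".join(sentences[i] for i in keep)

def abstractive_summary(text: str) -> str:
    """Abstractive method: generate new text with a pretrained seq2seq model."""
    from transformers import pipeline  # assumes `transformers` is installed
    summarizer = pipeline("summarization", model="facebook/bart-large-cnn")
    return summarizer(text, max_length=60, min_length=10, do_sample=False)[0]["summary_text"]
```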
Libraries
Use these libraries to find Text Summarization models and implementations.
Datasets
Subtasks
Latest papers
The Radiation Oncology NLP Database
ROND is specifically designed to address the scarcity of NLP resources in radiation oncology, a field that offers many opportunities for NLP exploration.
Hyperparameter-Free Approach for Faster Minimum Bayes Risk Decoding
Minimum Bayes-Risk (MBR) decoding is shown to be a powerful alternative to beam search decoding for a wide range of text generation tasks.
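The core idea behind sampling-based MBR decoding fits in a few lines. The sketch below is a generic Monte Carlo version, not the paper's hyperparameter-free variant, and it uses a toy unigram-F1 utility; in practice a metric such as BLEU, chrF, or a neural metric is used.

```python
from collections import Counter

def unigram_f1(hyp: str, ref: str) -> float:
    """Toy utility function: F1 over unigram overlap."""
    h, r = Counter(hyp.split()), Counter(ref.split())
    overlap = sum((h & r).values())
    if overlap == 0:
        return 0.0
    p, rec = overlap / sum(h.values()), overlap / sum(r.values())
    return 2 * p * rec / (p + rec)

def mbr_decode(candidates: list[str]) -> str:
    """Pick the candidate with the highest expected utility against the
    other samples (a Monte Carlo estimate of the Bayes risk)."""
    def expected_utility(y: str) -> float:
        return sum(unigram_f1(y, y_prime) for y_prime in candidates)
    return max(candidates, key=expected_utility)

samples = ["the cat sat on the mat", "a cat sat on a mat", "the dog barked"]
print(mbr_decode(samples))  # the consensus-like candidate wins over outliers
```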
Lookahead: An Inference Acceleration Framework for Large Language Model with Lossless Generation Accuracy
This paper presents a generic framework for accelerating the inference process, yielding a substantial speedup and cost reduction for our RAG system with lossless generation accuracy.
Ascle: A Python Natural Language Processing Toolkit for Medical Text Generation
This study introduces Ascle, a pioneering natural language processing (NLP) toolkit designed for medical text generation.
Exploring Prompting Large Language Models as Explainable Metrics
This paper describes the IUST NLP Lab submission to the Prompting Large Language Models as Explainable Metrics Shared Task at the Eval4NLP 2023 Workshop on Evaluation & Comparison of NLP Systems.
DynaPipe: Optimizing Multi-task Training through Dynamic Pipelines
This paper proposes a dynamic micro-batching approach to tackle sequence length variation and enable efficient multi-task model training.
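As a rough illustration of why length-aware micro-batching helps, here is a toy sketch (not DynaPipe's actual planning algorithm): sequences are sorted by length and greedily cut into micro-batches under a padded-token budget, so short and long sequences are not padded together.

```python
def micro_batches(lengths: list[int], token_budget: int = 64) -> list[list[int]]:
    """Group sequence indices into micro-batches whose padded size
    (longest sequence * batch size) stays under a token budget."""
    order = sorted(range(len(lengths)), key=lambda i: lengths[i])
    batches, current = [], []
    for i in order:
        longest = max(lengths[j] for j in current + [i])
        if current and longest * (len(current) + 1) > token_budget:
            batches.append(current)  # close the batch before it overflows
            current = [i]
        else:
            current.append(i)
    if current:
        batches.append(current)
    return batches

print(micro_batches([5, 30, 6, 28, 7, 31]))  # short and long sequences end up apart
```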
Benchmarking Generation and Evaluation Capabilities of Large Language Models for Instruction Controllable Summarization
Our study reveals that instruction controllable text summarization remains a challenging task for LLMs, since (1) all evaluated LLMs still make factual and other types of errors in their summaries; (2) no LLM-based evaluation method achieves strong alignment with human annotators when judging the quality of candidate summaries; and (3) different LLMs show large performance gaps in summary generation and evaluation.
Controllable Text Summarization: Unraveling Challenges, Approaches, and Prospects -- A Survey
Generic text summarization approaches often fail to address the specific intent and needs of individual users.
GreekT5: A Series of Greek Sequence-to-Sequence Models for News Summarization
The proposed models were thoroughly evaluated on the same dataset against GreekBART, the state-of-the-art model in Greek abstractive news summarization.
Boosting Summarization with Normalizing Flows and Aggressive Training
This paper presents FlowSUM, a normalizing flows-based variational encoder-decoder framework for Transformer-based summarization.