Abstractive Text Summarization
326 papers with code • 19 benchmarks • 48 datasets
Abstractive Text Summarization is the task of generating a short, concise summary that captures the salient ideas of the source text. The generated summaries may contain new phrases and sentences that do not appear in the source text.
Source: Generative Adversarial Network for Abstractive Text Summarization
Image credit: Abstractive Text Summarization using Sequence-to-sequence RNNs and Beyond
Libraries
Use these libraries to find Abstractive Text Summarization models and implementations.
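As a quick illustration of what these libraries provide, here is a minimal sketch using the Hugging Face Transformers pipeline API. The checkpoint name is one commonly used choice for news summarization, an assumption for this example rather than anything prescribed by this page.

```python
# Minimal sketch: abstractive summarization with the Hugging Face
# Transformers pipeline. "facebook/bart-large-cnn" is one commonly
# used checkpoint (an assumption, not prescribed by this page).
from transformers import pipeline

summarizer = pipeline("summarization", model="facebook/bart-large-cnn")

article = (
    "Scientists have developed a new battery chemistry that charges in "
    "minutes and retains most of its capacity after thousands of cycles, "
    "a result that could make electric vehicles cheaper and more practical."
)

result = summarizer(article, max_length=40, min_length=10, do_sample=False)
print(result[0]["summary_text"])  # may contain phrases absent from the input
```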
Datasets
Subtasks
Most implemented papers
TLDR: Extreme Summarization of Scientific Documents
We introduce TLDR generation, a new form of extreme summarization, for scientific papers.
Deep Reinforcement Learning For Sequence to Sequence Models
In this survey, we consider seq2seq problems from the RL point of view and provide a formulation that combines the decision-making power of RL methods with the ability of sequence-to-sequence models to retain long-term memory.
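As a rough illustration of the RL view of seq2seq (a sketch under assumed values, not the survey's exact formulation), the core policy-gradient update scores a sampled summary with a sequence-level reward such as ROUGE:

```python
import torch

# Sketch of a REINFORCE-style sequence-level loss for seq2seq training.
# log_probs would come from the decoder for a *sampled* summary; the
# reward (e.g., ROUGE against the reference) and baseline are assumed.
log_probs = torch.tensor([-1.2, -0.7, -2.1], requires_grad=True)  # per-token log p
reward = 0.42    # sequence-level reward for the sampled summary
baseline = 0.35  # e.g., reward of the greedy decode (variance reduction)

loss = -(reward - baseline) * log_probs.sum()
loss.backward()  # gradients favor sequences that beat the baseline
```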
Fast Abstractive Summarization with Reinforce-Selected Sentence Rewriting
Inspired by how humans summarize long documents, we propose an accurate and fast summarization model that first selects salient sentences and then rewrites them abstractively (i.e., compresses and paraphrases) to generate a concise overall summary.
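A toy sketch of that two-stage select-then-rewrite idea follows. The heuristic scorer and stubbed rewriter are illustrative assumptions; the paper learns both stages and trains the extractor with policy gradient.

```python
import re
from collections import Counter

def split_sentences(text):
    # Naive splitter; real systems use a proper sentence tokenizer.
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]

def select_salient(sentences, k=2):
    # Stage 1 (extract): score sentences by overlap with document term
    # frequencies. The paper instead uses a learned, RL-trained extractor.
    doc_tf = Counter(w.lower() for s in sentences for w in s.split())
    def score(s):
        words = [w.lower() for w in s.split()]
        return sum(doc_tf[w] for w in words) / max(len(words), 1)
    return sorted(sentences, key=score, reverse=True)[:k]

def rewrite(sentence):
    # Stage 2 (abstract): placeholder for a seq2seq abstractor that
    # compresses and paraphrases each selected sentence.
    return sentence

doc = ("The council approved the budget. The vote was close. "
       "Funding covers schools, roads, and parks over three years.")
print(" ".join(rewrite(s) for s in select_salient(split_sentences(doc))))
```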
Scoring Sentence Singletons and Pairs for Abstractive Summarization
This work addresses a crucial gap between sentence selection and fusion, supporting summarization that both compresses single sentences and fuses sentence pairs.
Unsupervised Opinion Summarization as Copycat-Review Generation
At test time, when generating summaries, we force the novelty to be minimal and produce text reflecting consensus opinions.
UniLMv2: Pseudo-Masked Language Models for Unified Language Model Pre-Training
We propose to pre-train a unified language model for both autoencoding and partially autoregressive language modeling tasks using a novel training procedure, referred to as a pseudo-masked language model (PMLM).
A Hierarchical Network for Abstractive Meeting Summarization with Cross-Domain Pretraining
With the abundance of automatic meeting transcripts, meeting summarization is of great interest to both participants and other parties.
Better Fine-Tuning by Reducing Representational Collapse
Although widely adopted, existing approaches for fine-tuning pre-trained language models have been shown to be unstable across hyper-parameter settings, motivating recent work on trust region methods.
DebateSum: A large-scale argument mining and summarization dataset
Finally, we present a search engine for this dataset which is utilized extensively by members of the National Speech and Debate Association today.
CLIFF: Contrastive Learning for Improving Faithfulness and Factuality in Abstractive Summarization
We study generating abstractive summaries that are faithful and factually consistent with the given articles.
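A minimal sketch of such a contrastive objective, under loud assumptions: random vectors stand in for learned article and summary representations, and CLIFF's actual construction of factually corrupted negative summaries is considerably more involved.

```python
import torch
import torch.nn.functional as F

torch.manual_seed(0)
dim = 16
article   = F.normalize(torch.randn(1, dim), dim=-1)  # article representation
positive  = F.normalize(torch.randn(1, dim), dim=-1)  # faithful summary
negatives = F.normalize(torch.randn(4, dim), dim=-1)  # factually corrupted summaries

# Temperature-scaled similarities; the faithful summary is class 0.
logits = (torch.cat([positive, negatives]) @ article.t()).t() / 0.07
loss = F.cross_entropy(logits, torch.tensor([0]))
```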