Document Summarization

29 papers with code · Natural Language Processing
Subtask of Text Summarization

State-of-the-art leaderboards

Trend Dataset Best Method Paper title Paper Code Compare

Greatest papers with code

Generating Wikipedia by Summarizing Long Sequences

ICLR 2018 tensorflow/tensor2tensor

We show that generating English Wikipedia articles can be approached as a multi- document summarization of source documents. We use extractive summarization to coarsely identify salient information and a neural abstractive model to generate the article.

DOCUMENT SUMMARIZATION MULTI-DOCUMENT SUMMARIZATION

Language Models are Unsupervised Multitask Learners

Preprint 2019 openai/gpt-2

Natural language processing tasks, such as question answering, machine translation, reading comprehension, and summarization, are typically approached with supervised learning on taskspecific datasets. We demonstrate that language models begin to learn these tasks without any explicit supervision when trained on a new dataset of millions of webpages called WebText.

 SOTA for Language Modelling on enwiki8 (using extra training data)

COMMON SENSE REASONING DOCUMENT SUMMARIZATION LANGUAGE MODELLING MACHINE TRANSLATION QUESTION ANSWERING READING COMPREHENSION

Ranking Sentences for Extractive Summarization with Reinforcement Learning

HLT 2018 shashiongithub/Refresh

Single document summarization is the task of producing a shorter version of a document while preserving its principal information content. In this paper we conceptualize extractive summarization as a sentence ranking task and propose a novel training algorithm which globally optimizes the ROUGE evaluation metric through a reinforcement learning objective.

DOCUMENT SUMMARIZATION

Bottom-Up Abstractive Summarization

EMNLP 2018 sebastianGehrmann/bottom-up-summary

Neural network-based methods for abstractive summarization produce outputs that are more fluent than other techniques, but which can be poor at content selection. We use this selector as a bottom-up attention step to constrain the model to likely phrases.

ABSTRACTIVE TEXT SUMMARIZATION DOCUMENT SUMMARIZATION

Don't Give Me the Details, Just the Summary! Topic-Aware Convolutional Neural Networks for Extreme Summarization

EMNLP 2018 shashiongithub/XSum

We introduce extreme summarization, a new single-document summarization task which does not favor extractive strategies and calls for an abstractive modeling approach. The idea is to create a short, one-sentence news summary answering the question "What is the article about?".

DOCUMENT SUMMARIZATION

Neural Document Summarization by Jointly Learning to Score and Select Sentences

ACL 2018 magic282/NeuSum

However, previous works treat them as two separated subtasks. In this paper, we present a novel end-to-end neural network framework for extractive document summarization by jointly learning to score and select sentences.

DOCUMENT SUMMARIZATION EXTRACTIVE DOCUMENT SUMMARIZATION

SummaRuNNer: A Recurrent Neural Network based Sequence Model for Extractive Summarization of Documents

14 Nov 2016kedz/nnsum

We present SummaRuNNer, a Recurrent Neural Network (RNN) based sequence model for extractive summarization of documents and show that it achieves performance better than or comparable to state-of-the-art. Our model has the additional advantage of being very interpretable, since it allows visualization of its predictions broken up by abstract features such as information content, salience and novelty.

DOCUMENT SUMMARIZATION

Neural Summarization by Extracting Sentences and Words

ACL 2016 kedz/nnsum

Traditional approaches to extractive summarization rely heavily on human-engineered features. In this work we propose a data-driven approach based on neural networks and continuous sentence features.

DOCUMENT SUMMARIZATION