TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Text Summarization	Arxiv HEP-TH citation graph	Lodoss-full-base (extractive)	ROUGE-1	48.20	# 10
Text Summarization	Arxiv HEP-TH citation graph	Lodoss-full-base (extractive)	ROUGE-2	20.50	# 7
Text Summarization	Arxiv HEP-TH citation graph	Lodoss-full-base (extractive)	ROUGE-L	42.28	# 9
Text Summarization	Arxiv HEP-TH citation graph	Lodoss-full-large (extractive)	ROUGE-1	48.45	# 7
Text Summarization	Arxiv HEP-TH citation graph	Lodoss-full-large (extractive)	ROUGE-2	20.72	# 5
Text Summarization	Arxiv HEP-TH citation graph	Lodoss-full-large (extractive)	ROUGE-L	42.55	# 7
Text Summarization	Pubmed	Lodoss-full-large (extractive)	ROUGE-1	49.38	# 5
Text Summarization	Pubmed	Lodoss-full-large (extractive)	ROUGE-2	23.89	# 2
Text Summarization	Pubmed	Lodoss-full-large (extractive)	ROUGE-L	44.84	# 4
Text Summarization	Pubmed	Lodoss-full-base (extractive)	ROUGE-1	48.93	# 7
Text Summarization	Pubmed	Lodoss-full-base (extractive)	ROUGE-2	23.51	# 4
Text Summarization	Pubmed	Lodoss-full-base (extractive)	ROUGE-L	44.40	# 6

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/toward-unifying-text-segmentation-and-long/text-summarization-on-pubmed-1)](https://paperswithcode.com/sota/text-summarization-on-pubmed-1?p=toward-unifying-text-segmentation-and-long)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/toward-unifying-text-segmentation-and-long/text-summarization-on-arxiv)](https://paperswithcode.com/sota/text-summarization-on-arxiv?p=toward-unifying-text-segmentation-and-long)`

Toward Unifying Text Segmentation and Long Document Summarization

28 Oct 2022 · Sangwoo Cho, Kaiqiang Song, Xiaoyang Wang, Fei Liu, Dong Yu

Text segmentation is important for signaling a document's structure. Without segmenting a long document into topically coherent sections, it is difficult for readers to comprehend the text, let alone find important information. The problem is only exacerbated by a lack of segmentation in transcripts of audio/video recordings. In this paper, we explore the role that section segmentation plays in extractive summarization of written and spoken documents. Our approach learns robust sentence representations by performing summarization and segmentation simultaneously, which is further enhanced by an optimization-based regularizer to promote selection of diverse summary sentences. We conduct experiments on multiple datasets ranging from scientific articles to spoken transcripts to evaluate the model's performance. Our findings suggest that the model can not only achieve state-of-the-art performance on publicly available benchmarks, but also demonstrate better cross-genre transferability when equipped with text segmentation. We perform a series of analyses to quantify the impact of section segmentation on summarizing written and spoken documents of substantial length and complexity.
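The abstract says summarization and segmentation are learned simultaneously but does not spell out the objective. One common way to set up such a joint objective is a weighted sum of two per-sentence binary classification losses, which the sketch below illustrates. Everything in it is an assumption made for illustration: the function names, the `seg_weight` parameter, and the use of binary cross-entropy are not taken from the paper, and the diversity regularizer is left out because the abstract does not describe its form.

```python
import math


def binary_cross_entropy(probs, labels):
    """Mean binary cross-entropy over per-sentence probabilities."""
    eps = 1e-12  # clamp to avoid log(0)
    total = 0.0
    for p, y in zip(probs, labels):
        p = min(max(p, eps), 1.0 - eps)
        total += -(y * math.log(p) + (1 - y) * math.log(1.0 - p))
    return total / len(probs)


def joint_loss(summ_probs, summ_labels, seg_probs, seg_labels, seg_weight=1.0):
    """Hypothetical joint objective: summarization loss plus weighted segmentation loss.

    summ_probs / seg_probs: predicted probability per sentence that it belongs
    in the summary / that it starts a new section.
    summ_labels / seg_labels: 0 or 1 reference labels for the same sentences.
    seg_weight: assumed trade-off hyperparameter (not from the paper).
    """
    summ = binary_cross_entropy(summ_probs, summ_labels)
    seg = binary_cross_entropy(seg_probs, seg_labels)
    return summ + seg_weight * seg


# Four sentences: the first and third are summary-worthy,
# the first and fourth begin a new section.
loss = joint_loss(
    summ_probs=[0.9, 0.2, 0.7, 0.1],
    summ_labels=[1, 0, 1, 0],
    seg_probs=[0.8, 0.1, 0.3, 0.6],
    seg_labels=[1, 0, 0, 1],
)
print(f"joint loss: {loss:.4f}")
```

Setting `seg_weight=0.0` reduces the sketch to a summarization-only objective, which is one simple way to think about a comparison with and without the segmentation signal.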

Code

tencent-ailab/lodoss official

Tasks

Document Summarization

Extractive Summarization

Segmentation

Sentence

Text Segmentation

Text Summarization

Datasets

Pubmed

Arxiv HEP-TH citation graph

Results from the Paper

Ranked #5 on Text Summarization on Pubmed
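The results for this paper are reported as ROUGE-1, ROUGE-2 and ROUGE-L. As a rough illustration of what the first two measure, the sketch below computes an n-gram overlap F1 between a candidate summary and a reference. It is a simplified stand-in, not the evaluation script behind the reported numbers: it assumes whitespace tokenization and lowercasing only, skips stemming, and returns a value in [0, 1], whereas the reported scores appear to be on a 0 to 100 scale. The function names are hypothetical.

```python
from collections import Counter


def ngrams(tokens, n):
    """Count the n-grams in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def rouge_n_f1(candidate, reference, n=1):
    """Simplified ROUGE-N F1 based on clipped n-gram overlap."""
    cand = ngrams(candidate.lower().split(), n)
    ref = ngrams(reference.lower().split(), n)
    # Counter intersection keeps the minimum count of each shared n-gram.
    overlap = sum((cand & ref).values())
    if overlap == 0:
        return 0.0
    precision = overlap / sum(cand.values())
    recall = overlap / sum(ref.values())
    return 2 * precision * recall / (precision + recall)


candidate = "the cat sat on the mat"
reference = "the cat lay on the mat"
print(round(rouge_n_f1(candidate, reference, n=1), 4))  # unigram overlap
print(round(rouge_n_f1(candidate, reference, n=2), 4))  # bigram overlap
```

ROUGE-L is based on the longest common subsequence rather than fixed-length n-grams and is not covered by this sketch.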

