About

Benchmarks

TREND DATASET BEST METHOD PAPER TITLE PAPER CODE COMPARE

Greatest papers with code

Abstractive Summarization of Spoken andWritten Instructions with BERT

KDD Converse 2020 nlpyang/PreSumm

Summarization of speech is a difficult problem due to the spontaneity of the flow, disfluencies, and other issues that are not usually encountered in written texts.

ABSTRACTIVE TEXT SUMMARIZATION SENTENCE SEGMENTATION TRANSFER LEARNING

Not Low-Resource Anymore: Aligner Ensembling, Batch Filtering, and New Datasets for Bengali-English Machine Translation

EMNLP 2020 csebuetnlp/banglanmt

With the segmenter and the two methods combined, we compile a high-quality Bengali-English parallel corpus comprising of 2. 75 million sentence pairs, more than 2 million of which were not available before.

MACHINE TRANSLATION SENTENCE SEGMENTATION

Abstractive Summarization of Spoken and Written Instructions with BERT

21 Aug 2020alebryvas/berk266

Summarization of speech is a difficult problem due to the spontaneity of the flow, disfluencies, and other issues that are not usually encountered in written texts.

ABSTRACTIVE TEXT SUMMARIZATION SENTENCE SEGMENTATION TRANSFER LEARNING

Fine-Grained Argument Unit Recognition and Classification

22 Apr 2019trtm/AURC

In this work, we argue that the task should be performed on a more fine-grained level of sequence labeling.

ARGUMENT MINING SENTENCE SEGMENTATION

Using Punkt for Sentence Segmentation in non-Latin Scripts: Experiments on Kurdish (Sorani) Texts

9 Apr 2020KurdishBLARK/KTC-Segmented

The Kurdish language is a multi-dialect, under-resourced language which is written in different scripts.

SENTENCE SEGMENTATION

Evaluating Sentence Segmentation and Word Tokenization Systems on Estonian Web Texts

16 Nov 2020ksirts/EWTB_sentence_seg

Texts obtained from web are noisy and do not necessarily follow the orthographic sentence and word boundary rules.

SENTENCE SEGMENTATION TOKENIZATION