Sentence segmentation

19 papers with code • 1 benchmark • 3 datasets

Ascle: A Python Natural Language Processing Toolkit for Medical Text Generation

yale-lily/ehrkit-2022 28 Nov 2023

This study introduces Ascle, a pioneering natural language processing (NLP) toolkit designed for medical text generation.

59

KG-GPT: A General Framework for Reasoning on Knowledge Graphs Using Large Language Models

jiho283/kg-gpt 17 Oct 2023

While large language models (LLMs) have made considerable advancements in understanding and generating unstructured text, their application in structured data remains underexplored.

42

Where's the Point? Self-Supervised Multilingual Punctuation-Agnostic Sentence Segmentation

bminixhofer/nnsplit 30 May 2023

Many NLP pipelines split text into sentences as one of the crucial preprocessing steps.

494

Prosodic features improve sentence segmentation and parsing

ekayen/prosody_nlp 23 Feb 2023

Parsing spoken dialogue presents challenges that parsing text does not, including a lack of clear sentence boundaries.

0

SLATE: A Sequence Labeling Approach for Task Extraction from Free-form Inked Content

slateauthors/slate 8 Nov 2022

We present SLATE, a sequence labeling approach for extracting tasks from free-form content such as digitally handwritten (or "inked") notes on a virtual whiteboard.

2

Mukayese: Turkish NLP Strikes Back

alisafaya/mukayese Findings (ACL) 2022

As a solution, we present Mukayese, a set of NLP benchmarks for the Turkish language that contains several NLP tasks.

62
02 Mar 2022

Creating a Universal Dependencies Treebank of Spoken Frisian-Dutch Code-switched Data

universaldependencies/ud_frisian_dutch-fame 22 Feb 2021

This paper explores the difficulties of annotating transcribed spoken Dutch-Frisian code-switch utterances into Universal Dependencies.

0

Trankit: A Light-Weight Transformer-based Toolkit for Multilingual Natural Language Processing

nlp-uoregon/trankit EACL 2021

Finally, we create a demo video for Trankit at: https://youtu.be/q0KGP3zGjGc.

705
09 Jan 2021

Evaluating Sentence Segmentation and Word Tokenization Systems on Estonian Web Texts

ksirts/EWTB_sentence_seg 16 Nov 2020

Texts obtained from web are noisy and do not necessarily follow the orthographic sentence and word boundary rules.

0