PropSegmEnt is a corpus of over 35K propositions annotated by expert human raters. The dataset structure resembles the tasks of (1) segmenting sentences within a document to the set of propositions, and (2) classifying the entailment relation of each proposition with respect to a different yet topically-aligned document, i.e. documents describing the same event or entity.
Source: PropSegmEnt: A Large-Scale Corpus for Proposition-Level Segmentation and Entailment RecognitionPaper | Code | Results | Date | Stars |
---|