HLGD (Headline Grouping Dataset)

Introduced by Laban et al. in News Headline Grouping as a Challenging NLU Task

The Headline Grouping dataset is a binary classification dataset of news headline pairs. For each pair of headlines, the binary label indicates whether the two headlines belong to the same group (and describe the same underlying event) or to distinct groups. The dataset contains a total of 20k annotated headline pairs, split into train, validation, and test portions.
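To make the record layout concrete, here is a minimal sketch of how one labeled pair might be represented in Python. The class name `HeadlinePair`, its field names, the helper `same_group_fraction`, and the encoding 1 = same group, 0 = distinct groups are all assumptions made for illustration, not the dataset's actual schema. The two sample headlines are invented and do not come from HLGD.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class HeadlinePair:
    """One annotated pair (hypothetical layout, not the official schema)."""
    headline_a: str
    headline_b: str
    label: int  # assumed encoding: 1 = same group/event, 0 = distinct groups


def same_group_fraction(pairs: List[HeadlinePair]) -> float:
    """Return the share of pairs labeled as belonging to the same group."""
    if not pairs:
        return 0.0
    return sum(p.label for p in pairs) / len(pairs)


# Invented example pairs, for illustration only.
pairs = [
    HeadlinePair(
        "Storm knocks out power across the coast",
        "Thousands lose electricity as storm hits coastline",
        1,
    ),
    HeadlinePair(
        "Storm knocks out power across the coast",
        "Central bank holds interest rates steady",
        0,
    ),
]

print(same_group_fraction(pairs))
```

A quick check of the label balance like this is a common first step before training a binary classifier on each split.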
