Chinese Gigaword

Introduced by Graff et al. in LDC Catalog No.: LDC2003T09, ISBN

Chinese Gigaword corpus consists of 2.2M of headline-document pairs of news stories covering over 284 months from two Chinese newspapers, namely the Xinhua News Agency of China (XIN) and the Central News Agency of Taiwan (CNA).

Source: Order-Preserving Abstractive Summarization for Spoken Content Based on Connectionist Temporal Classification

Papers


Paper Code Results Date Stars

Dataset Loaders


No data loaders found. You can submit your data loader here.

Tasks


Similar Datasets