Chinese Gigaword

Introduced by Graff et al. in LDC Catalog No.: LDC2003T09, ISBN

The Chinese Gigaword corpus consists of 2.2M headline-document pairs from news stories spanning more than 284 months, drawn from two Chinese-language news agencies: the Xinhua News Agency of China (XIN) and the Central News Agency of Taiwan (CNA).

Source: Order-Preserving Abstractive Summarization for Spoken Content Based on Connectionist Temporal Classification

Tasks

Named Entity Recognition (NER)
Word Embeddings
Decipherment

Similar Datasets

Resume NER

Source: https://catalog.ldc.upenn.edu/desc/addenda/LDC2011T13.jpg
