NCBI Disease Corpus

Introduced by Do{\u{g}}an et al. in An improved corpus of disease mentions in PubMed citations

NCBI Disease Corpus is a large-scale disease corpus consisting of 6900 disease mentions in 793 PubMed citations, derived from an earlier corpus. The corpus contains rich annotations, was developed by a team of 12 annotators (two people per annotation) and covers all sentences in a PubMed abstract. Disease mentions are categorized into Specific Disease, Disease Class, Composite Mention and Modifier categories.

Source: An improved corpus of disease mentions in PubMed citations

License

  • Unknown

Modalities

Languages

Tasks