Datasets > Modality > Texts > NCBI Disease

The NCBI Disease corpus consists of 793 PubMed abstracts, which are separated into training (593), development (100) and test (100) subsets. The NCBI Disease corpus is annotated with disease mentions, using concept identifiers from either MeSH or OMIM.

Source: A Neural Multi-Task Learning Framework to Jointly Model Medical Named Entity Recognition and Normalization

License

  • Unknown

Modalities

Languages

Tasks