Enhancing Label Consistency on Document-level Named Entity Recognition

24 Oct 2022  ·  Minbyul Jeong, Jaewoo Kang ·

Named entity recognition (NER) is a fundamental part of extracting information from documents in biomedical applications. A notable advantage of NER is its consistency in extracting biomedical entities in a document context. Although existing document NER models show consistent predictions, they still do not meet our expectations. We investigated whether the adjectives and prepositions within an entity cause a low label consistency, which results in inconsistent predictions. In this paper, we present our method, ConNER, which enhances the label dependency of modifiers (e.g., adjectives and prepositions) to achieve higher label agreement. ConNER refines the draft labels of the modifiers to improve the output representations of biomedical entities. The effectiveness of our method is demonstrated on four popular biomedical NER datasets; in particular, its efficacy is proved on two datasets with 7.5-8.6% absolute improvements in the F1 score. We interpret that our ConNER method is effective on datasets that have intrinsically low label consistency. In the qualitative analysis, we demonstrate how our approach makes the NER model generate consistent predictions. Our code and resources are available at https://github.com/dmis-lab/ConNER/.

PDF Abstract
Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Named Entity Recognition (NER) AnatEM ConNER F1 83.5 # 1
Named Entity Recognition (NER) BC5CDR ConNER F1 91.3 # 2
Named Entity Recognition (NER) Gellus ConNER F1 (%) 63.4 # 1
Named Entity Recognition (NER) NCBI-disease ConNER F1 89.9 # 4

Methods


No methods listed for this paper. Add relevant methods here