TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Chinese Word Segmentation	AS	Glyce + BERT	F1	96.7	# 1
Chinese Word Segmentation	AS	Glyce + BERT	Precision	96.6	# 1
Chinese Word Segmentation	AS	Glyce + BERT	Recall	96.8	# 1
Chinese Sentence Pair Classification	BQ	Glyce + BERT	Accuracy	85.8	# 1
Chinese Sentence Pair Classification	BQ	Glyce + BERT	F1	85.5	# 2
Chinese Sentence Pair Classification	BQ	Glyce + BERT	Precision	84.2	# 1
Chinese Sentence Pair Classification	BQ	Glyce + BERT	Recall	86.9	# 1
Chinese Dependency Parsing	Chinese Pennbank	Biaffine + Glyce	LAS	89	# 1
Chinese Dependency Parsing	Chinese Pennbank	Biaffine + Glyce	UAS	90.2	# 1
Chinese Sentence Pair Classification	ChnSentiCorp	Glyce + BERT	Accuracy	95.9	# 1
Chinese Word Segmentation	CITYU	Glyce + BERT	F1	97.9	# 2
Chinese Word Segmentation	CITYU	Glyce + BERT	Precision	97.9	# 1
Chinese Word Segmentation	CITYU	Glyce + BERT	Recall	98	# 1
Chinese Semantic Role Labeling	CoNLL-2009	k-order pruning + Glyce	F1	83.7	# 1
Chinese Semantic Role Labeling	CoNLL-2009	k-order pruning + Glyce	Precision	85.4	# 1
Chinese Semantic Role Labeling	CoNLL-2009	k-order pruning + Glyce	Recall	82.1	# 1
Chinese Part-of-Speech Tagging	CTB5	Glyce + BERT	F1	96.61	# 2
Chinese Part-of-Speech Tagging	CTB5	Glyce + BERT	Precision	96.5	# 1
Chinese Part-of-Speech Tagging	CTB5	Glyce + BERT	Recall	96.74	# 1
Chinese Part-of-Speech Tagging	CTB6	Glyce + BERT	F1	95.41	# 1
Chinese Part-of-Speech Tagging	CTB6	Glyce + BERT	Precision	95.56	# 1
Chinese Part-of-Speech Tagging	CTB6	Glyce + BERT	Recall	95.26	# 1
Chinese Part-of-Speech Tagging	CTB9	Glyce + BERT	F1	93.15	# 1
Chinese Part-of-Speech Tagging	CTB9	Glyce + BERT	Precision	93.49	# 1
Chinese Part-of-Speech Tagging	CTB9	Glyce + BERT	Recall	92.84	# 1
Chinese Sentence Pair Classification	Fudan corpus	Glyce + BERT	Accuracy	99.8	# 1
Chinese Sentence Pair Classification	iFeng	Glyce + BERT	Accuracy	87.5	# 1
Chinese Sentence Pair Classification	LCQMC	Glyce + BERT....	Accuracy	88.7	# 1
Chinese Sentence Pair Classification	LCQMC	Glyce + BERT....	F1	88.8	# 1
Chinese Sentence Pair Classification	LCQMC	Glyce + BERT....	Precision	86.8	# 1
Chinese Sentence Pair Classification	LCQMC	Glyce + BERT....	Recall	91.2	# 1
Chinese Word Segmentation	MSR	Glyce + BERT	F1	98.3	# 5
Chinese Word Segmentation	MSR	Glyce + BERT	Precision	98.2	# 1
Chinese Word Segmentation	MSR	Glyce + BERT	Recall	98.3	# 1
Chinese Named Entity Recognition	MSRA	Glyce + BERT	F1	95.54	# 8
Chinese Named Entity Recognition	MSRA	Glyce + BERT	Precision	95.57	# 1
Chinese Named Entity Recognition	MSRA	Glyce + BERT	Recall	95.51	# 1
Chinese Sentence Pair Classification	NLPCC-DBQA	Glyce + BERT	F1	83.4	# 1
Chinese Sentence Pair Classification	NLPCC-DBQA	Glyce + BERT	Precision	81.1	# 1
Chinese Sentence Pair Classification	NLPCC-DBQA	Glyce + BERT	Recall	85.8	# 1
Chinese Named Entity Recognition	OntoNotes 4	Glyce + BERT	F1	80.62	# 8
Chinese Named Entity Recognition	OntoNotes 4	Glyce + BERT	Precision	81.87	# 1
Chinese Named Entity Recognition	OntoNotes 4	Glyce + BERT	Recall	81.4	# 1
Chinese Word Segmentation	PKU	Glyce + BERT	F1	96.7	# 2
Chinese Word Segmentation	PKU	Glyce + BERT	Precision	97.1	# 1
Chinese Word Segmentation	PKU	Glyce + BERT	Recall	96.4	# 1
Chinese Named Entity Recognition	Resume NER	Glyce + BERT	F1	96.54	# 5
Chinese Named Entity Recognition	Resume NER	Glyce + BERT	Precision	96.62	# 1
Chinese Named Entity Recognition	Resume NER	Glyce + BERT	Recall	96.48	# 1
Chinese Part-of-Speech Tagging	UD1	Glyce + BERT	F1	96.14	# 1
Chinese Part-of-Speech Tagging	UD1	Glyce + BERT	Precision	96.19	# 1
Chinese Part-of-Speech Tagging	UD1	Glyce + BERT	Recall	96.1	# 1
Chinese Named Entity Recognition	Weibo NER	Glyce + BERT	F1	67.6	# 9
Chinese Named Entity Recognition	Weibo NER	Glyce + BERT	Precision	67.68	# 1
Chinese Named Entity Recognition	Weibo NER	Glyce + BERT	Recall	67.71	# 1
Chinese Sentence Pair Classification	XNLI	Glyce + BERT	Accuracy	79.2	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/glyce-glyph-vectors-for-chinese-character/chinese-word-segmentation-on-as)](https://paperswithcode.com/sota/chinese-word-segmentation-on-as?p=glyce-glyph-vectors-for-chinese-character)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/glyce-glyph-vectors-for-chinese-character/chinese-sentence-pair-classification-on-bq)](https://paperswithcode.com/sota/chinese-sentence-pair-classification-on-bq?p=glyce-glyph-vectors-for-chinese-character)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/glyce-glyph-vectors-for-chinese-character/chinese-dependency-parsing-on-chinese)](https://paperswithcode.com/sota/chinese-dependency-parsing-on-chinese?p=glyce-glyph-vectors-for-chinese-character)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/glyce-glyph-vectors-for-chinese-character/chinese-sentence-pair-classification-on)](https://paperswithcode.com/sota/chinese-sentence-pair-classification-on?p=glyce-glyph-vectors-for-chinese-character)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/glyce-glyph-vectors-for-chinese-character/chinese-semantic-role-labeling-on-conll-2009)](https://paperswithcode.com/sota/chinese-semantic-role-labeling-on-conll-2009?p=glyce-glyph-vectors-for-chinese-character)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/glyce-glyph-vectors-for-chinese-character/chinese-part-of-speech-tagging-on-ctb6)](https://paperswithcode.com/sota/chinese-part-of-speech-tagging-on-ctb6?p=glyce-glyph-vectors-for-chinese-character)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/glyce-glyph-vectors-for-chinese-character/chinese-part-of-speech-tagging-on-ctb9)](https://paperswithcode.com/sota/chinese-part-of-speech-tagging-on-ctb9?p=glyce-glyph-vectors-for-chinese-character)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/glyce-glyph-vectors-for-chinese-character/chinese-sentence-pair-classification-on-fudan)](https://paperswithcode.com/sota/chinese-sentence-pair-classification-on-fudan?p=glyce-glyph-vectors-for-chinese-character)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/glyce-glyph-vectors-for-chinese-character/chinese-sentence-pair-classification-on-ifeng)](https://paperswithcode.com/sota/chinese-sentence-pair-classification-on-ifeng?p=glyce-glyph-vectors-for-chinese-character)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/glyce-glyph-vectors-for-chinese-character/chinese-sentence-pair-classification-on-lcqmc)](https://paperswithcode.com/sota/chinese-sentence-pair-classification-on-lcqmc?p=glyce-glyph-vectors-for-chinese-character)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/glyce-glyph-vectors-for-chinese-character/chinese-sentence-pair-classification-on-nlpcc)](https://paperswithcode.com/sota/chinese-sentence-pair-classification-on-nlpcc?p=glyce-glyph-vectors-for-chinese-character)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/glyce-glyph-vectors-for-chinese-character/chinese-part-of-speech-tagging-on-ud1)](https://paperswithcode.com/sota/chinese-part-of-speech-tagging-on-ud1?p=glyce-glyph-vectors-for-chinese-character)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/glyce-glyph-vectors-for-chinese-character/chinese-sentence-pair-classification-on-xnli)](https://paperswithcode.com/sota/chinese-sentence-pair-classification-on-xnli?p=glyce-glyph-vectors-for-chinese-character)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/glyce-glyph-vectors-for-chinese-character/chinese-word-segmentation-on-cityu)](https://paperswithcode.com/sota/chinese-word-segmentation-on-cityu?p=glyce-glyph-vectors-for-chinese-character)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/glyce-glyph-vectors-for-chinese-character/chinese-part-of-speech-tagging-on-ctb5)](https://paperswithcode.com/sota/chinese-part-of-speech-tagging-on-ctb5?p=glyce-glyph-vectors-for-chinese-character)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/glyce-glyph-vectors-for-chinese-character/chinese-word-segmentation-on-pku)](https://paperswithcode.com/sota/chinese-word-segmentation-on-pku?p=glyce-glyph-vectors-for-chinese-character)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/glyce-glyph-vectors-for-chinese-character/chinese-word-segmentation-on-msr)](https://paperswithcode.com/sota/chinese-word-segmentation-on-msr?p=glyce-glyph-vectors-for-chinese-character)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/glyce-glyph-vectors-for-chinese-character/chinese-named-entity-recognition-on-resume)](https://paperswithcode.com/sota/chinese-named-entity-recognition-on-resume?p=glyce-glyph-vectors-for-chinese-character)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/glyce-glyph-vectors-for-chinese-character/chinese-named-entity-recognition-on-msra)](https://paperswithcode.com/sota/chinese-named-entity-recognition-on-msra?p=glyce-glyph-vectors-for-chinese-character)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/glyce-glyph-vectors-for-chinese-character/chinese-named-entity-recognition-on-ontonotes)](https://paperswithcode.com/sota/chinese-named-entity-recognition-on-ontonotes?p=glyce-glyph-vectors-for-chinese-character)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/glyce-glyph-vectors-for-chinese-character/chinese-named-entity-recognition-on-weibo-ner)](https://paperswithcode.com/sota/chinese-named-entity-recognition-on-weibo-ner?p=glyce-glyph-vectors-for-chinese-character)`

Glyce: Glyph-vectors for Chinese Character Representations

NeurIPS 2019 · Yuxian Meng, Wei Wu, Fei Wang, Xiaoya Li, Ping Nie, Fan Yin, Muyu Li, Qinghong Han, Xiaofei Sun, Jiwei Li ·

It is intuitive that NLP tasks for logographic languages like Chinese should benefit from the use of the glyph information in those languages. However, due to the lack of rich pictographic evidence in glyphs and the weak generalization ability of standard computer vision models on character data, an effective way to utilize the glyph information remains to be found. In this paper, we address this gap by presenting Glyce, the glyph-vectors for Chinese character representations. We make three major innovations: (1) We use historical Chinese scripts (e.g., bronzeware script, seal script, traditional Chinese, etc) to enrich the pictographic evidence in characters; (2) We design CNN structures (called tianzege-CNN) tailored to Chinese character image processing; and (3) We use image-classification as an auxiliary task in a multi-task learning setup to increase the model's ability to generalize. We show that glyph-based models are able to consistently outperform word/char ID-based models in a wide range of Chinese NLP tasks. We are able to set new state-of-the-art results for a variety of Chinese NLP tasks, including tagging (NER, CWS, POS), sentence pair classification, single sentence classification tasks, dependency parsing, and semantic role labeling. For example, the proposed model achieves an F1 score of 80.6 on the OntoNotes dataset of NER, +1.5 over BERT; it achieves an almost perfect accuracy of 99.8\% on the Fudan corpus for text classification. Code found at https://github.com/ShannonAI/glyce.

PDF Abstract NeurIPS 2019 PDF NeurIPS 2019 Abstract

Code

Add Remove Mark official

ShannonAI/glyce official

417

zhangyuwangumass/Glyph-based-Chines…

Tasks

Add Remove

Chinese Dependency Parsing

Chinese Named Entity Recognition

Chinese Part-of-Speech Tagging

Chinese Semantic Role Labeling

Chinese Sentence Pair Classification

Chinese Word Segmentation

Classification

Dependency Parsing

Document Classification

General Classification

Image Classification

Language Modelling

Machine Translation

Multi-Task Learning

NER

Part-Of-Speech Tagging

POS

Semantic Role Labeling

Semantic Textual Similarity

Sentence

Sentence Classification

Sentence-Pair Classification

Sentiment Analysis

text-classification

Text Classification

Datasets

ImageNet

XNLI CoNLL

Weibo NER

Resume NER MSRA CN NER OntoNotes 4.0 CoNLL-2009 LCQMC

Results from the Paper

Edit

Ranked #1 on Chinese Sentence Pair Classification on LCQMC

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Chinese Word Segmentation	AS	Glyce + BERT	F1	96.7	# 1	Compare
			Precision	96.6	# 1	Compare
			Recall	96.8	# 1	Compare
Chinese Sentence Pair Classification	BQ	Glyce + BERT	Accuracy	85.8	# 1	Compare
			F1	85.5	# 2	Compare
			Precision	84.2	# 1	Compare
			Recall	86.9	# 1	Compare
Chinese Dependency Parsing	Chinese Pennbank	Biaffine + Glyce	LAS	89	# 1	Compare
Chinese Dependency Parsing	Chinese Pennbank	Biaffine + Glyce	UAS	90.2	# 1	Compare
Chinese Sentence Pair Classification	ChnSentiCorp	Glyce + BERT	Accuracy	95.9	# 1	Compare
Chinese Word Segmentation	CITYU	Glyce + BERT	F1	97.9	# 2	Compare
			Precision	97.9	# 1	Compare
			Recall	98	# 1	Compare
Chinese Semantic Role Labeling	CoNLL-2009	k-order pruning + Glyce	F1	83.7	# 1	Compare
			Precision	85.4	# 1	Compare
			Recall	82.1	# 1	Compare
Chinese Part-of-Speech Tagging	CTB5	Glyce + BERT	F1	96.61	# 2	Compare
			Precision	96.5	# 1	Compare
			Recall	96.74	# 1	Compare
Chinese Part-of-Speech Tagging	CTB6	Glyce + BERT	F1	95.41	# 1	Compare
			Precision	95.56	# 1	Compare
			Recall	95.26	# 1	Compare
Chinese Part-of-Speech Tagging	CTB9	Glyce + BERT	F1	93.15	# 1	Compare
			Precision	93.49	# 1	Compare
			Recall	92.84	# 1	Compare
Chinese Sentence Pair Classification	Fudan corpus	Glyce + BERT	Accuracy	99.8	# 1	Compare
Chinese Sentence Pair Classification	iFeng	Glyce + BERT	Accuracy	87.5	# 1	Compare
Chinese Sentence Pair Classification	LCQMC	Glyce + BERT....	Accuracy	88.7	# 1	Compare
			F1	88.8	# 1	Compare
			Precision	86.8	# 1	Compare
			Recall	91.2	# 1	Compare
Chinese Word Segmentation	MSR	Glyce + BERT	F1	98.3	# 5	Compare
			Precision	98.2	# 1	Compare
			Recall	98.3	# 1	Compare
Chinese Named Entity Recognition	MSRA	Glyce + BERT	F1	95.54	# 8	Compare
			Precision	95.57	# 1	Compare
			Recall	95.51	# 1	Compare
Chinese Sentence Pair Classification	NLPCC-DBQA	Glyce + BERT	F1	83.4	# 1	Compare
			Precision	81.1	# 1	Compare
			Recall	85.8	# 1	Compare
Chinese Named Entity Recognition	OntoNotes 4	Glyce + BERT	F1	80.62	# 8	Compare
			Precision	81.87	# 1	Compare
			Recall	81.4	# 1	Compare
Chinese Word Segmentation	PKU	Glyce + BERT	F1	96.7	# 2	Compare
			Precision	97.1	# 1	Compare
			Recall	96.4	# 1	Compare
Chinese Named Entity Recognition	Resume NER	Glyce + BERT	F1	96.54	# 5	Compare
			Precision	96.62	# 1	Compare
			Recall	96.48	# 1	Compare
Chinese Part-of-Speech Tagging	UD1	Glyce + BERT	F1	96.14	# 1	Compare
			Precision	96.19	# 1	Compare
			Recall	96.1	# 1	Compare
Chinese Named Entity Recognition	Weibo NER	Glyce + BERT	F1	67.6	# 9	Compare
			Precision	67.68	# 1	Compare
			Recall	67.71	# 1	Compare
Chinese Sentence Pair Classification	XNLI	Glyce + BERT	Accuracy	79.2	# 1	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Glyce: Glyph-vectors for Chinese Character Representations

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove