Chinese Word Segmentation
48 papers with code • 6 benchmarks • 3 datasets
Chinese word segmentation is the task of splitting Chinese text (i.e. a sequence of Chinese characters) into words (Source: www.nlpprogress.com).
Benchmarks
These leaderboards are used to track progress in Chinese Word Segmentation
Latest papers with no code
An Effective Incorporating Heterogeneous Knowledge Curriculum Learning for Sequence Labeling
To address this challenge, we propose a two-stage curriculum learning (TCL) framework specifically designed for sequence labeling tasks.
Incorporating Deep Syntactic and Semantic Knowledge for Chinese Sequence Labeling with GCN
Recently, it is quite common to integrate Chinese sequence labeling results to enhance syntactic and semantic parsing.
Joint Chinese Word Segmentation and Span-based Constituency Parsing
In constituency parsing, span-based decoding is an important direction.
Mining Word Boundaries in Speech as Naturally Annotated Word Segmentation Data
Inspired by early research on exploring naturally annotated data for Chinese word segmentation (CWS), and also by recent research on integration of speech and text processing, this work for the first time proposes to mine word boundaries from parallel speech/text data.
That Slepen Al the Nyght with Open Ye! Cross-era Sequence Segmentation with Switch-memory
The evolution of language follows the rule of gradual change.
A New Evaluation Method: Evaluation Data and Metrics for Chinese Grammar Error Correction
In terms of the reference-based metric, we introduce sentence-level accuracy and char-level BLEU to evaluate the corrected sentences.
Chinese Word Segmentation with Heterogeneous Graph Neural Network
In recent years, deep learning has achieved significant success in the Chinese word segmentation (CWS) task.
Joint Chinese Word Segmentation and Part-of-speech Tagging via Two-stage Span Labeling
Previous studies on joint Chinese word segmentation and part-of-speech tagging mainly follow the character-based tagging model focusing on modeling n-gram features.
Green CWS: Extreme Distillation and Efficient Decode Method Towards Industrial Application
Benefiting from the strong ability of the pre-trained model, the research on Chinese Word Segmentation (CWS) has made great progress in recent years.
Unsupervised Chinese Word Segmentation with BERT Oriented Probing and Transformation
Word Segmentation is a fundamental step for understanding Chinese language.