Chunking
68 papers with code • 5 benchmarks • 5 datasets
Chunking, also known as shallow parsing, identifies contiguous spans of tokens that form syntactic units such as noun phrases or verb phrases.
Example:
| Vinken | , | 61 | years | old |
|---|---|---|---|---|
| B-NP | I-NP | I-NP | I-NP | I-NP |
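The BIO tags in the example above mark chunk boundaries: `B-X` begins a chunk of type `X`, `I-X` continues it, and `O` marks tokens outside any chunk. A minimal sketch of decoding such a tag sequence into labeled spans (not tied to any particular library; the function name is illustrative):

```python
def decode_bio(tokens, tags):
    """Group (token, BIO-tag) pairs into (chunk_type, token_list) spans."""
    chunks = []
    current_type, current_tokens = None, []
    for token, tag in zip(tokens, tags):
        if tag.startswith("B-") or (tag.startswith("I-") and tag[2:] != current_type):
            # A new chunk starts here; flush the previous chunk, if any.
            if current_tokens:
                chunks.append((current_type, current_tokens))
            current_type, current_tokens = tag[2:], [token]
        elif tag.startswith("I-"):
            # Continue the current chunk.
            current_tokens.append(token)
        else:
            # "O" tag: token is outside any chunk; flush what we have.
            if current_tokens:
                chunks.append((current_type, current_tokens))
            current_type, current_tokens = None, []
    if current_tokens:
        chunks.append((current_type, current_tokens))
    return chunks

print(decode_bio(["Vinken", ",", "61", "years", "old"],
                 ["B-NP", "I-NP", "I-NP", "I-NP", "I-NP"]))
# One noun-phrase chunk spanning all five tokens, matching the table above.
```

Treating an `I-X` tag whose type differs from the open chunk as an implicit chunk start makes the decoder robust to slightly inconsistent tagger output.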
Libraries
Use these libraries to find Chunking models and implementations.
Latest papers
Weighted Training for Cross-Task Learning
In this paper, we introduce Target-Aware Weighted Training (TAWT), a weighted training algorithm for cross-task learning based on minimizing a representation-based task distance between the source and target tasks.
Improving Named Entity Recognition by External Context Retrieving and Cooperative Learning
We find empirically that the contextual representations computed on the retrieval-based input view, constructed through the concatenation of a sentence and its external contexts, can achieve significantly improved performance compared to the original input view based only on the sentence.
Neural Sequence Segmentation as Determining the Leftmost Segments
Prior methods for text segmentation mostly operate at the token level.
Does Chinese BERT Encode Word Structure?
Contextualized representations give significantly improved results for a wide range of NLP tasks.
Automated Concatenation of Embeddings for Structured Prediction
Pretrained contextualized embeddings are powerful word representations for structured prediction tasks.
AIN: Fast and Accurate Sequence Labeling with Approximate Inference Network
The linear-chain Conditional Random Field (CRF) model is one of the most widely-used neural sequence labeling approaches.
Continual General Chunking Problem and SyncMap
Here, we propose a continual generalization of the chunking problem (an unsupervised problem), encompassing fixed and probabilistic chunks, discovery of temporal and causal structures and their continual variations.
Recurrent Chunking Mechanisms for Long-Text Machine Reading Comprehension
In this paper, we study machine reading comprehension (MRC) on long texts, where a model takes as inputs a lengthy document and a question and then extracts a text span from the document as an answer.
The Structured Weighted Violations MIRA
We present the Structured Weighted Violation MIRA (SWVM), a new structured prediction algorithm that is based on a hybridization of MIRA (Crammer and Singer, 2003) and the structured weighted violations perceptron (SWVP) (Dror and Reichart, 2016).
Capturing Global Informativeness in Open Domain Keyphrase Extraction
Open-domain KeyPhrase Extraction (KPE) aims to extract keyphrases from documents without domain or quality restrictions, e.g., web pages of varying domains and quality.