Chunking

68 papers with code • 5 benchmarks • 5 datasets

Chunking, also known as shallow parsing, identifies contiguous spans of tokens that form syntactic units such as noun phrases or verb phrases.

Example:

Vinken , 61 years old
B-NP I-NP I-NP I-NP I-NP
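
The per-token tags in the example follow the BIO scheme: B- opens a chunk, I- continues it, and O marks a token outside any chunk. As a minimal sketch of how such tags map back to spans, the hypothetical helper below (`bio_to_chunks` is an illustrative name, not part of any listed library) groups tags into (label, start, end) triples with an exclusive end index. It assumes a lenient convention in which an I- tag with no open chunk of the same label starts a new chunk.

```python
from typing import List, Tuple


def bio_to_chunks(tags: List[str]) -> List[Tuple[str, int, int]]:
    """Convert BIO tags to (label, start, end) spans; end is exclusive.

    Hypothetical helper for illustration only. An I- tag that does not
    continue an open chunk of the same label is treated as a chunk start
    (an assumption; stricter schemes would reject such input).
    """
    chunks: List[Tuple[str, int, int]] = []
    start, label = None, None
    for i, tag in enumerate(tags):
        # "B-NP" -> ("B", "-", "NP"); "O" -> ("O", "", "")
        prefix, _, tag_label = tag.partition("-")
        continues = prefix == "I" and label is not None and tag_label == label
        if not continues:
            # Close the chunk that was open, if any.
            if label is not None:
                chunks.append((label, start, i))
            # Open a new chunk on B- (or a stray I-); otherwise reset.
            if prefix in ("B", "I") and tag_label:
                start, label = i, tag_label
            else:
                start, label = None, None
    # Close a chunk that runs to the end of the sentence.
    if label is not None:
        chunks.append((label, start, len(tags)))
    return chunks


tokens = ["Vinken", ",", "61", "years", "old"]
tags = ["B-NP", "I-NP", "I-NP", "I-NP", "I-NP"]
for label, start, end in bio_to_chunks(tags):
    print(label, " ".join(tokens[start:end]))
```

For the sentence above, the five tags collapse into a single NP chunk covering all five tokens.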

Weighted Training for Cross-Task Learning

HornHehhf/TAWT ICLR 2022

In this paper, we introduce Target-Aware Weighted Training (TAWT), a weighted training algorithm for cross-task learning based on minimizing a representation-based task distance between the source and target tasks.

0
28 May 2021

Improving Named Entity Recognition by External Context Retrieving and Cooperative Learning

modelscope/adaseq ACL 2021

We find empirically that the contextual representations computed on the retrieval-based input view, constructed through the concatenation of a sentence and its external contexts, can achieve significantly improved performance compared to the original input view based only on the sentence.

369
08 May 2021

Neural Sequence Segmentation as Determining the Leftmost Segments

LeePleased/LeftmostSeg NAACL 2021

Prior methods to text segmentation are mostly at token level.

8
15 Apr 2021

Does Chinese BERT Encode Word Structure?

ylwangy/BERT_zh_Analysis COLING 2020

Contextualized representations give significantly improved results for a wide range of NLP tasks.

3
15 Oct 2020

Automated Concatenation of Embeddings for Structured Prediction

Alibaba-NLP/ACE ACL 2021

Pretrained contextualized embeddings are powerful word representations for structured prediction tasks.

293
10 Oct 2020

AIN: Fast and Accurate Sequence Labeling with Approximate Inference Network

Alibaba-NLP/AIN EMNLP 2020

The linear-chain Conditional Random Field (CRF) model is one of the most widely-used neural sequence labeling approaches.

18
17 Sep 2020

Continual General Chunking Problem and SyncMap

zweifel/SyncMap 14 Jun 2020

Here, we propose a continual generalization of the chunking problem (an unsupervised problem), encompassing fixed and probabilistic chunks, discovery of temporal and causal structures and their continual variations.

5
14 Jun 2020

Recurrent Chunking Mechanisms for Long-Text Machine Reading Comprehension

HongyuGong/RCM-Question-Answering ACL 2020

In this paper, we study machine reading comprehension (MRC) on long texts, where a model takes as inputs a lengthy document and a question and then extracts a text span from the document as an answer.

28
16 May 2020

The Structured Weighted Violations MIRA

dorringel/SWVM 9 May 2020

We present the Structured Weighted Violation MIRA (SWVM), a new structured prediction algorithm that is based on a hybridization between MIRA (Crammer and Singer, 2003) and the structured weighted violations perceptron (SWVP) (Dror and Reichart, 2016).

0
09 May 2020

Capturing Global Informativeness in Open Domain Keyphrase Extraction

thunlp/BERT-KPE 28 Apr 2020

Open-domain KeyPhrase Extraction (KPE) aims to extract keyphrases from documents without domain or quality restrictions, e.g., web pages with variant domains and qualities.

438
28 Apr 2020