Part-Of-Speech Tagging
214 papers with code • 15 benchmarks • 26 datasets
Part-of-speech tagging (POS tagging) is the task of labeling each word in a text with its part of speech. A part of speech is a category of words with similar grammatical properties. Common English parts of speech are noun, verb, adjective, adverb, pronoun, preposition, conjunction, etc.
Example:
| Vinken | , | 61 | years | old |
|---|---|---|---|---|
| NNP | , | CD | NNS | JJ |
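The example above can be reproduced with a toy rule-based tagger. This is only an illustrative sketch: the tiny lexicon and regex fallbacks below are hypothetical stand-ins, whereas real taggers (averaged perceptron, BiLSTM, or fine-tuned transformer models) learn their parameters from annotated corpora such as the Penn Treebank.

```python
import re

# Hypothetical mini-lexicon mapping known tokens to Penn Treebank tags.
LEXICON = {"years": "NNS", "old": "JJ", ",": ","}

def tag(token: str) -> str:
    """Return a Penn Treebank POS tag for a single token."""
    if token in LEXICON:
        return LEXICON[token]
    if re.fullmatch(r"\d+(\.\d+)?", token):
        return "CD"   # cardinal number
    if token[0].isupper():
        return "NNP"  # unknown capitalized word -> proper noun
    return "NN"       # default: common noun

tokens = "Vinken , 61 years old".split()
print([(t, tag(t)) for t in tokens])
# → [('Vinken', 'NNP'), (',', ','), ('61', 'CD'), ('years', 'NNS'), ('old', 'JJ')]
```

In practice, libraries such as NLTK or spaCy ship pretrained taggers that handle ambiguity (e.g. "book" as NN vs. VB) using sentence context rather than isolated token rules.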
Libraries
Use these libraries to find Part-Of-Speech Tagging models and implementations.
Latest papers
Sentence Embedding Models for Ancient Greek Using Multilingual Knowledge Distillation
In this work, we use a multilingual knowledge distillation approach to train BERT models to produce sentence embeddings for Ancient Greek text.
MC-DRE: Multi-Aspect Cross Integration for Drug Event/Entity Extraction
Extracting meaningful drug-related information chunks, such as adverse drug events (ADE), is crucial for preventing morbidity and saving many lives.
Enhancing Cross-lingual Transfer via Phonemic Transcription Integration
Particularly, we propose unsupervised alignment objectives to capture (1) local one-to-one alignment between the two different modalities, (2) alignment via multi-modality contexts to leverage information from additional modalities, and (3) alignment via multilingual contexts where additional bilingual dictionaries are incorporated.
Taqyim: Evaluating Arabic NLP Tasks Using ChatGPT Models
Large language models (LLMs) have demonstrated impressive performance on various downstream tasks without requiring fine-tuning; one example is ChatGPT, a chat-based model built on top of LLMs such as GPT-3.5 and GPT-4.
Supplementary Features of BiLSTM for Enhanced Sequence Labeling
Sequence labeling tasks require the computation of sentence representations for each word within a given sentence.
MasakhaPOS: Part-of-Speech Tagging for Typologically Diverse African Languages
In this paper, we present MasakhaPOS, the largest part-of-speech (POS) dataset for 20 typologically diverse African languages.
Technical Report: Impact of Position Bias on Language Models in Token Classification
Therefore, we conduct an in-depth evaluation of the impact of position bias on the performance of LMs when fine-tuned on token classification benchmarks.
Does Manipulating Tokenization Aid Cross-Lingual Transfer? A Study on POS Tagging for Non-Standardized Languages
This can for instance be observed when finetuning PLMs on one language and evaluating them on data in a closely related language variety with no standardized orthography.
BRENT: Bidirectional Retrieval Enhanced Norwegian Transformer
After training, we also separate the language model, which we call the reader, from the retriever components, and show that this can be fine-tuned on a range of downstream tasks.
Classification of US Supreme Court Cases using BERT-Based Techniques
We compare our results for two classification tasks: (1) a broad classification task with 15 categories and (2) a fine-grained classification task with 279 categories.