Part-Of-Speech Tagging

214 papers with code • 15 benchmarks • 26 datasets

Part-of-speech tagging (POS tagging) is the task of assigning each word in a text its part of speech. A part of speech is a category of words with similar grammatical properties. Common English parts of speech include noun, verb, adjective, adverb, pronoun, preposition, and conjunction.

Example:

Vinken	,	61	years	old
NNP	,	CD	NNS	JJ
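
The example above can be sketched in code as a list of (token, tag) pairs. This is only an illustration: the tags appear to follow the Penn Treebank tagset, and the helper name `pair_tokens_with_tags` and the `TAG_GLOSS` table are hypothetical names introduced here, not part of any library.

```python
# Tokens and tags copied from the example above.
tokens = ["Vinken", ",", "61", "years", "old"]
tags = ["NNP", ",", "CD", "NNS", "JJ"]

# Short glosses for the tags used here (assumed Penn Treebank tagset).
TAG_GLOSS = {
    "NNP": "proper noun, singular",
    ",": "comma",
    "CD": "cardinal number",
    "NNS": "noun, plural",
    "JJ": "adjective",
}


def pair_tokens_with_tags(tokens, tags):
    """Return (token, tag) pairs; a tagger emits exactly one tag per token."""
    if len(tokens) != len(tags):
        raise ValueError("tokens and tags must have the same length")
    return list(zip(tokens, tags))


tagged = pair_tokens_with_tags(tokens, tags)
for token, tag in tagged:
    # Print one token per line with its tag and a human-readable gloss.
    print(f"{token}\t{tag}\t{TAG_GLOSS[tag]}")
```

The length check reflects the defining constraint of the task: the output sequence has exactly one tag per input token.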

Benchmarks


These leaderboards track progress in part-of-speech tagging.

Dataset	Best Model
Penn Treebank	SALE-BART encoder
UD	BiLSTM-LAN
Ritter	ACE
Social media	PretRand
ARK	ACE
Tweebank	ACE
UD2.5 test	Trankit
French GSD	CamemBERT
Sequoia Treebank	CamemBERT
Spoken Corpus	CamemBERT
ParTUT	CamemBERT
DaNE	da_dacy_large_tft-0.0.0
XGLUE	mGPT
ANTILLES	Bi-LSTM-CRF + Flair Embeddings + CamemBERT (oscar-138gb-base) Embeddings
Morphosyntactic-analysis-dataset	MyBert


Libraries

Use these libraries to find Part-Of-Speech Tagging models and implementations

jiesutd/NCRFpp • 2 papers • 1,880

jiesutd/PyTorchSeqLabel • 2 papers • 1,880

Datasets

Subtasks

Unsupervised Part-Of-Speech Tagging

Latest papers with no code


Punctuation Restoration Improves Structure Understanding without Supervision

no code yet • 13 Feb 2024

Unsupervised learning objectives like language modeling and de-noising constitute a significant part in producing pre-trained models that perform various downstream applications from natural language understanding to conversational tasks.


A Comprehensive View of the Biases of Toxicity and Sentiment Analysis Methods Towards Utterances with African American English Expressions

no code yet • 23 Jan 2024

One explanation for this bias is that AI models are trained on limited datasets, in which such a term is more likely to appear in a toxic utterance.


Zero Resource Cross-Lingual Part Of Speech Tagging

no code yet • 11 Jan 2024

Our conclusion is that projected alignment data in a zero-resource language can be beneficial for predicting POS tags.


Part-of-Speech Tagger for Bodo Language using Deep Learning approach

no code yet • 6 Jan 2024

We cover several language models in the experiment to see how well they work in POS tagging tasks.


Make BERT-based Chinese Spelling Check Model Enhanced by Layerwise Attention and Gaussian Mixture Model

no code yet • 27 Dec 2023

Meanwhile, to incorporate implicit hierarchical linguistic knowledge within the encoder, we propose a novel form of n-gram-based layerwise self-attention to generate a multilayer representation.


Identifying Planetary Names in Astronomy Papers: A Multi-Step Approach

no code yet • 14 Dec 2023

The automatic identification of planetary feature names in astronomy publications presents numerous challenges.


Augmenty: A Python Library for Structured Text Augmentation

no code yet • 9 Dec 2023

Augmenty is a Python library for structured text augmentation.


Bit Cipher -- A Simple yet Powerful Word Representation System that Integrates Efficiently with Language Models

no code yet • 18 Nov 2023

While Large Language Models (LLMs) become ever more dominant, classic pre-trained word embeddings sustain their relevance through computational efficiency and nuanced linguistic interpretation.


Colloquial Persian POS (CPPOS) Corpus: A Novel Corpus for Colloquial Persian Part of Speech Tagging

no code yet • 1 Oct 2023

A comparison with another well-known Persian POS corpus named "Bijankhan" and the Persian Hazm POS tool trained on Bijankhan revealed that our model trained on CPPOS outperforms them.


Unsupervised Domain Adaptation using Lexical Transformations and Label Injection for Twitter Data

no code yet • 14 Jul 2023

A large body of literature tries to solve this problem by adapting models trained on the source domain to the target domain.
