Part-Of-Speech Tagging

214 papers with code • 15 benchmarks • 26 datasets

Part-of-speech tagging (POS tagging) is the task of labeling each word in a text with its part of speech. A part of speech is a category of words with similar grammatical properties. Common English parts of speech include noun, verb, adjective, adverb, pronoun, preposition, and conjunction.

Example:

Vinken , 61 years old
NNP , CD NNS JJ
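
The alignment in the example above can be sketched in a few lines of Python. This is only an illustration, not a tagger: the tokens and tags are copied from the example, the helper name `pair_tokens_with_tags` is hypothetical, and the glosses assume the tags follow the Penn Treebank tagset.

```python
# Tokens and tags copied from the example above.
tokens = ["Vinken", ",", "61", "years", "old"]
tags = ["NNP", ",", "CD", "NNS", "JJ"]

# Glosses for the tags, assuming the Penn Treebank tagset.
TAG_GLOSSES = {
    "NNP": "proper noun, singular",
    ",": "comma",
    "CD": "cardinal number",
    "NNS": "noun, plural",
    "JJ": "adjective",
}


def pair_tokens_with_tags(tokens, tags):
    """Return (token, tag) pairs; POS tagging assigns one tag per token."""
    if len(tokens) != len(tags):
        raise ValueError("expected exactly one tag per token")
    return list(zip(tokens, tags))


tagged = pair_tokens_with_tags(tokens, tags)

# Print each token with its tag and a human-readable gloss.
for token, tag in tagged:
    print(f"{token}\t{tag}\t{TAG_GLOSSES.get(tag, 'unknown tag')}")
```

A real tagger predicts the tag sequence from the tokens; the output has the same shape as `tagged`, one (token, tag) pair per input token.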

Sentence Embedding Models for Ancient Greek Using Multilingual Knowledge Distillation

kevinkrahn/ancient-greek-datasets 24 Aug 2023

In this work, we use a multilingual knowledge distillation approach to train BERT models to produce sentence embeddings for Ancient Greek text.

6

MC-DRE: Multi-Aspect Cross Integration for Drug Event/Entity Extraction

adlnlp/mc-dre 12 Aug 2023

Extracting meaningful drug-related information chunks, such as adverse drug events (ADE), is crucial for preventing morbidity and saving many lives.

0

Enhancing Cross-lingual Transfer via Phonemic Transcription Integration

nhhoang96/phonemic_xlingual 10 Jul 2023

Particularly, we propose unsupervised alignment objectives to capture (1) local one-to-one alignment between the two different modalities, (2) alignment via multi-modality contexts to leverage information from additional modalities, and (3) alignment via multilingual contexts where additional bilingual dictionaries are incorporated.

2

Taqyim: Evaluating Arabic NLP Tasks Using ChatGPT Models

arbml/taqyim 28 Jun 2023

Large language models (LLMs) have demonstrated impressive performance on various downstream tasks without requiring fine-tuning, including ChatGPT, a chat-based model built on top of LLMs such as GPT-3.5 and GPT-4.

18

Supplementary Features of BiLSTM for Enhanced Sequence Labeling

conglei2xu/global-context-mechanism 31 May 2023

Sequence labeling tasks require the computation of sentence representations for each word within a given sentence.

17

MasakhaPOS: Part-of-Speech Tagging for Typologically Diverse African Languages

masakhane-io/masakhane-pos 23 May 2023

In this paper, we present MasakhaPOS, the largest part-of-speech (POS) dataset for 20 typologically diverse African languages.

15

Technical Report: Impact of Position Bias on Language Models in Token Classification

mehdibenamorr/Token-Positional-Bias 26 Apr 2023

Therefore, we conduct an in-depth evaluation of the impact of position bias on the performance of LMs when fine-tuned on token classification benchmarks.

0

Does Manipulating Tokenization Aid Cross-Lingual Transfer? A Study on POS Tagging for Non-Standardized Languages

mainlp/noisydialect 20 Apr 2023

This can for instance be observed when finetuning PLMs on one language and evaluating them on data in a closely related language variety with no standardized orthography.

2

BRENT: Bidirectional Retrieval Enhanced Norwegian Transformer

ltgoslo/brent 19 Apr 2023

After training, we also separate the language model, which we call the reader, from the retriever components, and show that this can be fine-tuned on a range of downstream tasks.

3

Classification of US Supreme Court Cases using BERT-Based Techniques

shubham30vatsal/web-of-law 17 Apr 2023

We compare our results for two classification tasks: (1) a broad classification task with 15 categories and (2) a fine-grained classification task with 279 categories.

2