Semantic Textual Similarity
560 papers with code • 13 benchmarks • 17 datasets
Semantic textual similarity deals with determining how similar two pieces of text are. This can take the form of assigning a graded similarity score, e.g., from 0 to 5 as in the STS Benchmark. Related tasks are paraphrase and duplicate identification.
Image source: Learning Semantic Textual Similarity from Conversations
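For readers who want to try the task directly, here is a minimal sketch using the sentence-transformers library to embed a sentence pair and score it with cosine similarity; the model name all-MiniLM-L6-v2 is just one common off-the-shelf choice, not prescribed by this page:

```python
# Minimal STS sketch with the sentence-transformers library.
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("all-MiniLM-L6-v2")  # one common choice of encoder

sentences = [
    "A man is playing a guitar.",
    "Someone is strumming an instrument.",
]
embeddings = model.encode(sentences)

# Cosine similarity lies in [-1, 1]; STS-style annotations rescale graded
# similarity to a 0-5 range, but here we just report the raw score.
score = util.cos_sim(embeddings[0], embeddings[1]).item()
print(f"cosine similarity: {score:.3f}")
```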
Libraries
Use these libraries to find Semantic Textual Similarity models and implementations.

Latest papers
Semantic Textual Similarity Assessment in Chest X-ray Reports Using a Domain-Specific Cosine-Based Metric
Medical language processing and deep learning techniques have emerged as critical tools for improving healthcare, particularly in the analysis of medical imaging and medical text data.
SemRel2024: A Collection of Semantic Textual Relatedness Datasets for 14 Languages
Exploring and quantifying semantic relatedness is central to representing language.
Pixel Sentence Representation Learning
To our knowledge, this is the first representation learning method devoid of traditional language models for understanding sentence and document semantics, marking a stride closer to human-like textual comprehension.
OrderBkd: Textual backdoor attack through repositioning
The use of third-party datasets and pre-trained machine learning models poses a threat to NLP systems due to the possibility of hidden backdoor attacks.
HQA-Attack: Toward High Quality Black-Box Hard-Label Adversarial Attack on Text
Black-box hard-label adversarial attack on text is a practical and challenging task, as the text data space is inherently discrete and non-differentiable, and only the predicted label is accessible.
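To make the hard-label setting concrete, a generic greedy query loop is sketched below; it is not the HQA-Attack algorithm, and query_label and get_synonyms are hypothetical placeholders for the victim model's label oracle and a synonym source:

```python
# Generic sketch of a hard-label black-box attack loop, NOT HQA-Attack:
# we only assume a query_label() oracle returning the predicted class
# and a get_synonyms() helper; both are hypothetical placeholders.
import random

def hard_label_attack(words, true_label, query_label, get_synonyms, max_queries=500):
    """Greedily swap words for synonyms until the predicted label flips."""
    adv = list(words)
    queries = 0
    for i in random.sample(range(len(adv)), len(adv)):  # visit positions in random order
        for candidate in get_synonyms(adv[i]):
            if queries >= max_queries:
                return None  # query budget exhausted
            original = adv[i]
            adv[i] = candidate
            queries += 1
            if query_label(" ".join(adv)) != true_label:
                return " ".join(adv)  # label flipped: adversarial example found
            adv[i] = original  # revert and try the next candidate
    return None
```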
Benchmarking Transferable Adversarial Attacks
The robustness of deep learning models against adversarial attacks remains a pivotal concern.
DenoSent: A Denoising Objective for Self-Supervised Sentence Representation Learning
Contrastive methods regularize the representation space by pulling similar sentence representations closer and pushing dissimilar ones apart, and have proven effective in various NLP tasks, e.g., semantic textual similarity (STS).
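For context, the pull-together/push-apart objective described here is commonly instantiated as an InfoNCE-style loss over in-batch pairs (as popularized by SimCSE); the sketch below shows that generic loss, not DenoSent's denoising objective:

```python
# InfoNCE-style contrastive loss over in-batch sentence pairs.
import torch
import torch.nn.functional as F

def contrastive_loss(anchors, positives, temperature=0.05):
    """anchors, positives: (batch, dim) embeddings of paired sentences.

    Each anchor is pulled toward its own positive (the diagonal) and
    pushed away from every other in-batch embedding (off-diagonal negatives).
    """
    a = F.normalize(anchors, dim=-1)
    p = F.normalize(positives, dim=-1)
    logits = a @ p.T / temperature    # (batch, batch) cosine similarities
    labels = torch.arange(a.size(0))  # positive pair sits on the diagonal
    return F.cross_entropy(logits, labels)
```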
Contrastive Learning in Distilled Models
Natural Language Processing models like BERT can provide state-of-the-art word embeddings for downstream NLP tasks.
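As a concrete illustration of that claim, here is a minimal sketch of extracting sentence embeddings from BERT with the Hugging Face transformers library; mean pooling over token vectors is an assumption, one of several common strategies:

```python
# Extract a sentence embedding from BERT via mean pooling.
import torch
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModel.from_pretrained("bert-base-uncased")

batch = tokenizer(["BERT provides contextual embeddings."],
                  return_tensors="pt", padding=True, truncation=True)
with torch.no_grad():
    token_vectors = model(**batch).last_hidden_state  # (1, seq_len, 768)

# Mean-pool over non-padding tokens to get one vector per sentence.
mask = batch["attention_mask"].unsqueeze(-1)
embedding = (token_vectors * mask).sum(1) / mask.sum(1)
print(embedding.shape)  # torch.Size([1, 768])
```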
Noise Contrastive Estimation-based Matching Framework for Low-Resource Security Attack Pattern Recognition
Tactics, Techniques and Procedures (TTPs) represent sophisticated attack patterns in the cybersecurity domain, described encyclopedically in textual knowledge bases.
A character-based steganography using masked language modeling
In this study, a steganography method based on the BERT transformer model is proposed for hiding text data in cover text.
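The underlying idea of masked-language-model steganography can be sketched generically: mask a slot in the cover text, take BERT's top-k predictions for it, and let the choice among them encode secret bits. The snippet below illustrates that general idea, assuming bert-base-uncased and a hypothetical hide_bits helper; it is not the paper's character-based scheme:

```python
# Generic masked-LM steganography sketch: hide bits by choosing among
# BERT's top-k predictions for a masked slot (illustrative only).
import torch
from transformers import AutoModelForMaskedLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModelForMaskedLM.from_pretrained("bert-base-uncased")

def hide_bits(cover_with_mask, secret_bits, k=4):
    """Pick the candidate token whose rank encodes the next log2(k) bits."""
    batch = tokenizer(cover_with_mask, return_tensors="pt")
    mask_pos = (batch["input_ids"][0] == tokenizer.mask_token_id).nonzero()[0]
    with torch.no_grad():
        logits = model(**batch).logits[0, mask_pos]
    top_ids = logits.topk(k).indices.squeeze(0)  # k most likely fillers
    index = int("".join(secret_bits), 2)         # bits -> candidate rank
    token = tokenizer.decode([int(top_ids[index])])
    return cover_with_mask.replace(tokenizer.mask_token, token)

# Hides the bits "10" by choosing the 3rd-ranked filler for the mask.
print(hide_bits("The weather is [MASK] today.", ["1", "0"]))
```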