Semantic Textual Similarity

556 papers with code • 13 benchmarks • 17 datasets

Semantic textual similarity deals with determining how similar two pieces of texts are. This can take the form of assigning a score from 1 to 5. Related tasks are paraphrase or duplicate identification.

Image source: Learning Semantic Textual Similarity from Conversations

Benchmarks

Add a Result

These leaderboards are used to track progress in Semantic Textual Similarity

Dataset	Best Model	Compare
STS Benchmark	MT-DNN-SMART	See all
MRPC	MT-DNN-SMART	See all
MTEB	ST5-XXL	See all
STS13	AnglE-LLaMA-13B	See all
SICK	PromCSE-RoBERTa-large (0.355B)	See all
STS12	PromptEOL+CSE+OPT-13B	See all
STS14	AnglE-LLaMA-13B	See all
STS15	AnglE-LLaMA-13B	See all
STS16	AnglE-LLaMA-13B	See all
SentEval	GenSen	See all
CxC	PromCSE-RoBERTa-large (0.355B)	See all
SICK-R	AnglE-LLaMA-7B	See all
MRPC Dev	Synthesizer (R+V)	See all

Show all 13 benchmarks

Collapse benchmarks

Libraries

Use these libraries to find Semantic Textual Similarity models and implementations

huggingface/transformers

9 papers

124,457

facebookresearch/xformers

3 papers

7,508

facebookresearch/InferSent

3 papers

2,279

namisan/mt-dnn

3 papers

2,198

See all 11 libraries.

Datasets

Subtasks

Latest papers

Most implemented Social Latest No code

A Collection of Pragmatic-Similarity Judgments over Spoken Dialog Utterances

divettemarco/pragsim • 21 Mar 2024

While there exist measures for semantic similarity and prosodic similarity, there are as yet none for pragmatic similarity.

21 Mar 2024

Paper
Code

Learning to Rematch Mismatched Pairs for Robust Cross-Modal Retrieval

hhc1997/l2rm • • 8 Mar 2024

To achieve this, we propose L2RM, a general framework based on Optimal Transport (OT) that learns to rematch mismatched pairs.

08 Mar 2024

Paper
Code

SAM-PD: How Far Can SAM Take Us in Tracking and Segmenting Anything in Videos by Prompt Denoising

infzhou/sam-pd • • 7 Mar 2024

Recently, promptable segmentation models, such as the Segment Anything Model (SAM), have demonstrated robust zero-shot generalization capabilities on static images.

07 Mar 2024

Paper
Code

EAGLE: Eigen Aggregation Learning for Object-Centric Unsupervised Semantic Segmentation

MICV-yonsei/EAGLE • • 3 Mar 2024

Semantic segmentation has innately relied on extensive pixel-level annotated data, leading to the emergence of unsupervised methodologies.

03 Mar 2024

Paper
Code

NextLevelBERT: Investigating Masked Language Modeling with Higher-Level Representations for Long Documents

aiintelligentsystems/next-level-bert • • 27 Feb 2024

While (large) language models have significantly improved over the last years, they still struggle to sensibly process long sequences found, e. g., in books, due to the quadratic scaling of the underlying attention mechanism.

27 Feb 2024

Paper
Code

The Impact of Word Splitting on the Semantic Content of Contextualized Word Representations

ainagari/splitsim • • 22 Feb 2024

When deriving contextualized word representations from language models, a decision needs to be made on how to obtain one for out-of-vocabulary (OOV) words that are segmented into subwords.

22 Feb 2024

Paper
Code

DrBenchmark: A Large Language Understanding Evaluation Benchmark for French Biomedical Domain

drbenchmark/drbenchmark • • 20 Feb 2024

This limitation hampers the evaluation of the latest French biomedical models, as they are either assessed on a minimal number of tasks with non-standardized protocols or evaluated using general downstream tasks.

20 Feb 2024

Paper
Code

UMBCLU at SemEval-2024 Task 1A and 1C: Semantic Textual Relatedness with and without machine translation

dipta007/semeval24-task8 • • 20 Feb 2024

The aim of SemEval-2024 Task 1, "Semantic Textual Relatedness for African and Asian Languages" is to develop models for identifying semantic textual relatedness (STR) between two sentences using multiple languages (14 African and Asian languages) and settings (supervised, unsupervised, and cross-lingual).

20 Feb 2024

Paper
Code

Semantic Textual Similarity Assessment in Chest X-ray Reports Using a Domain-Specific Cosine-Based Metric

sayeh1994/medical-corpus-semantic-similarity-evaluation • 19 Feb 2024

Medical language processing and deep learning techniques have emerged as critical tools for improving healthcare, particularly in the analysis of medical imaging and medical text data.

19 Feb 2024

Paper
Code

SemRel2024: A Collection of Semantic Textual Relatedness Datasets for 14 Languages

semantic-textual-relatedness/semantic_relatedness_semeval2024 • • 13 Feb 2024

Exploring and quantifying semantic relatedness is central to representing language.

13 Feb 2024

Paper
Code

Semantic Textual Similarity

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Latest papers

Content

Benchmarks

Add a Result