Semantic Similarity

417 papers with code • 8 benchmarks • 12 datasets

The main objective Semantic Similarity is to measure the distance between the semantic meanings of a pair of words, phrases, sentences, or documents. For example, the word “car” is more similar to “bus” than it is to “cat”. The two main approaches to measuring Semantic Similarity are knowledge-based approaches and corpus-based, distributional methods.

Source: Visual and Semantic Knowledge Transfer for Large Scale Semi-supervised Object Detection

Benchmarks

Add a Result

These leaderboards are used to track progress in Semantic Similarity

Dataset	Best Model	Compare
SICK	Dependency Tree-LSTM (Tai et al., 2015)	See all
Annotated corpus for semantic similarity of clinical trial outcomes (original corpus)	BioBERT (pre-trained on PubMed abstracts + PMC, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, original corpus")	See all
Annotated corpus for semantic similarity of clinical trial outcomes (expanded corpus)	BioBERT (pre-trained on PubMed abstracts + PMC, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, expanded corpus")	See all
BIOSSES	NCBI_BERT(base) (P+M)	See all
MedSTS	NCBI_BERT(base) (P+M)	See all
ClinicalSTS	CharacterBERT (base, medical, ensemble)	See all
CHIP-STS	MacBERT-large	See all
STS Benchmark	Def2Vec	See all

Libraries

Use these libraries to find Semantic Similarity models and implementations

faceonlive/ai-research

2 papers

159

juliendenize/eztorch

2 papers

Datasets

Subtasks

Similarity Explanation

Latest papers

Most implemented Social Latest No code

A Collection of Pragmatic-Similarity Judgments over Spoken Dialog Utterances

divettemarco/pragsim • 21 Mar 2024

While there exist measures for semantic similarity and prosodic similarity, there are as yet none for pragmatic similarity.

21 Mar 2024

Paper
Code

Learning to Rematch Mismatched Pairs for Robust Cross-Modal Retrieval

hhc1997/l2rm • • 8 Mar 2024

To achieve this, we propose L2RM, a general framework based on Optimal Transport (OT) that learns to rematch mismatched pairs.

08 Mar 2024

Paper
Code

SAM-PD: How Far Can SAM Take Us in Tracking and Segmenting Anything in Videos by Prompt Denoising

infzhou/sam-pd • • 7 Mar 2024

Recently, promptable segmentation models, such as the Segment Anything Model (SAM), have demonstrated robust zero-shot generalization capabilities on static images.

07 Mar 2024

Paper
Code

EAGLE: Eigen Aggregation Learning for Object-Centric Unsupervised Semantic Segmentation

MICV-yonsei/EAGLE • • 3 Mar 2024

Semantic segmentation has innately relied on extensive pixel-level annotated data, leading to the emergence of unsupervised methodologies.

03 Mar 2024

Paper
Code

The Impact of Word Splitting on the Semantic Content of Contextualized Word Representations

ainagari/splitsim • • 22 Feb 2024

When deriving contextualized word representations from language models, a decision needs to be made on how to obtain one for out-of-vocabulary (OOV) words that are segmented into subwords.

22 Feb 2024

Paper
Code

UMBCLU at SemEval-2024 Task 1A and 1C: Semantic Textual Relatedness with and without machine translation

dipta007/semeval24-task8 • • 20 Feb 2024

The aim of SemEval-2024 Task 1, "Semantic Textual Relatedness for African and Asian Languages" is to develop models for identifying semantic textual relatedness (STR) between two sentences using multiple languages (14 African and Asian languages) and settings (supervised, unsupervised, and cross-lingual).

20 Feb 2024

Paper
Code

Semantic Textual Similarity Assessment in Chest X-ray Reports Using a Domain-Specific Cosine-Based Metric

sayeh1994/medical-corpus-semantic-similarity-evaluation • 19 Feb 2024

Medical language processing and deep learning techniques have emerged as critical tools for improving healthcare, particularly in the analysis of medical imaging and medical text data.

19 Feb 2024

Paper
Code

SemRel2024: A Collection of Semantic Textual Relatedness Datasets for 14 Languages

semantic-textual-relatedness/semantic_relatedness_semeval2024 • • 13 Feb 2024

Exploring and quantifying semantic relatedness is central to representing language.

13 Feb 2024

Paper
Code

OrderBkd: Textual backdoor attack through repositioning

alekseevskaia/orderbkd • • 12 Feb 2024

The use of third-party datasets and pre-trained machine learning models poses a threat to NLP systems due to possibility of hidden backdoor attacks.

12 Feb 2024

Paper
Code

HQA-Attack: Toward High Quality Black-Box Hard-Label Adversarial Attack on Text

hqa-attack/hqaattack-demo • • NeurIPS 2023

Black-box hard-label adversarial attack on text is a practical and challenging task, as the text data space is inherently discrete and non-differentiable, and only the predicted label is accessible.

02 Feb 2024

Paper
Code

Semantic Similarity

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Latest papers

Content

Benchmarks

Add a Result