Semantic Similarity
418 papers with code • 8 benchmarks • 12 datasets
The main objective of Semantic Similarity is to measure the distance between the semantic meanings of a pair of words, phrases, sentences, or documents. For example, the word “car” is more similar to “bus” than it is to “cat”. The two main approaches to measuring Semantic Similarity are knowledge-based approaches and corpus-based, distributional methods.
Source: Visual and Semantic Knowledge Transfer for Large Scale Semi-supervised Object Detection
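A minimal illustration of the corpus-based, distributional approach: with word vectors, cosine similarity can capture that “car” is closer to “bus” than to “cat”. The vectors below are made-up placeholders, not real embeddings.

```python
# Toy illustration of distributional semantic similarity via cosine similarity.
# In practice the vectors would come from a trained embedding model
# (e.g. word2vec or GloVe); these 4-dimensional vectors are placeholders.
import numpy as np

def cosine_similarity(a: np.ndarray, b: np.ndarray) -> float:
    """Cosine of the angle between two vectors; 1.0 means identical direction."""
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

# Hypothetical embeddings, chosen only to illustrate the idea.
vectors = {
    "car": np.array([0.9, 0.8, 0.1, 0.0]),
    "bus": np.array([0.8, 0.9, 0.2, 0.1]),
    "cat": np.array([0.1, 0.0, 0.9, 0.8]),
}

print(cosine_similarity(vectors["car"], vectors["bus"]))  # high similarity
print(cosine_similarity(vectors["car"], vectors["cat"]))  # low similarity
```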
Libraries
Use these libraries to find Semantic Similarity models and implementations.
Most implemented papers
Semantic Specialisation of Distributional Word Vector Spaces using Monolingual and Cross-Lingual Constraints
We present Attract-Repel, an algorithm for improving the semantic quality of word vectors by injecting constraints extracted from lexical resources.
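A minimal sketch of the general attract/repel idea: nudge vectors of synonym pairs closer together and push vectors of antonym pairs apart. This is not the paper's exact margin-based objective; the lexicon pairs, step sizes, and update rule are illustrative assumptions.

```python
# Sketch of constraint injection: "attract" pairs (synonyms) are pulled
# together, "repel" pairs (antonyms) are pushed apart, by simple gradient-like
# updates on the word vectors. Not the Attract-Repel objective itself.
import numpy as np

rng = np.random.default_rng(0)
vectors = {w: rng.normal(size=50) for w in ["cheap", "inexpensive", "expensive"]}

attract_pairs = [("cheap", "inexpensive")]   # e.g. from a synonym lexicon (assumed)
repel_pairs = [("cheap", "expensive")]       # e.g. from an antonym lexicon (assumed)
lr = 0.1

for _ in range(100):
    for a, b in attract_pairs:
        diff = vectors[a] - vectors[b]
        vectors[a] -= lr * diff              # move the synonym pair closer
        vectors[b] += lr * diff
    for a, b in repel_pairs:
        diff = vectors[a] - vectors[b]
        vectors[a] += lr * 0.1 * diff        # push the antonym pair apart (smaller step)
        vectors[b] -= lr * 0.1 * diff

print(np.linalg.norm(vectors["cheap"] - vectors["inexpensive"]))  # small distance
print(np.linalg.norm(vectors["cheap"] - vectors["expensive"]))    # larger distance
```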
A Semantics-Based Measure of Emoji Similarity
This paper presents a comprehensive analysis of the semantic similarity of emoji through embedding models that are learned over machine-readable emoji meanings in the EmojiNet knowledge base.
Ad Hoc Table Retrieval using Semantic Similarity
Specifically, we (i) represent queries and tables in multiple semantic spaces (both discrete sparse and continuous dense vector representations) and (ii) introduce various similarity measures for matching those semantic representations.
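A hedged sketch of points (i) and (ii): score a query-table pair in both a discrete sparse space (here, simple term overlap) and a continuous dense space (here, averaged word vectors), then combine the scores. Function names and the linear combination are illustrative assumptions, not the paper's implementation.

```python
# Combine a sparse-space and a dense-space similarity for query-table matching.
import numpy as np

def sparse_similarity(query_terms, table_terms):
    """Jaccard overlap between term sets -- one possible sparse-space measure."""
    q, t = set(query_terms), set(table_terms)
    return len(q & t) / len(q | t) if q | t else 0.0

def dense_similarity(query_terms, table_terms, embeddings):
    """Cosine between averaged word vectors -- one possible dense-space measure."""
    q_vecs = [embeddings[w] for w in query_terms if w in embeddings]
    t_vecs = [embeddings[w] for w in table_terms if w in embeddings]
    if not q_vecs or not t_vecs:
        return 0.0
    q, t = np.mean(q_vecs, axis=0), np.mean(t_vecs, axis=0)
    return float(np.dot(q, t) / (np.linalg.norm(q) * np.linalg.norm(t)))

def table_score(query_terms, table_terms, embeddings, alpha=0.5):
    """Simple linear combination of the two semantic spaces (assumed weighting)."""
    return (alpha * sparse_similarity(query_terms, table_terms)
            + (1 - alpha) * dense_similarity(query_terms, table_terms, embeddings))
```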
Large-Scale Multi-Domain Belief Tracking with Knowledge Sharing
Robust dialogue belief tracking is a key component in maintaining good quality dialogue systems.
A Joint Sequence Fusion Model for Video Question Answering and Retrieval
We present an approach named JSFusion (Joint Sequence Fusion) that can measure semantic similarity between any pair of multimodal sequence data (e.g. a video clip and a language sentence).
Cross-Lingual Cross-Platform Rumor Verification Pivoting on Multimedia Content
With the increasing popularity of smart devices, rumors with multimedia content become more and more common on social networks.
Generating More Interesting Responses in Neural Conversation Models with Distributional Constraints
Neural conversation models tend to generate safe, generic responses for most inputs.
Auto-Encoding Dictionary Definitions into Consistent Word Embeddings
Monolingual dictionaries are widespread and semantically rich resources.
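A hedged sketch of the underlying intuition: a dictionary definition can be encoded into the same space as word embeddings, and a consistent embedding for the headword should lie close to the encoding of its definition. The averaging encoder below is a stand-in assumption, not the paper's auto-encoder.

```python
# Check consistency between a headword embedding and an encoding of its
# dictionary definition (toy mean-of-word-vectors encoder).
import numpy as np

def encode_definition(definition: str, embeddings: dict) -> np.ndarray:
    """Encode a definition as the mean of its known word vectors (toy encoder)."""
    vecs = [embeddings[w] for w in definition.lower().split() if w in embeddings]
    dim = next(iter(embeddings.values())).shape
    return np.mean(vecs, axis=0) if vecs else np.zeros(dim)

def consistency(headword: str, definition: str, embeddings: dict) -> float:
    """Cosine similarity between a headword vector and its definition encoding."""
    h = embeddings[headword]
    d = encode_definition(definition, embeddings)
    denom = np.linalg.norm(h) * np.linalg.norm(d)
    return float(np.dot(h, d) / denom) if denom else 0.0
```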
Ranked List Loss for Deep Metric Learning
To address this, we propose to build a set-based similarity structure by exploiting all instances in the gallery.
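A simplified sketch of the set-based idea: rather than sampling a single (anchor, positive, negative) triplet, the loss for a query considers every positive and negative instance in the gallery. The margins and hinge form below are illustrative, not the exact Ranked List Loss formulation.

```python
# Hinge-style metric-learning loss over all gallery instances for one query.
import numpy as np

def set_based_loss(query, gallery, labels, query_label,
                   pos_margin=0.5, neg_margin=1.0):
    """Pull all positives within pos_margin, push all negatives beyond neg_margin."""
    loss = 0.0
    for x, y in zip(gallery, labels):
        d = np.linalg.norm(query - x)            # Euclidean distance in embedding space
        if y == query_label:
            loss += max(0.0, d - pos_margin)     # positive too far: penalize
        else:
            loss += max(0.0, neg_margin - d)     # negative too close: penalize
    return loss
```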
A Bilingual Generative Transformer for Semantic Sentence Embedding
Semantic sentence embedding models encode natural language sentences into vectors, such that closeness in embedding space indicates closeness in the semantics between the sentences.
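As a generic usage example of this idea (not the bilingual generative transformer from the paper), a pretrained sentence encoder from the sentence-transformers library maps sentences to vectors whose cosine similarity reflects semantic closeness; the model name is one common choice, not a requirement.

```python
# Semantic similarity between sentences via a pretrained sentence encoder.
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("all-MiniLM-L6-v2")  # any pretrained encoder works
sentences = [
    "A man is playing a guitar.",
    "Someone is strumming an instrument.",
    "The stock market fell sharply today.",
]
embeddings = model.encode(sentences, convert_to_tensor=True)

# Closeness in embedding space indicates closeness in meaning.
print(util.cos_sim(embeddings[0], embeddings[1]))  # high similarity
print(util.cos_sim(embeddings[0], embeddings[2]))  # low similarity
```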