Semantic Textual Similarity

556 papers with code • 13 benchmarks • 17 datasets

Semantic textual similarity deals with determining how similar two pieces of texts are. This can take the form of assigning a score from 1 to 5. Related tasks are paraphrase or duplicate identification.

Image source: Learning Semantic Textual Similarity from Conversations

Libraries

Use these libraries to find Semantic Textual Similarity models and implementations

A Collection of Pragmatic-Similarity Judgments over Spoken Dialog Utterances

divettemarco/pragsim 21 Mar 2024

While there exist measures for semantic similarity and prosodic similarity, there are as yet none for pragmatic similarity.

0
21 Mar 2024

Learning to Rematch Mismatched Pairs for Robust Cross-Modal Retrieval

hhc1997/l2rm 8 Mar 2024

To achieve this, we propose L2RM, a general framework based on Optimal Transport (OT) that learns to rematch mismatched pairs.

10
08 Mar 2024

SAM-PD: How Far Can SAM Take Us in Tracking and Segmenting Anything in Videos by Prompt Denoising

infzhou/sam-pd 7 Mar 2024

Recently, promptable segmentation models, such as the Segment Anything Model (SAM), have demonstrated robust zero-shot generalization capabilities on static images.

4
07 Mar 2024

EAGLE: Eigen Aggregation Learning for Object-Centric Unsupervised Semantic Segmentation

MICV-yonsei/EAGLE 3 Mar 2024

Semantic segmentation has innately relied on extensive pixel-level annotated data, leading to the emergence of unsupervised methodologies.

35
03 Mar 2024

NextLevelBERT: Investigating Masked Language Modeling with Higher-Level Representations for Long Documents

aiintelligentsystems/next-level-bert 27 Feb 2024

While (large) language models have significantly improved over the last years, they still struggle to sensibly process long sequences found, e. g., in books, due to the quadratic scaling of the underlying attention mechanism.

4
27 Feb 2024

The Impact of Word Splitting on the Semantic Content of Contextualized Word Representations

ainagari/splitsim 22 Feb 2024

When deriving contextualized word representations from language models, a decision needs to be made on how to obtain one for out-of-vocabulary (OOV) words that are segmented into subwords.

1
22 Feb 2024

DrBenchmark: A Large Language Understanding Evaluation Benchmark for French Biomedical Domain

drbenchmark/drbenchmark 20 Feb 2024

This limitation hampers the evaluation of the latest French biomedical models, as they are either assessed on a minimal number of tasks with non-standardized protocols or evaluated using general downstream tasks.

3
20 Feb 2024

UMBCLU at SemEval-2024 Task 1A and 1C: Semantic Textual Relatedness with and without machine translation

dipta007/semeval24-task8 20 Feb 2024

The aim of SemEval-2024 Task 1, "Semantic Textual Relatedness for African and Asian Languages" is to develop models for identifying semantic textual relatedness (STR) between two sentences using multiple languages (14 African and Asian languages) and settings (supervised, unsupervised, and cross-lingual).

0
20 Feb 2024

Semantic Textual Similarity Assessment in Chest X-ray Reports Using a Domain-Specific Cosine-Based Metric

sayeh1994/medical-corpus-semantic-similarity-evaluation 19 Feb 2024

Medical language processing and deep learning techniques have emerged as critical tools for improving healthcare, particularly in the analysis of medical imaging and medical text data.

1
19 Feb 2024

SemRel2024: A Collection of Semantic Textual Relatedness Datasets for 14 Languages

semantic-textual-relatedness/semantic_relatedness_semeval2024 13 Feb 2024

Exploring and quantifying semantic relatedness is central to representing language.

23
13 Feb 2024