Semantic Textual Similarity

546 papers with code • 13 benchmarks • 17 datasets

Semantic textual similarity deals with determining how similar two pieces of texts are. This can take the form of assigning a score from 1 to 5. Related tasks are paraphrase or duplicate identification.

Image source: Learning Semantic Textual Similarity from Conversations

Libraries

Use these libraries to find Semantic Textual Similarity models and implementations

Latest papers with no code

Attention-aware semantic relevance predicting Chinese sentence reading

no code yet • 27 Mar 2024

Our approach underscores the potential of these metrics to advance our comprehension of how humans understand and process language, ultimately leading to a better understanding of language comprehension and processing.

DELTA: Pre-train a Discriminative Encoder for Legal Case Retrieval via Structural Word Alignment

no code yet • 27 Mar 2024

Most of the existing works focus on improving the representation ability for the contextualized embedding of the [CLS] token and calculate relevance using textual semantic similarity.

Evaluation of Semantic Search and its Role in Retrieved-Augmented-Generation (RAG) for Arabic Language

no code yet • 27 Mar 2024

The latest advancements in machine learning and deep learning have brought forth the concept of semantic similarity, which has proven immensely beneficial in multiple applications and has largely replaced keyword search.

Exploiting Semantic Reconstruction to Mitigate Hallucinations in Vision-Language Models

no code yet • 24 Mar 2024

Subsequently, ESREAL computes token-level hallucination scores by assessing the semantic similarity of aligned regions based on the type of hallucination.

Connecting the Dots: Inferring Patent Phrase Similarity with Retrieved Phrase Graphs

no code yet • 24 Mar 2024

We study the patent phrase similarity inference task, which measures the semantic similarity between two patent phrases.

Beyond Surface Similarity: Detecting Subtle Semantic Shifts in Financial Narratives

no code yet • 21 Mar 2024

In this paper, we introduce the Financial-STS task, a financial domain-specific NLP task designed to measure the nuanced semantic similarity between pairs of financial narratives.

RobustSentEmbed: Robust Sentence Embeddings Using Adversarial Self-Supervised Contrastive Learning

no code yet • 17 Mar 2024

In this paper, we introduce RobustSentEmbed, a self-supervised sentence embedding framework designed to improve both generalization and robustness in diverse text representation tasks and against a diverse set of adversarial attacks.

A Modified Word Saliency-Based Adversarial Attack on Text Classification Models

no code yet • 17 Mar 2024

This paper introduces a novel adversarial attack method targeting text classification models, termed the Modified Word Saliency-based Adversarial At-tack (MWSAA).

SIFiD: Reassess Summary Factual Inconsistency Detection with LLM

no code yet • 12 Mar 2024

Ensuring factual consistency between the summary and the original document is paramount in summarization tasks.

Knowledge-aware Alert Aggregation in Large-scale Cloud Systems: a Hybrid Approach

no code yet • 11 Mar 2024

We also share our experience in deploying COLA in our real-world cloud system, Cloud X.