Semantic Textual Similarity

560 papers with code • 13 benchmarks • 17 datasets

Semantic textual similarity deals with determining how similar two pieces of texts are. This can take the form of assigning a score from 1 to 5. Related tasks are paraphrase or duplicate identification.

Image source: Learning Semantic Textual Similarity from Conversations

Libraries

Use these libraries to find Semantic Textual Similarity models and implementations

Latest papers with no code

SIFiD: Reassess Summary Factual Inconsistency Detection with LLM

no code yet • 12 Mar 2024

Ensuring factual consistency between the summary and the original document is paramount in summarization tasks.

Knowledge-aware Alert Aggregation in Large-scale Cloud Systems: a Hybrid Approach

no code yet • 11 Mar 2024

We also share our experience in deploying COLA in our real-world cloud system, Cloud X.

Deep Contrastive Multi-view Clustering under Semantic Feature Guidance

no code yet • 9 Mar 2024

To mitigate the interference of view-private information, specific view and fusion view semantic features are learned by cluster-level contrastive learning and concatenated to measure the semantic similarity of instances.

Is Cosine-Similarity of Embeddings Really About Similarity?

no code yet • 8 Mar 2024

Cosine-similarity is the cosine of the angle between two vectors, or equivalently the dot product between their normalizations.

Cross-lingual Transfer or Machine Translation? On Data Augmentation for Monolingual Semantic Textual Similarity

no code yet • 8 Mar 2024

Rather, we find a superiority of the Wikipedia domain over the NLI domain for these languages, in contrast to prior studies that focused on NLI as training data.

Persona Extraction Through Semantic Similarity for Emotional Support Conversation Generation

no code yet • 7 Mar 2024

We devise completeness loss and consistency loss based on semantic similarity scores.

Improving Cross-lingual Representation for Semantic Retrieval with Code-switching

no code yet • 3 Mar 2024

Semantic Retrieval (SR) has become an indispensable part of the FAQ system in the task-oriented question-answering (QA) dialogue scenario.

GPTSee: Enhancing Moment Retrieval and Highlight Detection via Description-Based Similarity Features

no code yet • 3 Mar 2024

First, MiniGPT-4 is employed to generate the detailed description of the video frame and rewrite the query statement, fed into the encoder as new features.

API Is Enough: Conformal Prediction for Large Language Models Without Logit-Access

no code yet • 2 Mar 2024

This study aims to address the pervasive challenge of quantifying uncertainty in large language models (LLMs) without logit-access.

Semantic Text Transmission via Prediction with Small Language Models: Cost-Similarity Trade-off

no code yet • 1 Mar 2024

We obtain $(\bar{c}, \bar{s})$ pairs for neural language and first-order Markov chain-based small language models (SLM) for prediction, using both a threshold policy that transmits a word if its cosine similarity with that predicted/completed at the destination is below a threshold, and a periodic policy, which transmits words after a specific interval and predicts/completes the words in between, at the destination.