Semantic Textual Similarity

560 papers with code • 13 benchmarks • 17 datasets

Semantic textual similarity deals with determining how similar two pieces of texts are. This can take the form of assigning a score from 1 to 5. Related tasks are paraphrase or duplicate identification.

Image source: Learning Semantic Textual Similarity from Conversations

Benchmarks

Add a Result

These leaderboards are used to track progress in Semantic Textual Similarity

Dataset	Best Model	Compare
STS Benchmark	MT-DNN-SMART	See all
MRPC	MT-DNN-SMART	See all
MTEB	ST5-XXL	See all
STS13	AnglE-LLaMA-13B	See all
SICK	PromCSE-RoBERTa-large (0.355B)	See all
STS12	PromptEOL+CSE+OPT-13B	See all
STS14	AnglE-LLaMA-13B	See all
STS15	AnglE-LLaMA-13B	See all
STS16	AnglE-LLaMA-13B	See all
SentEval	GenSen	See all
CxC	PromCSE-RoBERTa-large (0.355B)	See all
SICK-R	AnglE-LLaMA-7B	See all
MRPC Dev	Synthesizer (R+V)	See all

Show all 13 benchmarks

Collapse benchmarks

Libraries

Use these libraries to find Semantic Textual Similarity models and implementations

huggingface/transformers

9 papers

125,478

facebookresearch/xformers

3 papers

7,638

facebookresearch/InferSent

3 papers

2,279

namisan/mt-dnn

3 papers

2,203

See all 11 libraries.

Datasets

Subtasks

Latest papers with no code

Most implemented Social Latest No code

SIFiD: Reassess Summary Factual Inconsistency Detection with LLM

no code yet • 12 Mar 2024

Ensuring factual consistency between the summary and the original document is paramount in summarization tasks.

Paper
Add Code

Knowledge-aware Alert Aggregation in Large-scale Cloud Systems: a Hybrid Approach

no code yet • 11 Mar 2024

We also share our experience in deploying COLA in our real-world cloud system, Cloud X.

Paper
Add Code

Deep Contrastive Multi-view Clustering under Semantic Feature Guidance

no code yet • 9 Mar 2024

To mitigate the interference of view-private information, specific view and fusion view semantic features are learned by cluster-level contrastive learning and concatenated to measure the semantic similarity of instances.

Paper
Add Code

Is Cosine-Similarity of Embeddings Really About Similarity?

no code yet • 8 Mar 2024

Cosine-similarity is the cosine of the angle between two vectors, or equivalently the dot product between their normalizations.

Paper
Add Code

Cross-lingual Transfer or Machine Translation? On Data Augmentation for Monolingual Semantic Textual Similarity

no code yet • 8 Mar 2024

Rather, we find a superiority of the Wikipedia domain over the NLI domain for these languages, in contrast to prior studies that focused on NLI as training data.

Paper
Add Code

Persona Extraction Through Semantic Similarity for Emotional Support Conversation Generation

no code yet • 7 Mar 2024

We devise completeness loss and consistency loss based on semantic similarity scores.

Paper
Add Code

Improving Cross-lingual Representation for Semantic Retrieval with Code-switching

no code yet • 3 Mar 2024

Semantic Retrieval (SR) has become an indispensable part of the FAQ system in the task-oriented question-answering (QA) dialogue scenario.

Paper
Add Code

GPTSee: Enhancing Moment Retrieval and Highlight Detection via Description-Based Similarity Features

no code yet • 3 Mar 2024

First, MiniGPT-4 is employed to generate the detailed description of the video frame and rewrite the query statement, fed into the encoder as new features.

Paper
Add Code

API Is Enough: Conformal Prediction for Large Language Models Without Logit-Access

no code yet • 2 Mar 2024

This study aims to address the pervasive challenge of quantifying uncertainty in large language models (LLMs) without logit-access.

Paper
Add Code

Semantic Text Transmission via Prediction with Small Language Models: Cost-Similarity Trade-off

no code yet • 1 Mar 2024

We obtain $(\bar{c}, \bar{s})$ pairs for neural language and first-order Markov chain-based small language models (SLM) for prediction, using both a threshold policy that transmits a word if its cosine similarity with that predicted/completed at the destination is below a threshold, and a periodic policy, which transmits words after a specific interval and predicts/completes the words in between, at the destination.

Paper
Add Code

Semantic Textual Similarity

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Latest papers with no code

Content

Benchmarks

Add a Result