Speaker Verification

171 papers with code • 5 benchmarks • 6 datasets

Speaker verification is the verifying the identity of a person from characteristics of the voice.

( Image credit: Contrastive-Predictive-Coding-PyTorch )

Benchmarks

Add a Result

These leaderboards are used to track progress in Speaker Verification

Dataset	Best Model	Compare
VoxCeleb	WavLM+ECAPA-TDNN	See all
CN-CELEB	X-Vectors with Attention Backend	See all
CALLHOME	GE2E	See all
VoxCeleb1	SpeechNAS	See all
VoxCeleb2	ResNet-50	See all

Libraries

Use these libraries to find Speaker Verification models and implementations

PaddlePaddle/PaddleSpeech

5 papers

10,186

alibaba-damo-academy/3D-Speaker

4 papers

717

Jungjee/RawNet

4 papers

332

CorentinJ/Real-Time-Voice-Cloning

2 papers

50,871

See all 9 libraries.

Datasets

Subtasks

Latest papers with no code

Most implemented Social Latest No code

Probing Self-supervised Learning Models with Target Speech Extraction

no code yet • 17 Feb 2024

TSE uniquely requires both speaker identification and speech separation, distinguishing it from other tasks in the Speech processing Universal PERformance Benchmark (SUPERB) evaluation.

Paper
Add Code

LightCAM: A Fast and Light Implementation of Context-Aware Masking based D-TDNN for Speaker Verification

no code yet • 8 Feb 2024

Traditional Time Delay Neural Networks (TDNN) have achieved state-of-the-art performance at the cost of high computational complexity and slower inference speed, making them difficult to implement in an industrial environment.

Paper
Add Code

Adversarial Data Augmentation for Robust Speaker Verification

no code yet • 5 Feb 2024

This adversarial learning empowers the network to generate speaker embeddings that can deceive the augmentation classifier, making the learned speaker embeddings more robust in the face of augmentation variations.

Paper
Add Code

Enhancement of a Text-Independent Speaker Verification System by using Feature Combination and Parallel-Structure Classifiers

no code yet • 26 Jan 2024

In this work, we propose the combination of two SVM-based classifiers with different kernel functions: Linear kernel and Gaussian Radial Basis Function (RBF) kernel with a Logistic Regression (LR) classifier.

Paper
Add Code

Adversarial speech for voice privacy protection from Personalized Speech generation

no code yet • 22 Jan 2024

For validation, we employ the open-source pre-trained YourTTS model for speech generation and protect the target speaker's speech in the white-box scenario.

Paper
Add Code

Empowering Communication: Speech Technology for Indian and Western Accents through AI-powered Speech Synthesis

no code yet • 22 Jan 2024

The architecture of the system comprises a speaker verification system, a synthesizer, a vocoder, and noise reduction.

Paper
Add Code

Generalizing Speaker Verification for Spoof Awareness in the Embedding Space

no code yet • 20 Jan 2024

To this end, we propose to generalize the standalone ASV (G-SASV) against spoofing attacks, where we leverage limited training data from CM to enhance a simple backend in the embedding space, without the involvement of a separate CM module during the test (authentication) phase.

Paper
Add Code

ECAPA2: A Hybrid Neural Network Architecture and Training Strategy for Robust Speaker Embeddings

no code yet • 16 Jan 2024

In this paper, we present ECAPA2, a novel hybrid neural network architecture and training strategy to produce robust speaker embeddings.

Paper
Add Code

Exploratory Evaluation of Speech Content Masking

no code yet • 8 Jan 2024

Most recent speech privacy efforts have focused on anonymizing acoustic speaker attributes but there has not been as much research into protecting information from speech content.

Paper
Add Code

VOT: Revolutionizing Speaker Verification with Memory and Attention Mechanisms

no code yet • 28 Dec 2023

Speaker verification is to judge the similarity of two unknown voices in an open set, where the ideal speaker embedding should be able to condense discriminant information into a compact utterance-level representation that has small intra-speaker distances and large inter-speaker distances. We propose a novel model named Voice Transformer(VOT) for speaker verification.

Paper
Add Code

Speaker Verification

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Latest papers with no code

Content

Benchmarks

Add a Result