Speaker Verification
170 papers with code • 5 benchmarks • 6 datasets
Speaker verification is the task of verifying a person's claimed identity from the characteristics of their voice.
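In practice, most modern systems extract a fixed-dimensional speaker embedding from each utterance and compare an enrollment embedding against a test embedding, accepting the identity claim when their similarity exceeds a threshold. A minimal sketch of that scoring step (the embeddings and the threshold value are illustrative assumptions, not taken from any specific paper below):

```python
import numpy as np

def cosine_score(enroll_embeddings, test_embedding):
    """Score a trial as the cosine similarity between the averaged,
    length-normalized enrollment embeddings and the test embedding."""
    enroll = np.mean(
        [e / np.linalg.norm(e) for e in enroll_embeddings], axis=0
    )
    enroll /= np.linalg.norm(enroll)
    test = test_embedding / np.linalg.norm(test_embedding)
    return float(np.dot(enroll, test))

def verify(enroll_embeddings, test_embedding, threshold=0.7):
    """Accept the identity claim if the score exceeds the threshold.
    The threshold here is a hypothetical value; real systems tune it
    on a development set to trade off false accepts and rejects."""
    return cosine_score(enroll_embeddings, test_embedding) >= threshold
```

The embeddings themselves would come from a trained speaker encoder (e.g. an x-vector or similar network); the backend shown here is plain cosine scoring.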
Libraries
Use these libraries to find Speaker Verification models and implementations

Most implemented papers
VoxCeleb2: Deep Speaker Recognition
The objective of this paper is speaker recognition under noisy and unconstrained conditions.
Multiobjective Optimization Training of PLDA for Speaker Verification
Most current state-of-the-art text-independent speaker verification systems take probabilistic linear discriminant analysis (PLDA) as their backend classifiers.
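A PLDA backend scores each verification trial as a log-likelihood ratio between the same-speaker and different-speaker hypotheses. A minimal two-covariance sketch of that score (the covariance matrices `B` and `W` would be estimated from training embeddings; everything here is an illustrative simplification, not the method of the paper above):

```python
import numpy as np

def gaussian_logpdf(x, cov):
    """Log-density of a zero-mean multivariate Gaussian."""
    d = x.shape[0]
    _, logdet = np.linalg.slogdet(cov)
    quad = x @ np.linalg.solve(cov, x)
    return -0.5 * (d * np.log(2 * np.pi) + logdet + quad)

def plda_llr(x1, x2, B, W):
    """Two-covariance PLDA log-likelihood ratio for one trial.

    B: between-speaker covariance, W: within-speaker covariance.
    Same-speaker hypothesis: both embeddings share one latent speaker
    variable, giving cross-covariance B between them; different-speaker
    hypothesis: the two embeddings are independent."""
    T = B + W
    x = np.concatenate([x1, x2])
    cov_same = np.block([[T, B], [B, T]])
    cov_diff = np.block([[T, np.zeros_like(B)], [np.zeros_like(B), T]])
    return gaussian_logpdf(x, cov_same) - gaussian_logpdf(x, cov_diff)
```

A positive score favors the same-speaker hypothesis; in deployed systems the raw score is then calibrated before thresholding.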
DELTA: A DEep learning based Language Technology plAtform
In this paper we present DELTA, a deep learning based language technology platform.
Personal VAD: Speaker-Conditioned Voice Activity Detection
In this paper, we propose "personal VAD", a system to detect the voice activity of a target speaker at the frame level.
Adversarial Attacks on GMM i-vector based Speaker Verification Systems
Experiment results show that GMM i-vector systems are seriously vulnerable to adversarial attacks, and the crafted adversarial samples prove to be transferable and pose threats to neural-network speaker-embedding systems (e.g. x-vector systems).
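Such attacks are typically gradient-based; the fast gradient sign method (FGSM) is the simplest instance. A toy sketch on a linear scorer (the linear model stands in for the actual GMM i-vector pipeline, which is far more involved; `w`, `b`, and `eps` are illustrative assumptions):

```python
import numpy as np

def score(x, w, b):
    """Toy linear verification scorer standing in for a real backend."""
    return float(w @ x + b)

def fgsm_attack(x, w, b, eps):
    """FGSM: perturb an impostor's feature vector in the direction that
    increases the verification score, bounded by eps in the L-infinity
    norm.  For a linear scorer d(score)/dx = w, so the perturbation is
    eps * sign(w)."""
    return x + eps * np.sign(w)
```

With a nonlinear model the gradient would be obtained by backpropagation instead of read off analytically, but the sign-and-step structure of the attack is the same.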
A Speaker Verification Backend for Improved Calibration Performance across Varying Conditions
In a recent work, we presented a discriminative backend for speaker verification that achieved good out-of-the-box calibration performance on most tested conditions containing varying levels of mismatch to the training conditions.
Improved RawNet with Feature Map Scaling for Text-independent Speaker Verification using Raw Waveforms
Recent advances in deep learning have facilitated the design of speaker verification systems that directly input raw waveforms.
Crossed-Time Delay Neural Network for Speaker Recognition
Time Delay Neural Network (TDNN) is a well-performing structure for DNN-based speaker recognition systems.
FragmentVC: Any-to-Any Voice Conversion by End-to-End Extracting and Fusing Fine-Grained Voice Fragments With Attention
Any-to-any voice conversion aims to convert the voice from and to any speakers even unseen during training, which is much more challenging compared to one-to-one or many-to-many tasks, but much more attractive in real-world scenarios.
Channel-wise Gated Res2Net: Towards Robust Detection of Synthetic Speech Attacks
This argument motivates the current work that presents a novel, channel-wise gated Res2Net (CG-Res2Net), which modifies Res2Net to enable a channel-wise gating mechanism in the connection between feature groups.
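Generically, a channel-wise gate pools each channel, maps the pooled statistics through a small transform, and rescales the channels by a sigmoid factor in (0, 1). A minimal squeeze-and-excitation-style sketch of that mechanism (not the exact CG-Res2Net connection; `W` and `b` are hypothetical gate parameters that would be learned):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def channel_gate(x, W, b):
    """Channel-wise gating: average-pool each channel over time, compute
    per-channel gates from the pooled vector, and scale every channel by
    its gate.  x has shape (channels, time)."""
    pooled = x.mean(axis=1)          # (channels,)
    g = sigmoid(W @ pooled + b)      # (channels,) gates in (0, 1)
    return x * g[:, None]
```

Because each gate lies strictly between 0 and 1, the mechanism can only attenuate channels, letting the network suppress feature groups that carry artifacts.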