Speaker Verification
171 papers with code • 5 benchmarks • 6 datasets
Speaker verification is the task of verifying the identity of a person from the characteristics of their voice.
Libraries
Use these libraries to find Speaker Verification models and implementations.
Latest papers
Multi-Dataset Co-Training with Sharpness-Aware Optimization for Audio Anti-spoofing
Audio anti-spoofing for automatic speaker verification aims to safeguard users' identities from spoofing attacks.
Towards single integrated spoofing-aware speaker verification embeddings
Second, competitive performance should be demonstrated compared to the fusion of automatic speaker verification (ASV) and countermeasure (CM) embeddings, which outperformed single embedding solutions by a large margin in the SASV2022 challenge.
One-Step Knowledge Distillation and Fine-Tuning in Using Large Pre-Trained Self-Supervised Learning Models for Speaker Verification
This paper suggests One-Step Knowledge Distillation and Fine-Tuning (OS-KDFT), which incorporates KD and fine-tuning (FT).
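The core idea of combining knowledge distillation with task fine-tuning in a single optimisation step can be illustrated with a joint loss. This is a minimal sketch, not the actual OS-KDFT objective: the weighting scheme, temperature, and architecture in the paper may differ, and `alpha` is an illustrative hyperparameter.

```python
import numpy as np

def softmax(z):
    # numerically stable softmax over a 1-D logit vector
    z = z - z.max()
    e = np.exp(z)
    return e / e.sum()

def joint_kd_ft_loss(student_logits, teacher_logits, speaker_logits, label, alpha=0.5):
    """One combined objective: a distillation term (match the teacher's
    soft targets) plus a speaker-classification fine-tuning term.
    The exact formulation in OS-KDFT is not reproduced here (assumption)."""
    # cross-entropy between teacher and student distributions
    kd = -np.sum(softmax(teacher_logits) * np.log(softmax(student_logits) + 1e-12))
    # standard cross-entropy on the speaker label
    ce = -np.log(softmax(speaker_logits)[label] + 1e-12)
    return alpha * kd + (1 - alpha) * ce

loss = joint_kd_ft_loss(np.array([1.0, 2.0]), np.array([1.5, 1.8]),
                        np.array([0.2, 3.0]), label=1)
```

Optimising both terms in one pass is what removes the separate distillation-then-fine-tune stage.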
An Enhanced Res2Net with Local and Global Feature Fusion for Speaker Verification
This paper proposes a novel architecture called Enhanced Res2Net (ERes2Net), which incorporates both local and global feature fusion techniques to improve the performance.
ACA-Net: Towards Lightweight Speaker Verification using Asymmetric Cross Attention
In this paper, we propose ACA-Net, a lightweight, global context-aware speaker embedding extractor for Speaker Verification (SV) that improves upon existing work by using Asymmetric Cross Attention (ACA) to replace temporal pooling.
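Replacing temporal pooling with cross attention means a small, fixed set of learnable queries attends over all frames, so the embedding size no longer depends on utterance length. The single-head form and shapes below are a simplified sketch of that idea, not the ACA-Net architecture itself.

```python
import numpy as np

def softmax(z, axis=-1):
    z = z - z.max(axis=axis, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)

def cross_attention_pool(frames, queries):
    """Pool a variable-length frame sequence with cross attention.

    frames:  (T, C) frame-level features, T varies per utterance
    queries: (Q, C) learnable query vectors, Q is fixed (Q << T,
             hence 'asymmetric' in this sketch's sense)
    returns: (Q, C) utterance-level representation, independent of T
    """
    scores = queries @ frames.T / np.sqrt(frames.shape[1])  # (Q, T)
    weights = softmax(scores, axis=1)                       # attend over time
    return weights @ frames                                 # (Q, C)

pooled_short = cross_attention_pool(np.random.randn(50, 64), np.zeros((4, 64)))
pooled_long = cross_attention_pool(np.random.randn(300, 64), np.zeros((4, 64)))
```

Unlike mean or statistics pooling, the queries can learn to focus on different temporal regions.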
CryCeleb: A Speaker Verification Dataset Based on Infant Cry Sounds
This paper describes the Ubenwa CryCeleb dataset - a labeled collection of infant cries - and the accompanying CryCeleb 2023 task, which is a public speaker verification challenge based on cry sounds.
DS-TDNN: Dual-stream Time-delay Neural Network with Global-aware Filter for Speaker Verification
To effectively leverage the long-term dependencies of audio signals and constrain model complexity, we introduce a novel module called Global-aware Filter layer (GF layer) in this work, which employs a set of learnable transform-domain filters between a 1D discrete Fourier transform and its inverse transform to capture global context.
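The mechanism described, learnable filters applied between a 1-D DFT and its inverse, amounts to element-wise multiplication in the frequency domain, which mixes information across the whole sequence at once. A minimal sketch follows; the filter parameterisation and placement inside DS-TDNN are assumptions, not the paper's exact design.

```python
import numpy as np

def global_filter_layer(x, filt):
    """Filter a feature sequence in the transform domain.

    x:    (T, C) real-valued features (time x channels)
    filt: (T//2 + 1, C) complex per-frequency weights, standing in for
          the learnable transform-domain filters (illustrative shape)
    """
    X = np.fft.rfft(x, axis=0)      # 1-D DFT along the time axis
    Y = X * filt                    # element-wise filtering = global context mixing
    return np.fft.irfft(Y, n=x.shape[0], axis=0)  # inverse DFT back to time

T, C = 8, 2
x = np.random.randn(T, C)
identity = np.ones((T // 2 + 1, C), dtype=complex)  # all-pass filter
y = global_filter_layer(x, identity)
```

Because every frequency bin depends on all time steps, one such layer captures long-range dependencies at O(T log T) cost rather than the quadratic cost of self-attention.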
Can spoofing countermeasure and speaker verification systems be jointly optimised?
Spoofing countermeasure (CM) and automatic speaker verification (ASV) sub-systems can be used in tandem with a backend classifier as a solution to the spoofing aware speaker verification (SASV) task.
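A tandem CM + ASV system can be sketched as a gating decision: the countermeasure first screens for spoofing, and only bona fide trials fall through to the speaker-verification decision. The thresholds and the hard cascade below are illustrative assumptions; actual SASV backends typically learn a classifier over both scores.

```python
def sasv_decide(asv_score, cm_score, asv_thr=0.5, cm_thr=0.5):
    """Tandem decision rule: reject if the countermeasure flags a spoof,
    otherwise fall back to the ASV accept/reject decision.
    Thresholds are hypothetical placeholders, not calibrated values."""
    if cm_score < cm_thr:        # CM says spoofed audio -> reject outright
        return False
    return asv_score >= asv_thr  # bona fide -> ordinary speaker decision

accept = sasv_decide(asv_score=0.9, cm_score=0.8)
reject_spoof = sasv_decide(asv_score=0.9, cm_score=0.2)
```

Jointly optimising the two sub-systems, as the paper asks, means training them so that the combined decision, rather than each score in isolation, is optimal.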
Cross-modal Audio-visual Co-learning for Text-independent Speaker Verification
Visual speech (i.e., lip motion) is highly correlated with auditory speech due to their co-occurrence and synchronization in speech production.
VoxSRC 2022: The Fourth VoxCeleb Speaker Recognition Challenge
This paper summarises the findings from the VoxCeleb Speaker Recognition Challenge 2022 (VoxSRC-22), which was held in conjunction with INTERSPEECH 2022.