Speaker Verification

171 papers with code • 5 benchmarks • 6 datasets

Speaker verification is the task of verifying a person's claimed identity from characteristics of their voice.
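One common way to frame this decision, offered here only as an illustration and not taken from any paper listed on this page, is to compare a fixed-length embedding of the enrollment audio with an embedding of the test audio and accept the claim when their cosine similarity clears a threshold. The sketch below assumes the embeddings have already been produced by some encoder; the names `cosine_score` and `verify` and the threshold of 0.5 are hypothetical.

```python
import math
from typing import Sequence


def cosine_score(enroll: Sequence[float], test: Sequence[float]) -> float:
    """Cosine similarity between two speaker embeddings of equal length."""
    if len(enroll) != len(test):
        raise ValueError("embeddings must have the same dimension")
    dot = sum(a * b for a, b in zip(enroll, test))
    norm = math.sqrt(sum(a * a for a in enroll)) * math.sqrt(sum(b * b for b in test))
    if norm == 0.0:
        raise ValueError("embeddings must be non-zero")
    return dot / norm


def verify(enroll: Sequence[float], test: Sequence[float], threshold: float = 0.5) -> bool:
    """Accept the identity claim if the score reaches the (hypothetical) threshold."""
    return cosine_score(enroll, test) >= threshold
```

In practice the threshold would be tuned on held-out trials, for example to balance false accepts against false rejects.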


Multi-Dataset Co-Training with Sharpness-Aware Optimization for Audio Anti-spoofing

shimhz/mdl_sharpness 31 May 2023

Audio anti-spoofing for automatic speaker verification aims to safeguard users' identities from spoofing attacks.

4

Towards single integrated spoofing-aware speaker verification embeddings

sasv-challenge/asvspoof5-sasvbaseline 30 May 2023

Second, competitive performance should be demonstrated compared to the fusion of automatic speaker verification (ASV) and countermeasure (CM) embeddings, which outperformed single embedding solutions by a large margin in the SASV2022 challenge.

0

One-Step Knowledge Distillation and Fine-Tuning in Using Large Pre-Trained Self-Supervised Learning Models for Speaker Verification

jungwoo4021/os-kdft 27 May 2023

This paper suggests One-Step Knowledge Distillation and Fine-Tuning (OS-KDFT), which incorporates KD and fine-tuning (FT).

10

An Enhanced Res2Net with Local and Global Feature Fusion for Speaker Verification

alibaba-damo-academy/3D-Speaker 22 May 2023

This paper proposes a novel architecture called Enhanced Res2Net (ERes2Net), which incorporates both local and global feature fusion techniques to improve the performance.

715

ACA-Net: Towards Lightweight Speaker Verification using Asymmetric Cross Attention

yip-jia-qi/aca-net 20 May 2023

In this paper, we propose ACA-Net, a lightweight, global context-aware speaker embedding extractor for Speaker Verification (SV) that improves upon existing work by using Asymmetric Cross Attention (ACA) to replace temporal pooling.

5

CryCeleb: A Speaker Verification Dataset Based on Infant Cry Sounds

ubenwa/cryceleb2023 1 May 2023

This paper describes the Ubenwa CryCeleb dataset - a labeled collection of infant cries - and the accompanying CryCeleb 2023 task, which is a public speaker verification challenge based on cry sounds.

11

DS-TDNN: Dual-stream Time-delay Neural Network with Global-aware Filter for Speaker Verification

ychenl/ds-tdnn 20 Mar 2023

To effectively leverage the long-term dependencies of audio signals and constrain model complexity, we introduce a novel module called Global-aware Filter layer (GF layer) in this work, which employs a set of learnable transform-domain filters between a 1D discrete Fourier transform and its inverse transform to capture global context.

34
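The idea behind the Global-aware Filter layer described in the DS-TDNN entry, multiplying by filter weights between a 1D discrete Fourier transform and its inverse, can be sketched in a few lines. This is a simplified illustration, not the authors' implementation: it handles one real-valued channel, uses a naive O(n²) DFT from the standard library, and takes the filter weights as fixed inputs, whereas in the paper they are learned. The function names are hypothetical.

```python
import cmath
from typing import List, Sequence


def dft(x: Sequence[complex]) -> List[complex]:
    """Naive 1D discrete Fourier transform."""
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n))
            for k in range(n)]


def idft(spectrum: Sequence[complex]) -> List[complex]:
    """Naive inverse of dft()."""
    n = len(spectrum)
    return [sum(spectrum[k] * cmath.exp(2j * cmath.pi * k * t / n) for k in range(n)) / n
            for t in range(n)]


def global_filter(x: Sequence[float], weights: Sequence[complex]) -> List[float]:
    """Scale each frequency bin of x by a weight, then return to the time domain.

    Assumption: weights would be trainable parameters in a real model.
    Keeping only the real part of the result is a simplification.
    """
    if len(weights) != len(x):
        raise ValueError("need one weight per frequency bin")
    filtered = [w * s for w, s in zip(weights, dft(x))]
    return [value.real for value in idft(filtered)]
```

Because every frequency bin depends on all time steps, a single elementwise multiplication in this domain lets each output sample draw on the whole sequence, which is the sense in which such a filter captures global context. For instance, keeping only the zero-frequency bin replaces every sample with the mean of the input.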

Can spoofing countermeasure and speaker verification systems be jointly optimised?

eurecom-asp/sasv-joint-optimisation 13 Mar 2023

Spoofing countermeasure (CM) and automatic speaker verification (ASV) sub-systems can be used in tandem with a backend classifier as a solution to the spoofing-aware speaker verification (SASV) task.

5
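One simple form a backend over CM and ASV outputs could take, shown purely as an assumption and not as the method of the paper above, is a logistic combination of the two sub-system scores. The function name `sasv_backend`, the weights, and the bias are all hypothetical; in a real system they would be fitted on labeled trials.

```python
import math


def sasv_backend(asv_score: float, cm_score: float,
                 w_asv: float = 1.0, w_cm: float = 1.0, bias: float = 0.0) -> float:
    """Map an ASV score and a CM score to one acceptance probability.

    Assumption: higher asv_score means "same speaker" and higher cm_score
    means "bona fide (not spoofed)". Weights and bias are placeholders.
    """
    z = w_asv * asv_score + w_cm * cm_score + bias
    return 1.0 / (1.0 + math.exp(-z))
```

With these placeholder weights, a trial that matches the target speaker but looks spoofed (high ASV score, low CM score) receives a lower probability than one where both scores are high.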

Cross-modal Audio-visual Co-learning for Text-independent Speaker Verification

danielmengliu/audiovisuallip 22 Feb 2023

Visual speech (i.e., lip motion) is highly related to auditory speech due to the co-occurrence and synchronization in speech production.

18

VoxSRC 2022: The Fourth VoxCeleb Speaker Recognition Challenge

jaesunghuh/voxsrc2022 20 Feb 2023

This paper summarises the findings from the VoxCeleb Speaker Recognition Challenge 2022 (VoxSRC-22), which was held in conjunction with INTERSPEECH 2022.

17