Browse SoTA > Speech > Speaker Verification

Speaker Verification

33 papers with code · Speech

Speaker verification is the verifying the identity of a person from characteristics of the voice.

( Image credit: Contrastive-Predictive-Coding-PyTorch )

Benchmarks

No evaluation results yet. Help compare methods by submit evaluation metrics.

Greatest papers with code

Unsupervised Learning of Disentangled and Interpretable Representations from Sequential Data

NeurIPS 2017 wnhsu/FactorizedHierarchicalVAE

We present a factorized hierarchical variational autoencoder, which learns disentangled and interpretable representations from sequential data without supervision.

SPEAKER VERIFICATION SPEECH RECOGNITION

An Unsupervised Autoregressive Model for Speech Representation Learning

5 Apr 2019iamyuanchung/Autoregressive-Predictive-Coding

This paper proposes a novel unsupervised autoregressive neural model for learning generic speech representations.

REPRESENTATION LEARNING SPEAKER VERIFICATION

Improved RawNet with Feature Map Scaling for Text-independent Speaker Verification using Raw Waveforms

1 Apr 2020Jungjee/RawNet

Recent advances in deep learning have facilitated the design of speaker verification systems that directly input raw waveforms.

TEXT-INDEPENDENT SPEAKER VERIFICATION

RawNet: Advanced end-to-end deep neural network using raw waveforms for text-independent speaker verification

17 Apr 2019Jungjee/RawNet

In this study, we explore end-to-end deep neural networks that input raw waveforms to improve various aspects: front-end speaker embedding extraction including model architecture, pre-training scheme, additional objective functions, and back-end classification.

DATA AUGMENTATION TEXT-INDEPENDENT SPEAKER VERIFICATION

AutoSpeech: Neural Architecture Search for Speaker Recognition

7 May 2020VITA-Group/AutoSpeech

Speaker recognition systems based on Convolutional Neural Networks (CNNs) are often built with off-the-shelf backbones such as VGG-Net or ResNet.

IMAGE CLASSIFICATION NEURAL ARCHITECTURE SEARCH SPEAKER IDENTIFICATION SPEAKER RECOGNITION SPEAKER VERIFICATION

NPLDA: A Deep Neural PLDA Model for Speaker Verification

10 Feb 2020iiscleap/NeuralPlda

The likelihood ratio score of the generative PLDA model is posed as a discriminative similarity function and the learnable parameters of the score function are optimized using a verification cost.

SPEAKER RECOGNITION SPEAKER VERIFICATION

Pairwise Discriminative Neural PLDA for Speaker Verification

20 Jan 2020iiscleap/NeuralPlda

The pre-processing steps of linear discriminant analysis (LDA), unit length normalization and within class covariance normalization are all modeled as layers of a neural model and the speaker verification cost functions can be back-propagated through these layers during training.

SPEAKER VERIFICATION

Scalable Factorized Hierarchical Variational Autoencoder Training

9 Apr 2018wnhsu/ScalableFHVAE

Deep generative models have achieved great success in unsupervised learning with the ability to capture complex nonlinear relationships between latent generating factors and observations.

HYPERPARAMETER OPTIMIZATION ROBUST SPEECH RECOGNITION SPEAKER VERIFICATION VOICE CONVERSION

Deep Residual Neural Networks for Audio Spoofing Detection

30 Jun 2019nesl/asvspoof2019

Additionally, replay attacks where the attacker uses a speaker to replay a previously recorded genuine human speech are also possible.

SPEAKER VERIFICATION SPEECH SYNTHESIS VOICE CONVERSION

Attentive Filtering Networks for Audio Replay Attack Detection

31 Oct 2018jefflai108/Attentive-Filtering-Network

In this work, we propose our replay attacks detection system - Attentive Filtering Network, which is composed of an attention-based filtering mechanism that enhances feature representations in both the frequency and time domains, and a ResNet-based classifier.

SPEAKER VERIFICATION