Browse SoTA > Speech > Speaker Recognition

Speaker Recognition

22 papers with code · Speech

Speaker Recognition is the process of identifying or confirming the identity of a person given his speech segments.

Source: Margin Matters: Towards More Discriminative Deep Neural Network Embeddings for Speaker Recognition

Benchmarks

No evaluation results yet. Help compare methods by submit evaluation metrics.

Greatest papers with code

Speech and Speaker Recognition from Raw Waveform with SincNet

13 Dec 2018mravanelli/SincNet

Deep neural networks can learn complex and abstract representations, that are progressively obtained by combining simpler ones.

SPEAKER RECOGNITION SPEECH RECOGNITION

Speaker Recognition from Raw Waveform with SincNet

29 Jul 2018mravanelli/SincNet

Rather than employing standard hand-crafted features, the latter CNNs learn low-level speech representations from waveforms, potentially allowing the network to better capture important narrow-band speaker characteristics such as pitch and formants.

SPEAKER IDENTIFICATION SPEAKER RECOGNITION SPEAKER VERIFICATION

VoiceFilter: Targeted Voice Separation by Speaker-Conditioned Spectrogram Masking

11 Oct 2018mindslab-ai/voicefilter

In this paper, we present a novel system that separates the voice of a target speaker from multi-speaker signals, by making use of a reference signal from the target speaker.

SPEAKER RECOGNITION SPEAKER SEPARATION SPEECH ENHANCEMENT SPEECH RECOGNITION

Filterbank design for end-to-end speech separation

23 Oct 2019mpariente/AsSteroid

Also, we validate the use of parameterized filterbanks and show that complex-valued representations and masks are beneficial in all conditions.

SPEAKER RECOGNITION SPEECH SEPARATION

Deep Speaker: an End-to-End Neural Speaker Embedding System

5 May 2017philipperemy/deep-speaker

We present Deep Speaker, a neural speaker embedding system that maps utterances to a hypersphere where speaker similarity is measured by cosine similarity.

SPEAKER IDENTIFICATION SPEAKER RECOGNITION

VoxCeleb2: Deep Speaker Recognition

14 Jun 2018a-nagrani/VGGVox

The objective of this paper is speaker recognition under noisy and unconstrained conditions.

SPEAKER RECOGNITION

TERA: Self-Supervised Learning of Transformer Encoder Representation for Speech

12 Jul 2020andi611/Self-Supervised-Speech-Pretraining-and-Representation-Learning

In our experiments, we show that through alteration along different dimensions, the model learns to encode distinct aspects of speech.

SELF-SUPERVISED LEARNING SPEAKER RECOGNITION SPEECH RECOGNITION TRANSFER LEARNING

Mockingjay: Unsupervised Speech Representation Learning with Deep Bidirectional Transformer Encoders

25 Oct 2019andi611/Self-Supervised-Speech-Pretraining-and-Representation-Learning

We present Mockingjay as a new speech representation learning approach, where bidirectional Transformer encoders are pre-trained on a large amount of unlabeled speech.

REPRESENTATION LEARNING SENTIMENT ANALYSIS SPEAKER RECOGNITION

Utterance-level Aggregation For Speaker Recognition In The Wild

None 2019 taylorlu/Speaker-Diarization

The objective of this paper is speaker recognition "in the wild"-where utterances may be of variable length and also contain irrelevant signals.

SPEAKER RECOGNITION TEXT-INDEPENDENT SPEAKER VERIFICATION

AutoSpeech: Neural Architecture Search for Speaker Recognition

7 May 2020VITA-Group/AutoSpeech

Speaker recognition systems based on Convolutional Neural Networks (CNNs) are often built with off-the-shelf backbones such as VGG-Net or ResNet.

IMAGE CLASSIFICATION NEURAL ARCHITECTURE SEARCH SPEAKER IDENTIFICATION SPEAKER RECOGNITION SPEAKER VERIFICATION