Speaker Identification
61 papers with code • 4 benchmarks • 4 datasets
Latest papers with no code
TIMIT Speaker Profiling: A Comparison of Multi-task learning and Single-task learning Approaches
This study employs deep learning techniques to explore four speaker profiling tasks on the TIMIT dataset, namely gender classification, accent classification, age estimation, and speaker identification, highlighting the potential and challenges of multi-task learning versus single-task models.
Removing Speaker Information from Speech Representation using Variable-Length Soft Pooling
Recently, there have been efforts to encode the linguistic information of speech using a self-supervised framework for speech synthesis.
Neural Networks Hear You Loud And Clear: Hearing Loss Compensation Using Deep Neural Networks
In this study, we propose a DNN-based approach for hearing-loss compensation, which is trained on the outputs of hearing-impaired and normal-hearing DNN-based auditory models in response to speech signals.
A Closer Look at Wav2Vec2 Embeddings for On-Device Single-Channel Speech Enhancement
Self-supervised learned models have been found to be very effective for certain speech tasks such as automatic speech recognition, speaker identification, keyword spotting and others.
Unraveling Adversarial Examples against Speaker Identification -- Techniques for Attack Detection and Victim Model Classification
In this paper, we propose a method to detect the presence of adversarial examples, i.e., a binary classifier distinguishing between benign and adversarial examples.
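The detection setup described above can be sketched as follows. This is a hypothetical illustration, not the paper's implementation: the feature vectors are random stand-ins for whatever representations would be extracted from benign and adversarial audio, and the logistic-regression detector is one simple choice of binary classifier.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

# Hypothetical stand-in features: in practice these would be
# extracted from benign and adversarial audio examples.
rng = np.random.default_rng(0)
benign = rng.normal(0.0, 1.0, size=(200, 40))
adversarial = rng.normal(0.5, 1.0, size=(200, 40))  # shifted distribution

X = np.vstack([benign, adversarial])
y = np.array([0] * 200 + [1] * 200)  # 0 = benign, 1 = adversarial

# Binary detector distinguishing benign from adversarial inputs.
detector = LogisticRegression(max_iter=1000).fit(X, y)
scores = detector.predict_proba(X)[:, 1]  # probability of "adversarial"
```

At inference time, an input whose score exceeds a chosen threshold would be flagged as adversarial before it reaches the speaker-identification model.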
Effect of utterance duration and phonetic content on speaker identification using second-order statistical methods
The goal is to investigate the kind of information used by these methods and where it is located in the speech signal.
Significance of Chirp MFCC as a Feature in Speech and Audio Applications
A novel feature, based on the chirp z-transform, that offers an improved representation of the underlying true spectrum is proposed.
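As a rough illustration of the idea (not the paper's exact feature), cepstral coefficients can be computed from a chirp z-transform spectrum instead of the usual FFT. Evaluating the z-transform on a circle of radius r < 1 moves the analysis contour closer to the vocal-tract poles, which can sharpen the spectral envelope estimate. All parameter values below are illustrative assumptions.

```python
import numpy as np

def chirp_cepstrum(frame, n_bins=256, n_coeffs=13, r=0.99):
    """Cepstral coefficients from a chirp z-transform spectrum (sketch)."""
    n = np.arange(len(frame))
    k = np.arange(n_bins)
    # Contour points z_k = r * exp(j*pi*k/n_bins): the upper half of a
    # circle of radius r (direct O(N*M) evaluation, kept simple for clarity).
    z = r * np.exp(1j * np.pi * k / n_bins)
    spectrum = (frame[None, :] * z[:, None] ** (-n[None, :])).sum(axis=1)
    log_mag = np.log(np.abs(spectrum) + 1e-10)
    # DCT-II of the log-magnitude spectrum -> cepstral coefficients.
    m = np.arange(n_coeffs)
    basis = np.cos(np.pi * m[:, None] * (2 * k[None, :] + 1) / (2 * n_bins))
    return basis @ log_mag

rng = np.random.default_rng(0)
frame = rng.standard_normal(400)  # one 25 ms frame at 16 kHz
coeffs = chirp_cepstrum(frame)
```

With r = 1 the contour reduces to the unit circle and the result matches a standard FFT-based cepstrum over the same half-spectrum.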
Probing Self-supervised Learning Models with Target Speech Extraction
TSE uniquely requires both speaker identification and speech separation, distinguishing it from other tasks in the Speech processing Universal PERformance Benchmark (SUPERB) evaluation.
Speech Rhythm-Based Speaker Embeddings Extraction from Phonemes and Phoneme Duration for Multi-Speaker Speech Synthesis
This paper proposes a speech rhythm-based method for speaker embeddings to model phoneme duration using a few utterances by the target speaker.
Post-Training Embedding Alignment for Decoupling Enrollment and Runtime Speaker Recognition Models
Automated speaker identification (SID) is a crucial step for the personalization of a wide range of speech-enabled services.