Search Results for author: Zuheng Kang

Found 7 papers, 0 papers with code

Efficient Multi-Model Fusion with Adversarial Complementary Representation Learning

no code implementations • 24 Apr 2024 • Zuheng Kang, Yayun He, Jianzong Wang, Junqing Peng, Jing Xiao

Single-model systems often suffer from deficiencies in tasks such as speaker verification (SV) and image classification, relying heavily on partial prior knowledge during decision-making, resulting in suboptimal performance.

Paper
Add Code

Retrieval-Augmented Audio Deepfake Detection

no code implementations • 22 Apr 2024 • Zuheng Kang, Yayun He, Botao Zhao, Xiaoyang Qu, Junqing Peng, Jing Xiao, Jianzong Wang

With recent advances in speech synthesis including text-to-speech (TTS) and voice conversion (VC) systems enabling the generation of ultra-realistic audio deepfakes, there is growing concern about their potential misuse.

DeepFake Detection Face Swapping +3

Paper
Add Code

VoiceExtender: Short-utterance Text-independent Speaker Verification with Guided Diffusion Model

no code implementations • 7 Oct 2023 • Yayun He, Zuheng Kang, Jianzong Wang, Junqing Peng, Jing Xiao

Speaker verification (SV) performance deteriorates as utterances become shorter.

Text-Independent Speaker Verification

Paper
Add Code

SVVAD: Personal Voice Activity Detection for Speaker Verification

no code implementations • 31 May 2023 • Zuheng Kang, Jianzong Wang, Junqing Peng, Jing Xiao

To address this, we propose a speaker verification-based voice activity detection (SVVAD) framework that can adapt the speech features according to which are most informative for SV.

Action Detection Activity Detection +1

Paper
Add Code

Feature-Rich Audio Model Inversion for Data-Free Knowledge Distillation Towards General Sound Classification

no code implementations • 14 Mar 2023 • Zuheng Kang, Yayun He, Jianzong Wang, Junqing Peng, Xiaoyang Qu, Jing Xiao

Data-Free Knowledge Distillation (DFKD) has recently attracted growing attention in the academic community, especially with major breakthroughs in computer vision.

Data-free Knowledge Distillation Sound Classification

Paper
Add Code

SVLDL: Improved Speaker Age Estimation Using Selective Variance Label Distribution Learning

no code implementations • 18 Oct 2022 • Zuheng Kang, Jianzong Wang, Junqing Peng, Jing Xiao

Estimating age from a single speech is a classic and challenging topic.

Age Estimation

Paper
Add Code

SpeechEQ: Speech Emotion Recognition based on Multi-scale Unified Datasets and Multitask Learning

no code implementations • 27 Jun 2022 • Zuheng Kang, Junqing Peng, Jianzong Wang, Jing Xiao

Speech emotion recognition (SER) has many challenges, but one of the main challenges is that each framework does not have a unified standard.

Speech Emotion Recognition

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.