Search Results for author: Zuheng Kang

Found 7 papers, 0 papers with code

Efficient Multi-Model Fusion with Adversarial Complementary Representation Learning

no code implementations • 24 Apr 2024 • Zuheng Kang, Yayun He, Jianzong Wang, Junqing Peng, Jing Xiao

Single-model systems often suffer from deficiencies in tasks such as speaker verification (SV) and image classification, relying heavily on partial prior knowledge during decision-making, resulting in suboptimal performance.

Retrieval-Augmented Audio Deepfake Detection

no code implementations • 22 Apr 2024 • Zuheng Kang, Yayun He, Botao Zhao, Xiaoyang Qu, Junqing Peng, Jing Xiao, Jianzong Wang

With recent advances in speech synthesis including text-to-speech (TTS) and voice conversion (VC) systems enabling the generation of ultra-realistic audio deepfakes, there is growing concern about their potential misuse.

DeepFake Detection, Face Swapping, +3

SVVAD: Personal Voice Activity Detection for Speaker Verification

no code implementations • 31 May 2023 • Zuheng Kang, Jianzong Wang, Junqing Peng, Jing Xiao

To address this, we propose a speaker verification-based voice activity detection (SVVAD) framework that can adapt the speech features according to which ones are most informative for SV.

Action Detection, Activity Detection, +1

Feature-Rich Audio Model Inversion for Data-Free Knowledge Distillation Towards General Sound Classification

no code implementations • 14 Mar 2023 • Zuheng Kang, Yayun He, Jianzong Wang, Junqing Peng, Xiaoyang Qu, Jing Xiao

Data-Free Knowledge Distillation (DFKD) has recently attracted growing attention in the academic community, especially with major breakthroughs in computer vision.

Data-free Knowledge Distillation, Sound Classification

SpeechEQ: Speech Emotion Recognition based on Multi-scale Unified Datasets and Multitask Learning

no code implementations • 27 Jun 2022 • Zuheng Kang, Junqing Peng, Jianzong Wang, Jing Xiao

Speech emotion recognition (SER) faces many challenges, and one of the main ones is that frameworks do not share a unified standard.

Speech Emotion Recognition
