no code implementations • 7 Apr 2023 • Jian Guan, Feiyang Xiao, Youde Liu, Qiaoxi Zhu, Wenwu Wang
This paper uses contrastive learning to refine audio representations for each machine ID, rather than for each audio sample.
1 code implementation • 10 Jan 2022 • Feiyang Xiao, Jian Guan, Haiyan Lan, Qiaoxi Zhu, Wenwu Wang
Although this method effectively captures global information within audio data via the self-attention mechanism, it may ignore the event with short time duration, due to its limitation in capturing local information in an audio signal, leading to inaccurate prediction of captions.
1 code implementation • 30 Mar 2021 • Feiyang Xiao, Jian Guan, Qiuqiang Kong, Wenwu Wang
Speech enhancement aims to obtain speech signals with high intelligibility and quality from noisy speech.