no code implementations • 27 Nov 2023 • Zezhong Jin, Youzhi Tu, Man-Wai Mak
The intuition is that phonetic information can preserve low-level acoustic dynamics with speaker information and thus partly compensate for the degradation due to noise and reverberation.
no code implementations • 28 Oct 2022 • Zezhong Jin, Dading Zhong, Xiao Song, Zhaoyi Liu, Naipeng Ye, Qingcheng Zeng
The model is iteratively updated to correct the unreliable pseudo labels to minimize the effect of noisy labels.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2