no code implementations • 9 Mar 2024 • Yudong Yang, Rongfeng Su, Xiaokang Liu, Nan Yan, Lan Wang
In this model, the inherent acoustic characteristics of individuals related to the tongue motion details are encoded by using wav2vec 2.0, while the ASR transcriptions related to the universality of tongue motions are encoded by using BERT.
1 code implementation • 4th Workshop on African Natural Language Processing 2023 • Cheng Xu, Nan Yan
This paper presents a dataset called AROT-COV23 (ARabic Original Tweets on COVID-19 as of 2023) containing about 500,000 original Arabic COVID-19-related tweets from January 2020 to January 2023.
1 code implementation • 7 Oct 2021 • Jin Li, Haibin Liu, Nan Yan, Lan Wang
Symbolic melody generation is one of the essential tasks for automatic music generation.
1 code implementation • 19 Aug 2021 • Jin Li, Nan Yan, Lan Wang
However, cross-lingual SER remains a challenge in real-world applications due to a great difference between the source and target domain distributions.
no code implementations • 18 Aug 2021 • Jin Li, Rongfeng Su, Xurong Xie, Nan Yan, Lan Wang
The shallow stream is used to acquire traditional shallow features that are beneficial for the classification of phones or words, while the deep stream is used to obtain utterance-level speaker-invariant deep features for improving the feature diversity.
1 code implementation • 18 Aug 2021 • Jin Li, Nan Yan, Lan Wang
For example, RawNet and RawNet2 automatically extract speaker feature embeddings from waveforms to recognize speakers' voices, which can vastly reduce front-end computation and obtain state-of-the-art performance.
no code implementations • 11 Aug 2020 • Chenggang Cui, Nan Yan, Chuanlin Zhang
To mitigate the bus voltage stability issue in DC microgrids, an innovative intelligent control strategy for a buck DC-DC converter with constant power loads (CPLs) via a deep reinforcement learning algorithm is constructed for the first time.