Search Results for author: Nan Yan

Found 7 papers, 4 papers with code

An Audio-textual Diffusion Model For Converting Speech Signals Into Ultrasound Tongue Imaging Data

no code implementations 9 Mar 2024 Yudong Yang, Rongfeng Su, Xiaokang Liu, Nan Yan, Lan Wang

In this model, the inherent acoustic characteristics of individuals related to the tongue motion details are encoded by using wav2vec 2.0, while the ASR transcriptions related to the universality of tongue motions are encoded by using BERT.

AROT-COV23: A Dataset of 500K Original Arabic Tweets on COVID-19

1 code implementation 4th Workshop on African Natural Language Processing 2023 Cheng Xu, Nan Yan

This paper presents a dataset called AROT-COV23 (ARabic Original Tweets on COVID-19 as of 2023) containing about 500,000 original Arabic COVID-19-related tweets from January 2020 to January 2023.

Enhanced Memory Network: The novel network structure for Symbolic Music Generation

1 code implementation 7 Oct 2021 Jin Li, Haibin Liu, Nan Yan, Lan Wang

Symbolic melody generation is one of the essential tasks for automatic music generation.

Music Generation

Unsupervised Cross-Lingual Speech Emotion Recognition Using Pseudo Multilabel

1 code implementation 19 Aug 2021 Jin Li, Nan Yan, Lan Wang

However, cross-lingual SER remains a challenge in real-world applications due to a great difference between the source and target domain distributions.

Speech Emotion Recognition

A Multi-level Acoustic Feature Extraction Framework for Transformer Based End-to-End Speech Recognition

no code implementations 18 Aug 2021 Jin Li, Rongfeng Su, Xurong Xie, Nan Yan, Lan Wang

The shallow stream is used to acquire traditional shallow features that are beneficial for the classification of phones or words, while the deep stream is used to obtain utterance-level speaker-invariant deep features for improving the feature diversity.

Automatic Speech Recognition (ASR) +2

FDN: Finite Difference Network with Hierarchical Convolutional Features for Text-independent Speaker Verification

1 code implementation 18 Aug 2021 Jin Li, Nan Yan, Lan Wang

For example, RawNet and RawNet2 automatically extracted speaker feature embeddings from waveforms for voice recognition, which can vastly reduce the front-end computation and obtain state-of-the-art performance.

Text-Independent Speaker Verification

An Intelligent Control Strategy for buck DC-DC Converter via Deep Reinforcement Learning

no code implementations 11 Aug 2020 Chenggang Cui, Nan Yan, Chuanlin Zhang

To mitigate the bus voltage stability issue in DC microgrids, an innovative intelligent control strategy for a buck DC-DC converter with constant power loads (CPLs) via a deep reinforcement learning algorithm is constructed for the first time.

Reinforcement Learning (RL) +1
