Search Results for author: Nan Yan

Found 7 papers, 4 papers with code

An Audio-textual Diffusion Model For Converting Speech Signals Into Ultrasound Tongue Imaging Data

no code implementations • 9 Mar 2024 • Yudong Yang, Rongfeng Su, Xiaokang Liu, Nan Yan, Lan Wang

In this model, the inherent acoustic characteristics of individuals related to the tongue motion details are encoded by using wav2vec 2. 0, while the ASR transcriptions related to the universality of tongue motions are encoded by using BERT.

Paper
Add Code

AROT-COV23: A Dataset of 500K Original Arabic Tweets on COVID-19

1 code implementation • 4th Workshop on African Natural Language Processing 2023 • Cheng Xu, Nan Yan

This paper presents a dataset called AROT-COV23 (ARabic Original Tweets on COVID-19 as of 2023) containing about 500, 000 original Arabic COVID-19-related tweets from January 2020 to January 2023.

Paper
Code

Enhanced Memory Network: The novel network structure for Symbolic Music Generation

1 code implementation • 7 Oct 2021 • Jin Li, Haibin Liu, Nan Yan, Lan Wang

Symbolic melodies generation is one of the essential tasks for automatic music generation.

Music Generation

Paper
Code

Unsupervised Cross-Lingual Speech Emotion Recognition Using Pseudo Multilabel

1 code implementation • 19 Aug 2021 • Jin Li, Nan Yan, Lan Wang

However, cross-lingual SER remains a challenge in real-world applications due to a great difference between the source and target domain distributions.

Speech Emotion Recognition

Paper
Code

A Multi-level Acoustic Feature Extraction Framework for Transformer Based End-to-End Speech Recognition

no code implementations • 18 Aug 2021 • Jin Li, Rongfeng Su, Xurong Xie, Nan Yan, Lan Wang

The shallow stream is used to acquire traditional shallow features that is beneficial for the classification of phones or words while the deep stream is used to obtain utterance-level speaker-invariant deep features for improving the feature diversity.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Add Code

FDN: Finite Difference Network with Hierarchical Convolutional Features for Text-independent Speaker Verification

1 code implementation • 18 Aug 2021 • Jin Li, Nan Yan, Lan Wang

For example, RawNet and RawNet2 extracted speaker's feature embeddings from waveforms automatically for recognizing their voice, which can vastly reduce the front-end computation and obtain state-of-the-art performance.

Text-Independent Speaker Verification

Paper
Code

An Intelligent Control Strategy for buck DC-DC Converter via Deep Reinforcement Learning

no code implementations • 11 Aug 2020 • Chenggang Cui, Nan Yan, Chuanlin Zhang

To mitigate the bus voltage stability issue in DC microgrid, an innovative intelligent control strategy for buck DC-DC converter with constant power loads (CPLs) via deep reinforcement learning algorithm is constructed for the first time.

reinforcement-learning Reinforcement Learning (RL) +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.