Search Results for author: Zerui Li

LauraGPT: Listen, Attend, Understand, and Regenerate Audio with GPT

In this paper, we propose LauraGPT, a unified GPT model for audio recognition, understanding, and generation.

It combines the accuracy of AED-based models, the efficiency of NAR models, and an explicit customization capacity with superior performance.

FunASR offers models trained on large-scale industrial corpora and the ability to deploy them in applications.

Ranked #1 on Speech Recognition on WenetSpeech (using extra training data)
