Search Results for author: Luoyi Sun

Found 1 paper, 0 papers with code

A Large-scale Dataset for Audio-Language Representation Learning

No code implementations | 20 Sep 2023 | Luoyi Sun, Xuenan Xu, Mengyue Wu, Weidi Xie

To tackle these challenges, we present an innovative, automatic audio caption generation pipeline based on a series of public tools and APIs, and construct a large-scale, high-quality audio-language dataset, named Auto-ACD, comprising over 1.9M audio-text pairs.

Audio captioning, Caption Generation, +2 more
