Search Results for author: Zeyu Xie

Found 4 papers, 2 papers with code

Phonetic and Lexical Discovery of a Canine Language using HuBERT

no code implementations • 25 Feb 2024 • Xingyuan Li, Sinong Wang, Zeyu Xie, Mengyue Wu, Kenny Q. Zhu

This paper delves into the pioneering exploration of potential communication patterns within dog vocalizations and transcends traditional linguistic analysis barriers, which heavily relies on human priori knowledge on limited datasets to find sound units in dog vocalization.

Paper
Add Code

Improving Audio Caption Fluency with Automatic Error Correction

no code implementations • 16 Jun 2023 • Hanxue Zhang, Zeyu Xie, Xuenan Xu, Mengyue Wu, Kai Yu

Automated audio captioning (AAC) is an important cross-modality translation task, aiming at generating descriptions for audio clips.

Audio captioning Sentence

Paper
Add Code

Can Audio Captions Be Evaluated with Image Caption Metrics?

1 code implementation • 10 Oct 2021 • Zelin Zhou, Zhiling Zhang, Xuenan Xu, Zeyu Xie, Mengyue Wu, Kenny Q. Zhu

Current metrics are found in poor correlation with human annotations on these datasets.

AudioCaps Audio captioning +2

Paper
Code

THE SJTU SYSTEM FOR DCASE2021 CHALLENGE TASK 6: AUDIO CAPTIONING BASED ON ENCODER PRE-TRAINING AND REINFORCEMENT LEARNING

1 code implementation • DCASE Challenge 2021 • Xuenan Xu, Zeyu Xie, Mengyue Wu, Kai Yu

This report proposes an audio captioning system for the Detection and Classification of Acoustic Scenes and Events (DCASE) 2021 challenge task Task 6.

Ranked #2 on Audio captioning on Clotho (using extra training data)

Audio captioning Audio Tagging +3

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.