Search Results for author: Mustafa Sert

Audio Captioning with Composition of Acoustic and Semantic Information

We also present exhaustive experiments to show the efficiency of different features and datasets for our proposed model the audio captioning task.

Paper
Add Code

In this study, a novel deep network architecture with audio embeddings is presented to predict audio captions.

Ranked #7 on Audio captioning on Clotho (CIDEr metric)

Paper
Add Code

Using these best practices we propose efficient fusion mechanisms both for single and multiple network models.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.