no code implementations • 13 May 2021 • Ayşegül Özkaya Eren, Mustafa Sert
We also present exhaustive experiments to show the efficiency of different features and datasets for our proposed model on the audio captioning task.
no code implementations • 5 Jun 2020 • Ayşegül Özkaya Eren, Mustafa Sert
In this study, a novel deep network architecture with audio embeddings is presented to predict audio captions.
Ranked #7 on Audio captioning on Clotho (CIDEr metric)
no code implementations • 5 Aug 2016 • Hilal Ergun, Mustafa Sert
Using these best practices, we propose efficient fusion mechanisms for both single- and multiple-network models.