Search Results for author: Soham Tiwari

Found 2 papers, 1 paper with code

Automated Audio Captioning with Epochal Difficult Captions for Curriculum Learning

no code implementations • 4 Jun 2022 • Andrew Koh, Soham Tiwari, Chng Eng Siong

In this paper, we propose an algorithm, Epochal Difficult Captions, to supplement the training of any model for the Automated Audio Captioning task.

Audio captioning


Evaluating robustness of You Only Hear Once (YOHO) Algorithm on noisy audios in the VOICe Dataset

1 code implementation • 1 Nov 2021 • Soham Tiwari, Kshitiz Lakhotia, Manjunath Mulimani

Inspired by the You Only Look Once (YOLO) algorithm in computer vision, the YOHO algorithm can match the performance of various state-of-the-art algorithms on datasets such as the Music Speech Detection Dataset, TUT Sound Event, and Urban-SED, but at lower inference times.

Event Detection, Retrieval

