no code implementations • 4 Jun 2022 • Andrew Koh, Soham Tiwari, Chng Eng Siong
In this paper, we propose an algorithm, Epochal Difficult Captions, to supplement the training of any model for the Automated Audio Captioning task.
1 code implementation • 22 Mar 2022 • Tarun Gupta, Duc-Tuan Truong, Tran The Anh, Chng Eng Siong
In this work, we propose a bi-encoder transformer mixture model for speaker age and height estimation.
1 code implementation • 24 Oct 2021 • Shangeth Rajaa, Pham Van Tung, Chng Eng Siong
Speaker profiling, which aims to estimate speaker characteristics such as age and height, has a wide range of applications inforensics, recommendation systems, etc.