10 Jun 2022 • Duygu Dogan, Huang Xie, Toni Heittola, Tuomas Virtanen
The results show that classification performance is highly sensitive to the semantic relation between test and training classes, and that textual and image embeddings can reach the performance level of semantic acoustic embeddings when the seen and unseen classes are semantically similar.