1 code implementation • 21 Sep 2023 • Theodoros Kouzelis, Vassilis Katsouros
Our approach leverages the similarity between audio and text embeddings in CLAP.
1 code implementation • 20 Sep 2023 • Manos Plitsis, Theodoros Kouzelis, Georgios Paraskevopoulos, Vassilis Katsouros, Yannis Panagakis
In this work, we investigate the personalization of text-to-music diffusion models in a few-shot setting.
1 code implementation • 30 May 2023 • Theodoros Kouzelis, Georgios Paraskevopoulos, Athanasios Katsamanis, Vassilis Katsouros
The study of speech disorders can benefit greatly from time-aligned data.
no code implementations • 31 Dec 2022 • Georgios Paraskevopoulos, Theodoros Kouzelis, Georgios Rouvalis, Athanasios Katsamanis, Vassilis Katsouros, Alexandros Potamianos
Modern speech recognition systems exhibits rapid performance degradation under domain shift.