1 code implementation • 16 Nov 2023 • Ilaria Manco, Benno Weck, Seungheon Doh, Minz Won, Yixiao Zhang, Dmitry Bogdanov, Yusong Wu, Ke Chen, Philip Tovstogan, Emmanouil Benetos, Elio Quinton, György Fazekas, Juhan Nam
We introduce the Song Describer dataset (SDD), a new crowdsourced corpus of high-quality audio-caption pairs, designed for the evaluation of music-and-language models.
1 code implementation • 31 Jul 2023 • Seungheon Doh, Keunwoo Choi, Jongpil Lee, Juhan Nam
In addition, we trained a transformer-based music captioning model with the dataset and evaluated it under zero-shot and transfer-learning settings.
no code implementations • 19 Mar 2023 • Seungheon Doh, Minz Won, Keunwoo Choi, Juhan Nam
We introduce a framework that recommends music based on the emotions of speech.
1 code implementation • 14 Jan 2023 • Haven Kim, Seungheon Doh, Junwon Lee, Juhan Nam
Automatically generating or captioning music playlist titles given a set of tracks is of significant interest in music streaming services as customized playlists are widely used in personalized music recommendation, and well-composed text titles attract users and help their music discovery.
3 code implementations • 26 Nov 2022 • Seungheon Doh, Minz Won, Keunwoo Choi, Juhan Nam
This paper introduces effective design choices for text-to-music retrieval systems.
no code implementations • NLP4MusA 2021 • Seungheon Doh, Junwon Lee, Juhan Nam
We propose a machine-translation approach to automatically generate a playlist title from a set of music tracks.
1 code implementation • 9 Feb 2021 • Dasaem Jeong, Seungheon Doh, Taegyun Kwon
The goal of this paper to generate a visually appealing video that responds to music with a neural network so that each frame of the video reflects the musical characteristics of the corresponding audio clip.
Ranked #1 on Music Auto-Tagging on TimeTravel (using extra training data)
no code implementations • 23 Jul 2020 • Seungheon Doh, Jongpil Lee, Tae Hong Park, Juhan Nam
Word embedding pioneered by Mikolov et al. is a staple technique for word representations in natural language processing (NLP) research which has also found popularity in music information retrieval tasks.