7 code implementations • 11 Apr 2024 • Chin-Yun Yu, Christopher Mitcheltree, Alistair Carson, Stefan Bilbao, Joshua D. Reiss, György Fazekas
Infinite impulse response filters are an essential building block of many time-varying audio systems, such as audio effects and synthesisers.
1 code implementation • 16 Nov 2023 • Ilaria Manco, Benno Weck, Seungheon Doh, Minz Won, Yixiao Zhang, Dmitry Bogdanov, Yusong Wu, Ke Chen, Philip Tovstogan, Emmanouil Benetos, Elio Quinton, György Fazekas, Juhan Nam
We introduce the Song Describer dataset (SDD), a new crowdsourced corpus of high-quality audio-caption pairs, designed for the evaluation of music-and-language models.
1 code implementation • 13 Nov 2023 • Chin-Yun Yu, Emilian Postolache, Emanuele Rodolà, György Fazekas
In this paper, we examine this problem in the context of duet singing voices separation, and propose a method to enforce the coherency of singer identity by splitting the mixture into overlapping segments and performing posterior sampling in an auto-regressive manner, conditioning on the previous segment.
no code implementations • 21 Oct 2023 • Jincheng Zhang, György Fazekas, Charalampos Saitis
Diffusion models have shown promising results for a wide range of generative tasks with continuous data, such as image and audio synthesis.
no code implementations • 21 Oct 2023 • Jincheng Zhang, Jingjing Tang, Charalampos Saitis, György Fazekas
The diffusion model is trained to generate intermediate music sequences consisting of codebook indexes, which are then decoded to symbolic music using the VQ-VAE's decoder.
no code implementations • 6 Sep 2023 • Soumya Sai Vanka, Maryam Safi, Jean-Baptiste Rolland, György Fazekas
Effective music mixing requires technical and creative finesse, but clear communication with the client is crucial.
2 code implementations • 29 Jun 2023 • Chin-Yun Yu, György Fazekas
This paper introduces GlOttal-flow LPC Filter (GOLF), a novel method for singing voice synthesis (SVS) that exploits the physical characteristics of the human voice using differentiable digital signal processing.
no code implementations • 19 Apr 2023 • Ben Hayes, Charalampos Saitis, György Fazekas
We discuss the discontinuities that arise when mapping unordered objects to neural network outputs of fixed permutation, referred to as the responsibility problem.
1 code implementation • 24 Jan 2023 • Cyrus Vahidi, Han Han, Changhong Wang, Mathieu Lagrange, György Fazekas, Vincent Lostanlen
Computer musicians refer to mesostructures as the intermediate levels of articulation between the microstructure of waveshapes and the macrostructure of musical forms.
1 code implementation • 27 Oct 2022 • Rodrigo Diaz, Ben Hayes, Charalampos Saitis, György Fazekas, Mark Sandler
Physical models of rigid bodies are used for sound synthesis in applications from virtual environments to music production.
1 code implementation • 27 Oct 2022 • Chin-Yun Yu, Sung-Lin Yeh, György Fazekas, Hao Tang
Moreover, by coupling the proposed sampling method with an unconditional DM, i. e., a DM with no auxiliary inputs to its noise predictor, we can generalize it to a wide range of SR setups.
1 code implementation • 26 Oct 2022 • Ben Hayes, Charalampos Saitis, György Fazekas
Sinusoidal parameter estimation is a fundamental task in applications from spectral analysis to time-series forecasting.
1 code implementation • 25 Aug 2022 • Ilaria Manco, Emmanouil Benetos, Elio Quinton, György Fazekas
In this work, we explore cross-modal learning in an attempt to bridge audio and language in the music domain.
1 code implementation • 16 Jul 2021 • Elona Shatri, György Fazekas
While we do not assume to have solved the main challenges in OMR, this dataset opens a new course of discussions that would ultimately aid that goal.
1 code implementation • 11 Jul 2021 • Ben Hayes, Charalampos Saitis, György Fazekas
We present the Neural Waveshaping Unit (NEWT): a novel, lightweight, fully causal approach to neural audio synthesis which operates directly in the waveform domain, with an accompanying optimisation (FastNEWT) for efficient CPU inference.
1 code implementation • 25 May 2021 • Cyrus Vahidi, Charalampos Saitis, György Fazekas
Modulation filter bank representations that have been actively researched as a basis for timbre perception have the potential to facilitate the extraction of perceptually salient features.
no code implementations • 7 Dec 2020 • Shengchen Li, Yinji Jing, György Fazekas
The aim of the dataset is to examine whether it is possible to distinguish computer generated melodies by learning the feature of generated melodies.
no code implementations • 14 Jun 2020 • Elona Shatri, György Fazekas
In this paper, we review relevant works in OMR, including fundamental methods and significant outcomes, and highlight different stages of the OMR pipeline.
no code implementations • 16 Jan 2020 • Johan Pauwels, György Fazekas, Mark B. Sandler
In this paper, we analyse some proposed examples of MLNs for musical analysis and consider their practical disadvantages when compared to formulating the same musical dependence relationships as (dynamic) Bayesian networks.
no code implementations • 1 May 2019 • Di Sheng, György Fazekas
In this paper, a siamese DNN model is proposed to learn the characteristics of the audio dynamic range compressor (DRC).
2 code implementations • 13 Sep 2017 • Keunwoo Choi, György Fazekas, Kyunghyun Cho, Mark Sandler
Following their success in Computer Vision and other areas, deep learning techniques have recently become widely adopted in Music Information Retrieval (MIR) research.
1 code implementation • 6 Sep 2017 • Keunwoo Choi, György Fazekas, Kyunghyun Cho, Mark Sandler
In this paper, we empirically investigate the effect of audio preprocessing on music tagging with deep neural networks.
3 code implementations • 27 Mar 2017 • Keunwoo Choi, György Fazekas, Mark Sandler, Kyunghyun Cho
In this paper, we present a transfer learning approach for music classification and regression tasks.