Search Results for author: György Fazekas

Found 23 papers, 15 papers with code

Differentiable All-pole Filters for Time-varying Audio Systems

7 code implementations11 Apr 2024 Chin-Yun Yu, Christopher Mitcheltree, Alistair Carson, Stefan Bilbao, Joshua D. Reiss, György Fazekas

Infinite impulse response filters are an essential building block of many time-varying audio systems, such as audio effects and synthesisers.

Audio Effects Modeling Audio Synthesis

The Song Describer Dataset: a Corpus of Audio Captions for Music-and-Language Evaluation

1 code implementation16 Nov 2023 Ilaria Manco, Benno Weck, Seungheon Doh, Minz Won, Yixiao Zhang, Dmitry Bogdanov, Yusong Wu, Ke Chen, Philip Tovstogan, Emmanouil Benetos, Elio Quinton, György Fazekas, Juhan Nam

We introduce the Song Describer dataset (SDD), a new crowdsourced corpus of high-quality audio-caption pairs, designed for the evaluation of music-and-language models.

Music Captioning Music Generation +2

Zero-Shot Duet Singing Voices Separation with Diffusion Models

1 code implementation13 Nov 2023 Chin-Yun Yu, Emilian Postolache, Emanuele Rodolà, György Fazekas

In this paper, we examine this problem in the context of duet singing voices separation, and propose a method to enforce the coherency of singer identity by splitting the mixture into overlapping segments and performing posterior sampling in an auto-regressive manner, conditioning on the previous segment.

Fast Diffusion GAN Model for Symbolic Music Generation Controlled by Emotions

no code implementations21 Oct 2023 Jincheng Zhang, György Fazekas, Charalampos Saitis

Diffusion models have shown promising results for a wide range of generative tasks with continuous data, such as image and audio synthesis.

Audio Synthesis Generative Adversarial Network +1

Composer Style-specific Symbolic Music Generation Using Vector Quantized Discrete Diffusion Models

no code implementations21 Oct 2023 Jincheng Zhang, Jingjing Tang, Charalampos Saitis, György Fazekas

The diffusion model is trained to generate intermediate music sequences consisting of codebook indexes, which are then decoded to symbolic music using the VQ-VAE's decoder.

Denoising Music Generation

Singing Voice Synthesis Using Differentiable LPC and Glottal-Flow-Inspired Wavetables

2 code implementations29 Jun 2023 Chin-Yun Yu, György Fazekas

This paper introduces GlOttal-flow LPC Filter (GOLF), a novel method for singing voice synthesis (SVS) that exploits the physical characteristics of the human voice using differentiable digital signal processing.

Singing Voice Synthesis

The Responsibility Problem in Neural Networks with Unordered Targets

no code implementations19 Apr 2023 Ben Hayes, Charalampos Saitis, György Fazekas

We discuss the discontinuities that arise when mapping unordered objects to neural network outputs of fixed permutation, referred to as the responsibility problem.

Mesostructures: Beyond Spectrogram Loss in Differentiable Time-Frequency Analysis

1 code implementation24 Jan 2023 Cyrus Vahidi, Han Han, Changhong Wang, Mathieu Lagrange, György Fazekas, Vincent Lostanlen

Computer musicians refer to mesostructures as the intermediate levels of articulation between the microstructure of waveshapes and the macrostructure of musical forms.

Rigid-Body Sound Synthesis with Differentiable Modal Resonators

1 code implementation27 Oct 2022 Rodrigo Diaz, Ben Hayes, Charalampos Saitis, György Fazekas, Mark Sandler

Physical models of rigid bodies are used for sound synthesis in applications from virtual environments to music production.

Conditioning and Sampling in Variational Diffusion Models for Speech Super-Resolution

1 code implementation27 Oct 2022 Chin-Yun Yu, Sung-Lin Yeh, György Fazekas, Hao Tang

Moreover, by coupling the proposed sampling method with an unconditional DM, i. e., a DM with no auxiliary inputs to its noise predictor, we can generalize it to a wide range of SR setups.

Super-Resolution

Sinusoidal Frequency Estimation by Gradient Descent

1 code implementation26 Oct 2022 Ben Hayes, Charalampos Saitis, György Fazekas

Sinusoidal parameter estimation is a fundamental task in applications from spectral analysis to time-series forecasting.

Time Series Time Series Forecasting

Contrastive Audio-Language Learning for Music

1 code implementation25 Aug 2022 Ilaria Manco, Emmanouil Benetos, Elio Quinton, György Fazekas

In this work, we explore cross-modal learning in an attempt to bridge audio and language in the music domain.

Audio to Text Retrieval Descriptive +5

DoReMi: First glance at a universal OMR dataset

1 code implementation16 Jul 2021 Elona Shatri, György Fazekas

While we do not assume to have solved the main challenges in OMR, this dataset opens a new course of discussions that would ultimately aid that goal.

object-detection Object Detection

Neural Waveshaping Synthesis

1 code implementation11 Jul 2021 Ben Hayes, Charalampos Saitis, György Fazekas

We present the Neural Waveshaping Unit (NEWT): a novel, lightweight, fully causal approach to neural audio synthesis which operates directly in the waveform domain, with an accompanying optimisation (FastNEWT) for efficient CPU inference.

Audio Generation Audio Synthesis

A Modulation Front-End for Music Audio Tagging

1 code implementation25 May 2021 Cyrus Vahidi, Charalampos Saitis, György Fazekas

Modulation filter bank representations that have been actively researched as a basis for timbre perception have the potential to facilitate the extraction of perceptually salient features.

Audio Tagging Music Tagging +1

A novel dataset for the identification of computer generated melodies in the CSMT challenge

no code implementations7 Dec 2020 Shengchen Li, Yinji Jing, György Fazekas

The aim of the dataset is to examine whether it is possible to distinguish computer generated melodies by learning the feature of generated melodies.

Optical Music Recognition: State of the Art and Major Challenges

no code implementations14 Jun 2020 Elona Shatri, György Fazekas

In this paper, we review relevant works in OMR, including fundamental methods and significant outcomes, and highlight different stages of the OMR pipeline.

A Critical Look at the Applicability of Markov Logic Networks for Music Signal Analysis

no code implementations16 Jan 2020 Johan Pauwels, György Fazekas, Mark B. Sandler

In this paper, we analyse some proposed examples of MLNs for musical analysis and consider their practical disadvantages when compared to formulating the same musical dependence relationships as (dynamic) Bayesian networks.

A Feature Learning Siamese Model for Intelligent Control of the Dynamic Range Compressor

no code implementations1 May 2019 Di Sheng, György Fazekas

In this paper, a siamese DNN model is proposed to learn the characteristics of the audio dynamic range compressor (DRC).

A Tutorial on Deep Learning for Music Information Retrieval

2 code implementations13 Sep 2017 Keunwoo Choi, György Fazekas, Kyunghyun Cho, Mark Sandler

Following their success in Computer Vision and other areas, deep learning techniques have recently become widely adopted in Music Information Retrieval (MIR) research.

Information Retrieval Music Information Retrieval +2

A Comparison of Audio Signal Preprocessing Methods for Deep Neural Networks on Music Tagging

1 code implementation6 Sep 2017 Keunwoo Choi, György Fazekas, Kyunghyun Cho, Mark Sandler

In this paper, we empirically investigate the effect of audio preprocessing on music tagging with deep neural networks.

Music Tagging

Transfer learning for music classification and regression tasks

3 code implementations27 Mar 2017 Keunwoo Choi, György Fazekas, Mark Sandler, Kyunghyun Cho

In this paper, we present a transfer learning approach for music classification and regression tasks.

Classification General Classification +4

Cannot find the paper you are looking for? You can Submit a new open access paper.