Search Results for author: György Fazekas

Found 23 papers, 15 papers with code

Differentiable All-pole Filters for Time-varying Audio Systems

7 code implementations • 11 Apr 2024 • Chin-Yun Yu, Christopher Mitcheltree, Alistair Carson, Stefan Bilbao, Joshua D. Reiss, György Fazekas

Infinite impulse response filters are an essential building block of many time-varying audio systems, such as audio effects and synthesisers.

Audio Effects Modeling Audio Synthesis

179

Paper
Code

The Song Describer Dataset: a Corpus of Audio Captions for Music-and-Language Evaluation

1 code implementation • 16 Nov 2023 • Ilaria Manco, Benno Weck, Seungheon Doh, Minz Won, Yixiao Zhang, Dmitry Bogdanov, Yusong Wu, Ke Chen, Philip Tovstogan, Emmanouil Benetos, Elio Quinton, György Fazekas, Juhan Nam

We introduce the Song Describer dataset (SDD), a new crowdsourced corpus of high-quality audio-caption pairs, designed for the evaluation of music-and-language models.

Music Captioning Music Generation +2

116

Paper
Code

Zero-Shot Duet Singing Voices Separation with Diffusion Models

1 code implementation • 13 Nov 2023 • Chin-Yun Yu, Emilian Postolache, Emanuele Rodolà, György Fazekas

In this paper, we examine this problem in the context of duet singing voices separation, and propose a method to enforce the coherency of singer identity by splitting the mixture into overlapping segments and performing posterior sampling in an auto-regressive manner, conditioning on the previous segment.

Paper
Code

Fast Diffusion GAN Model for Symbolic Music Generation Controlled by Emotions

no code implementations • 21 Oct 2023 • Jincheng Zhang, György Fazekas, Charalampos Saitis

Diffusion models have shown promising results for a wide range of generative tasks with continuous data, such as image and audio synthesis.

Audio Synthesis Generative Adversarial Network +1

Paper
Add Code

Composer Style-specific Symbolic Music Generation Using Vector Quantized Discrete Diffusion Models

no code implementations • 21 Oct 2023 • Jincheng Zhang, Jingjing Tang, Charalampos Saitis, György Fazekas

The diffusion model is trained to generate intermediate music sequences consisting of codebook indexes, which are then decoded to symbolic music using the VQ-VAE's decoder.

Denoising Music Generation

Paper
Add Code

The Role of Communication and Reference Songs in the Mixing Process: Insights from Professional Mix Engineers

no code implementations • 6 Sep 2023 • Soumya Sai Vanka, Maryam Safi, Jean-Baptiste Rolland, György Fazekas

Effective music mixing requires technical and creative finesse, but clear communication with the client is crucial.

Decision Making

Paper
Add Code

Singing Voice Synthesis Using Differentiable LPC and Glottal-Flow-Inspired Wavetables

2 code implementations • 29 Jun 2023 • Chin-Yun Yu, György Fazekas

This paper introduces GlOttal-flow LPC Filter (GOLF), a novel method for singing voice synthesis (SVS) that exploits the physical characteristics of the human voice using differentiable digital signal processing.

Singing Voice Synthesis

Paper
Code

The Responsibility Problem in Neural Networks with Unordered Targets

no code implementations • 19 Apr 2023 • Ben Hayes, Charalampos Saitis, György Fazekas

We discuss the discontinuities that arise when mapping unordered objects to neural network outputs of fixed permutation, referred to as the responsibility problem.

Paper
Add Code

Mesostructures: Beyond Spectrogram Loss in Differentiable Time-Frequency Analysis

1 code implementation • 24 Jan 2023 • Cyrus Vahidi, Han Han, Changhong Wang, Mathieu Lagrange, György Fazekas, Vincent Lostanlen

Computer musicians refer to mesostructures as the intermediate levels of articulation between the microstructure of waveshapes and the macrostructure of musical forms.

Paper
Code

Rigid-Body Sound Synthesis with Differentiable Modal Resonators

1 code implementation • 27 Oct 2022 • Rodrigo Diaz, Ben Hayes, Charalampos Saitis, György Fazekas, Mark Sandler

Physical models of rigid bodies are used for sound synthesis in applications from virtual environments to music production.

Paper
Code

Conditioning and Sampling in Variational Diffusion Models for Speech Super-Resolution

1 code implementation • 27 Oct 2022 • Chin-Yun Yu, Sung-Lin Yeh, György Fazekas, Hao Tang

Moreover, by coupling the proposed sampling method with an unconditional DM, i. e., a DM with no auxiliary inputs to its noise predictor, we can generalize it to a wide range of SR setups.

Super-Resolution

Paper
Code

Sinusoidal Frequency Estimation by Gradient Descent

1 code implementation • 26 Oct 2022 • Ben Hayes, Charalampos Saitis, György Fazekas

Sinusoidal parameter estimation is a fundamental task in applications from spectral analysis to time-series forecasting.

Time Series Time Series Forecasting

Paper
Code

Contrastive Audio-Language Learning for Music

1 code implementation • 25 Aug 2022 • Ilaria Manco, Emmanouil Benetos, Elio Quinton, György Fazekas

In this work, we explore cross-modal learning in an attempt to bridge audio and language in the music domain.

Audio to Text Retrieval Descriptive +5

Paper
Code

DoReMi: First glance at a universal OMR dataset

1 code implementation • 16 Jul 2021 • Elona Shatri, György Fazekas

While we do not assume to have solved the main challenges in OMR, this dataset opens a new course of discussions that would ultimately aid that goal.

object-detection Object Detection

288

Paper
Code

Neural Waveshaping Synthesis

1 code implementation • 11 Jul 2021 • Ben Hayes, Charalampos Saitis, György Fazekas

We present the Neural Waveshaping Unit (NEWT): a novel, lightweight, fully causal approach to neural audio synthesis which operates directly in the waveform domain, with an accompanying optimisation (FastNEWT) for efficient CPU inference.

Audio Generation Audio Synthesis

180

Paper
Code

A Modulation Front-End for Music Audio Tagging

1 code implementation • 25 May 2021 • Cyrus Vahidi, Charalampos Saitis, György Fazekas

Modulation filter bank representations that have been actively researched as a basis for timbre perception have the potential to facilitate the extraction of perceptually salient features.

Audio Tagging Music Tagging +1

Paper
Code

A novel dataset for the identification of computer generated melodies in the CSMT challenge

no code implementations • 7 Dec 2020 • Shengchen Li, Yinji Jing, György Fazekas

The aim of the dataset is to examine whether it is possible to distinguish computer generated melodies by learning the feature of generated melodies.

Paper
Add Code

Optical Music Recognition: State of the Art and Major Challenges

no code implementations • 14 Jun 2020 • Elona Shatri, György Fazekas

In this paper, we review relevant works in OMR, including fundamental methods and significant outcomes, and highlight different stages of the OMR pipeline.

Paper
Add Code

A Critical Look at the Applicability of Markov Logic Networks for Music Signal Analysis

no code implementations • 16 Jan 2020 • Johan Pauwels, György Fazekas, Mark B. Sandler

In this paper, we analyse some proposed examples of MLNs for musical analysis and consider their practical disadvantages when compared to formulating the same musical dependence relationships as (dynamic) Bayesian networks.

Paper
Add Code

A Feature Learning Siamese Model for Intelligent Control of the Dynamic Range Compressor

no code implementations • 1 May 2019 • Di Sheng, György Fazekas

In this paper, a siamese DNN model is proposed to learn the characteristics of the audio dynamic range compressor (DRC).

Paper
Add Code

A Tutorial on Deep Learning for Music Information Retrieval

2 code implementations • 13 Sep 2017 • Keunwoo Choi, György Fazekas, Kyunghyun Cho, Mark Sandler

Following their success in Computer Vision and other areas, deep learning techniques have recently become widely adopted in Music Information Retrieval (MIR) research.

Information Retrieval Music Information Retrieval +2