Search Results for author: Gaël Richard

Found 27 papers, 15 papers with code

Structure-informed Positional Encoding for Music Generation

no code implementations20 Feb 2024 Manvi Agarwal, Changhong Wang, Gaël Richard

Music generated by deep learning methods often suffers from a lack of coherence and long-term organization.

Music Generation

Unsupervised Harmonic Parameter Estimation Using Differentiable DSP and Spectral Optimal Transport

1 code implementation22 Dec 2023 Bernardo Torres, Geoffroy Peeters, Gaël Richard

In neural audio signal processing, pitch conditioning has been used to enhance the performance of synthesizers.

Audio Signal Processing

Transfer Learning and Bias Correction with Pre-trained Audio Embeddings

1 code implementation20 Jul 2023 Changhong Wang, Gaël Richard, Brian McFee

This approach allows representations derived for one task to be applied to another, and can result in high accuracy with less stringent training data requirements for the downstream task.

Information Retrieval Instrument Recognition +3

Video-to-Music Recommendation using Temporal Alignment of Segments

no code implementations12 Jun 2023 Laure Prétet, Gaël Richard, Clément Souchier, Geoffroy Peeters

We propose a novel approach to significantly improve the system's performance using structure-aware recommendation.

Music Recommendation

Tackling Interpretability in Audio Classification Networks with Non-negative Matrix Factorization

no code implementations11 May 2023 Jayneel Parekh, Sanjeel Parekh, Pavlo Mozharovskyi, Gaël Richard, Florence d'Alché-Buc

This paper tackles two major problem settings for interpretability of audio processing networks, post-hoc and by-design interpretation.

Audio Classification

Exploiting Device and Audio Data to Tag Music with User-Aware Listening Contexts

1 code implementation14 Nov 2022 Karim M. Ibrahim, Elena V. Epure, Geoffroy Peeters, Gaël Richard

Namely, we propose a system which can generate a situational playlist for a user at a certain time 1) by leveraging user-aware music autotaggers, and 2) by automatically inferring the user's situation from stream data (e. g. device, network) and user's general profile information (e. g. age).

Retrieval TAG

Rate-Distortion Theoretic Generalization Bounds for Stochastic Learning Algorithms

no code implementations4 Mar 2022 Milad Sefidgaran, Amin Gohari, Gaël Richard, Umut Şimşekli

Understanding generalization in modern machine learning settings has been one of the major challenges in statistical learning theory.

Generalization Bounds Learning Theory

Unsupervised Music Source Separation Using Differentiable Parametric Source Models

2 code implementations24 Jan 2022 Kilian Schulze-Forster, Gaël Richard, Liam Kelley, Clement S. J. Doire, Roland Badeau

Integrating domain knowledge in the form of source models into a data-driven method leads to high data efficiency: the proposed approach achieves good separation quality even when trained on less than three minutes of audio.

Audio Source Separation Music Source Separation +1

Probabilistic semi-nonnegative matrix factorization: a Skellam-based framework

1 code implementation7 Jul 2021 Benoit Fuentes, Gaël Richard

We present a new probabilistic model to address semi-nonnegative matrix factorization (SNMF), called Skellam-SNMF.

Bayesian Inference

Heavy Tails in SGD and Compressibility of Overparametrized Neural Networks

1 code implementation NeurIPS 2021 Melih Barsbey, Milad Sefidgaran, Murat A. Erdogdu, Gaël Richard, Umut Şimşekli

Neural network compression techniques have become increasingly popular as they can drastically reduce the storage and computation requirements for very large networks.

Generalization Bounds Neural Network Compression

VQCPC-GAN: Variable-Length Adversarial Audio Synthesis Using Vector-Quantized Contrastive Predictive Coding

1 code implementation4 May 2021 Javier Nistal, Cyran Aouameur, Stefan Lattner, Gaël Richard

Influenced by the field of Computer Vision, Generative Adversarial Networks (GANs) are often adopted for the audio domain using fixed-size two-dimensional spectrogram representations as the "image data".

Audio Synthesis

Self-Supervised VQ-VAE for One-Shot Music Style Transfer

1 code implementation10 Feb 2021 Ondřej Cífka, Alexey Ozerov, Umut Şimşekli, Gaël Richard

While several style conversion methods tailored to musical signals have been proposed, most lack the 'one-shot' capability of classical image style transfer algorithms.

Music Style Transfer Self-Supervised Learning +1

Learning to rank music tracks using triplet loss

no code implementations18 May 2020 Laure Prétet, Gaël Richard, Geoffroy Peeters

These algorithms aim at retrieving a ranked list of music tracks based on their similarity with a target music track.

Learning-To-Rank

On the Heavy-Tailed Theory of Stochastic Gradient Descent for Deep Neural Networks

no code implementations29 Nov 2019 Umut Şimşekli, Mert Gürbüzbalaban, Thanh Huy Nguyen, Gaël Richard, Levent Sagun

This assumption is often made for mathematical convenience, since it enables SGD to be analyzed as a stochastic differential equation (SDE) driven by a Brownian motion.

Supervised Symbolic Music Style Translation Using Synthetic Data

1 code implementation4 Jul 2019 Ondřej Cífka, Umut Şimşekli, Gaël Richard

Research on style transfer and domain translation has clearly demonstrated the ability of deep learning-based algorithms to manipulate images in terms of artistic style.

Music Genre Transfer Style Transfer +2

First Exit Time Analysis of Stochastic Gradient Descent Under Heavy-Tailed Gradient Noise

1 code implementation NeurIPS 2019 Thanh Huy Nguyen, Umut Şimşekli, Mert Gürbüzbalaban, Gaël Richard

We show that the behaviors of the two systems are indeed similar for small step-sizes and we identify how the error depends on the algorithm and problem parameters.

Computational Efficiency

Non-Asymptotic Analysis of Fractional Langevin Monte Carlo for Non-Convex Optimization

no code implementations22 Jan 2019 Thanh Huy Nguyen, Umut Şimşekli, Gaël Richard

Recent studies on diffusion-based sampling methods have shown that Langevin Monte Carlo (LMC) algorithms can be beneficial for non-convex optimization, and rigorous theoretical guarantees have been proven for both asymptotic and finite-time regimes.

Asynchronous Stochastic Quasi-Newton MCMC for Non-Convex Optimization

no code implementations ICML 2018 Umut Şimşekli, Çağatay Yıldız, Thanh Huy Nguyen, Gaël Richard, A. Taylan Cemgil

The results support our theory and show that the proposed algorithm provides a significant speedup over the recently proposed synchronous distributed L-BFGS algorithm.

Weakly Supervised Representation Learning for Unsynchronized Audio-Visual Events

no code implementations19 Apr 2018 Sanjeel Parekh, Slim Essid, Alexey Ozerov, Ngoc Q. K. Duong, Patrick Pérez, Gaël Richard

Audio-visual representation learning is an important task from the perspective of designing machines with the ability to understand complex events.

Multiple Instance Learning Representation Learning

Stochastic Gradient Richardson-Romberg Markov Chain Monte Carlo

no code implementations NeurIPS 2016 Alain Durmus, Umut Simsekli, Eric Moulines, Roland Badeau, Gaël Richard

We illustrate our framework on the popular Stochastic Gradient Langevin Dynamics (SGLD) algorithm and propose a novel SG-MCMC algorithm referred to as Stochastic Gradient Richardson-Romberg Langevin Dynamics (SGRRLD).

Bayesian Inference

Stochastic Quasi-Newton Langevin Monte Carlo

no code implementations10 Feb 2016 Umut Şimşekli, Roland Badeau, A. Taylan Cemgil, Gaël Richard

These second order methods directly approximate the inverse Hessian by using a limited history of samples and their gradients.

Second-order methods

Cannot find the paper you are looking for? You can Submit a new open access paper.