Search Results for author: Bryce Irvin

Found 3 papers, 2 papers with code

CATSE: A Context-Aware Framework for Causal Target Sound Extraction

no code implementations • 21 Mar 2024 • Shrishail Baligar, Mikolaj Kegler, Bryce Irvin, Marko Stamenovic, Shawn Newsam

First, we explore the utility of context by providing the TSE model with oracle information about what sound classes make up the input mixture, where the objective of the model is to extract one or more sources of interest indicated by the user.

Target Sound Extraction

Paper
Add Code

Latent CLAP Loss for Better Foley Sound Synthesis

1 code implementation • 18 Mar 2024 • Tornike Karchkhadze, Hassan Salami Kavaki, Mohammad Rasool Izadi, Bryce Irvin, Mikolaj Kegler, Ari Hertz, Shuo Zhang, Marko Stamenovic

We introduce a new loss term to enhance Foley sound generation in AudioLDM without post-filtering.

FAD

Paper
Code

Self-Supervised Learning for Speech Enhancement through Synthesis

1 code implementation • 4 Nov 2022 • Bryce Irvin, Marko Stamenovic, Mikolaj Kegler, Li-Chia Yang

Modern speech enhancement (SE) networks typically implement noise suppression through time-frequency masking, latent representation masking, or discriminative signal prediction.

Denoising Self-Supervised Learning +2

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.