Search Results for author: Sven Ahlbäck

Found 4 papers, 1 paper with code

Polyphonic pitch detection with convolutional recurrent neural networks

no code implementations • 4 Feb 2022 • Carl Thomé, Sven Ahlbäck

Recent directions in automatic speech recognition (ASR) research have shown that applying deep learning models from image recognition challenges in computer vision is beneficial.

Automatic Speech Recognition (ASR) +3

Pitch-Informed Instrument Assignment Using a Deep Convolutional Network with Multiple Kernel Shapes

no code implementations • 28 Jul 2021 • Carlos Lordelo, Emmanouil Benetos, Simon Dixon, Sven Ahlbäck

We also include ablation studies investigating the effects of the use of multiple kernel shapes and comparing different input representations for the audio and the note-related information.
