1 code implementation • 17 Feb 2022 • Kento Nagatomo, Masahiro Yasuda, Kohei Yatabe, Shoichiro Saito, Yasuhiro Oikawa
Sound event localization and detection (SELD) is a combined task of identifying sound events and estimating their directions of arrival.
no code implementations • 16 Feb 2022 • Tomoro Tanaka, Kohei Yatabe, Masahiro Yasuda, Yasuhiro Oikawa
Still, they cannot perform well if the training data are mismatched and/or time-domain constraints are not imposed.
no code implementations • 7 May 2021 • Tsubasa Kusano, Kohei Yatabe, Yasuhiro Oikawa
In this paper, we propose a method for estimating a sparse time-frequency (T-F) representation using the atomic norm.
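As a rough illustration of sparsity-promoting T-F estimation (this is *not* the paper's atomic-norm formulation; plain soft thresholding of STFT coefficients is used here as a simple stand-in):

```python
import numpy as np

def soft_threshold(coeffs, thresh):
    """Shrink complex T-F coefficients toward zero (the proximal
    operator of the l1 norm), a basic sparsity-promoting step."""
    mag = np.abs(coeffs)
    scale = np.maximum(mag - thresh, 0.0) / np.maximum(mag, 1e-12)
    return coeffs * scale

# Toy spectrogram: one sustained tonal component plus weak noise.
rng = np.random.default_rng(0)
spec = 0.05 * (rng.standard_normal((64, 32))
               + 1j * rng.standard_normal((64, 32)))
spec[10, :] += 1.0  # the tonal component, in frequency bin 10

sparse_spec = soft_threshold(spec, thresh=0.3)
# Nearly all noise bins are zeroed while the tonal row survives.
print(np.count_nonzero(sparse_spec), "of", spec.size, "bins remain")
```

Atomic-norm methods replace this elementwise shrinkage with a structured sparsity penalty over a continuous dictionary of atoms, but the thresholding above captures the basic idea of suppressing small T-F coefficients.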
no code implementations • 28 Jul 2020 • Yoshiki Masuyama, Yoshiaki Bando, Kohei Yatabe, Yoko Sasaki, Masaki Onishi, Yasuhiro Oikawa
By incorporating the spatial information in multichannel audio signals, our method trains deep neural networks (DNNs) to distinguish multiple sound-source objects.
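One common way to expose the spatial information in multichannel recordings to a DNN is through inter-channel phase differences (IPDs); a minimal sketch of computing them (illustrative only, not necessarily the features used in this paper):

```python
import numpy as np
from scipy.signal import stft

fs = 16000
t = np.arange(fs) / fs
delay = 8  # samples: the source arrives later at mic 2
src = np.sin(2 * np.pi * 440 * t)
mic1 = src
mic2 = np.roll(src, delay)

# STFT of each channel, then the phase difference per T-F bin.
_, _, S1 = stft(mic1, fs=fs, nperseg=512)
_, _, S2 = stft(mic2, fs=fs, nperseg=512)
ipd = np.angle(S1 * np.conj(S2))  # inter-channel phase difference

# At the tone's bin, the IPD encodes the inter-mic time delay,
# which is the cue a DNN can use to separate directions.
bin_440 = int(round(440 * 512 / fs))
print("median IPD at 440 Hz bin:", np.median(ipd[bin_440]))
```

For a delay of 8 samples at 440 Hz, the IPD is about 2*pi*440*8/16000 ≈ 1.38 rad; a multichannel network would consume such features (often alongside magnitudes) for all bins.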
no code implementations • 14 Feb 2020 • Yoshiki Masuyama, Kohei Yatabe, Yuma Koizumi, Yasuhiro Oikawa, Noboru Harada
In the proposed method, DNNs estimate phase derivatives instead of phase itself, which allows us to avoid the sensitivity problem.
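The phase derivatives such a DNN would regress can be computed directly from an STFT; a minimal numpy sketch (illustrative; details differ from the paper):

```python
import numpy as np
from scipy.signal import stft

fs, nperseg, hop = 16000, 256, 64
t = np.arange(fs) / fs
x = np.sin(2 * np.pi * 1050 * t)  # a tone between bin centers

_, _, S = stft(x, fs=fs, nperseg=nperseg, noverlap=nperseg - hop)
phase = np.angle(S)

def princarg(p):
    """Wrap phase to the principal value (-pi, pi]."""
    return np.angle(np.exp(1j * p))

# The two phase derivatives: across frames (related to instantaneous
# frequency) and across bins (related to group delay).
dphase_dt = princarg(np.diff(phase, axis=1))
dphase_df = princarg(np.diff(phase, axis=0))

# Sanity check: recover the tone frequency from the time derivative
# by measuring the deviation from the bin center's phase advance.
k = int(round(1050 * nperseg / fs))        # dominant bin
expected = 2 * np.pi * k * hop / nperseg   # bin-center advance per hop
dev = princarg(np.diff(phase[k]) - expected)
inst_freq = k * fs / nperseg + np.median(dev) * fs / (2 * np.pi * hop)
print("estimated tone frequency:", round(inst_freq, 1), "Hz")
```

Because the derivatives are smooth where the phase itself wraps chaotically, they make a far less sensitive regression target; the reconstruction then integrates the estimated derivatives back into a phase.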
1 code implementation • 25 Nov 2019 • Daiki Takeuchi, Kohei Yatabe, Yuma Koizumi, Yasuhiro Oikawa, Noboru Harada
Therefore, some end-to-end methods use a DNN to learn a linear T-F transform, which is much easier to interpret.
Audio and Speech Processing • Sound
no code implementations • 10 Mar 2019 • Yoshiki Masuyama, Kohei Yatabe, Yuma Koizumi, Yasuhiro Oikawa, Noboru Harada
This paper presents a novel method for reconstructing the phase only from a given amplitude spectrogram by combining a signal-processing-based approach and a deep neural network (DNN).
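For context, the classic signal-processing approach to phase reconstruction from an amplitude spectrogram is the Griffin-Lim algorithm, which alternates between the time and T-F domains; a minimal sketch (the well-known baseline, not the method proposed in the paper):

```python
import numpy as np
from scipy.signal import stft, istft

def griffin_lim(mag, n_iter=50, nperseg=256, fs=16000):
    """Estimate a phase consistent with the amplitude spectrogram
    `mag` by alternating projections (Griffin-Lim)."""
    rng = np.random.default_rng(0)
    S = mag * np.exp(2j * np.pi * rng.random(mag.shape))  # random phase
    for _ in range(n_iter):
        _, x = istft(S, fs=fs, nperseg=nperseg)       # to time domain
        _, _, S = stft(x, fs=fs, nperseg=nperseg)     # back to T-F
        S = mag * np.exp(1j * np.angle(S))            # keep given amplitude
    _, x = istft(S, fs=fs, nperseg=nperseg)
    return x

fs = 16000
tone = np.sin(2 * np.pi * 440 * np.arange(fs // 4) / fs)
_, _, S = stft(tone, fs=fs, nperseg=256)
rec = griffin_lim(np.abs(S))  # waveform rebuilt from amplitude only
```

Each iteration projects onto the set of consistent spectrograms (via ISTFT/STFT) and then onto the set with the target amplitude; a DNN-combined method can replace or initialize parts of this loop to converge faster and to a better phase.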