no code implementations • 26 Jan 2024 • Ragib Amin Nihal, Benjamin Yen, Katsutoshi Itoyama, Kazuhiro Nakadai
The demand for accurate object detection in aerial imagery has surged with the widespread use of drones and satellite technology.
no code implementations • 21 Sep 2023 • Atsuo Hiroe, Katsutoshi Itoyama, Kazuhiro Nakadai
Via the experiments with the CHiME-3 dataset, we verify that the four BFs have the same peak performance as the upper bound provided by the ideal MWF BF, whereas the optimal mask depends on the adopted BF and differs from the IRM.
no code implementations • 15 Nov 2021 • Muhammad Shakeel, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai
We describe a novel metric-based learning approach that introduces a multimodal framework and uses deep audio and geophone encoders in siamese configuration to design an adaptable and lightweight supervised model.
no code implementations • 1 Apr 2021 • Shakeel Muhammad, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai
In the present study, we present an intelligent earthquake signal detector that provides added assistance to automate traditional disaster responses.
no code implementations • 22 Mar 2019 • Kazuki Shimada, Yoshiaki Bando, Masato Mimura, Katsutoshi Itoyama, Kazuyoshi Yoshii, Tatsuya Kawahara
To solve this problem, we take an unsupervised approach that decomposes each TF bin into the sum of speech and noise by using multichannel nonnegative matrix factorization (MNMF).
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
no code implementations • 31 Oct 2017 • Yoshiaki Bando, Masato Mimura, Katsutoshi Itoyama, Kazuyoshi Yoshii, Tatsuya Kawahara
This paper presents a statistical method of single-channel speech enhancement that uses a variational autoencoder (VAE) as a prior distribution on clean speech.
no code implementations • 7 Aug 2017 • Hiroaki Tsushima, Eita Nakamura, Katsutoshi Itoyama, Kazuyoshi Yoshii
Generative statistical models of chord sequences play crucial roles in music processing.
no code implementations • LREC 2016 • Koichiro Yoshino, Naoki Hirayama, Shinsuke Mori, Fumihiko Takahashi, Katsutoshi Itoyama, Hiroshi G. Okuno
Binary file summaries/549. html matches