1 code implementation • 14 Dec 2023 • Hyun-Jun Heo, Ui-Hyeop Shin, Ran Lee, YoungJu Cheon, Hyung-Min Park
Meanwhile, in vision tasks, ConvNet structures have been modernized by referring to Transformer, resulting in improved performance.
no code implementations • 13 Jun 2023 • Ui-Hyeop Shin, Hyung-Min Park
In this paper, we present a statistical beamforming algorithm as a pre-processing step for robust automatic speech recognition (ASR).
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
1 code implementation • 8 Apr 2023 • Jeongkyun Park, Kwanghee Choi, Hyunjun Heo, Hyung-Min Park
However, the pooling problem remains; the length of speech representations is inherently variable.
1 code implementation • 16 Jan 2023 • Jeongkyun Park, Jung-Wook Hwang, Kwanghee Choi, Seung-Hyun Lee, Jun Hwan Ahn, Rae-Hong Park, Hyung-Min Park
Inspired by humans comprehending speech in a multi-modal manner, various audio-visual datasets have been constructed.
1 code implementation • 25 Jun 2022 • Kwanghee Choi, Hyung-Min Park
Hence, we are motivated to distill the rich knowledge embedded inside a well-trained teacher text model to the student speech model.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
1 code implementation • 12 Apr 2019 • Jong-Hyeon Park, Myungwoo Oh, Hyung-Min Park
The latent variables allow us to convert the domain of speech according to its context and domain representation.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +6
no code implementations • 25 Aug 2015 • Changsoo Je, Hyung-Min Park
We propose a novel reflection color model consisting of body essence and (mixed) neuter, and present an effective method for separating dichromatic reflection components using a single image.