Search Results for author: Hyung-Min Park

Found 7 papers, 5 papers with code

NeXt-TDNN: Modernizing Multi-Scale Temporal Convolution Backbone for Speaker Verification

1 code implementation • 14 Dec 2023 • Hyun-Jun Heo, Ui-Hyeop Shin, Ran Lee, YoungJu Cheon, Hyung-Min Park

Meanwhile, in vision tasks, ConvNet structures have been modernized by referring to Transformer, resulting in improved performance.

Speaker Verification

Statistical Beamformer Exploiting Non-stationarity and Sparsity with Spatially Constrained ICA for Robust Speech Recognition

no code implementations • 13 Jun 2023 • Ui-Hyeop Shin, Hyung-Min Park

In this paper, we present a statistical beamforming algorithm as a pre-processing step for robust automatic speech recognition (ASR).

Automatic Speech Recognition (ASR) +2

Unsupervised Speech Representation Pooling Using Vector Quantization

1 code implementation • 8 Apr 2023 • Jeongkyun Park, Kwanghee Choi, Hyunjun Heo, Hyung-Min Park

However, the pooling problem remains; the length of speech representations is inherently variable.

Emotion Recognition, Intent Classification +4

OLKAVS: An Open Large-Scale Korean Audio-Visual Speech Dataset

1 code implementation • 16 Jan 2023 • Jeongkyun Park, Jung-Wook Hwang, Kwanghee Choi, Seung-Hyun Lee, Jun Hwan Ahn, Rae-Hong Park, Hyung-Min Park

Inspired by humans comprehending speech in a multi-modal manner, various audio-visual datasets have been constructed.

Audio-Visual Speech Recognition, Lip Reading +3

Distilling a Pretrained Language Model to a Multilingual ASR Model

1 code implementation • 25 Jun 2022 • Kwanghee Choi, Hyung-Min Park

Hence, we are motivated to distill the rich knowledge embedded inside a well-trained teacher text model to the student speech model.

Automatic Speech Recognition (ASR) +2

Unsupervised Speech Domain Adaptation Based on Disentangled Representation Learning for Robust Speech Recognition

1 code implementation • 12 Apr 2019 • Jong-Hyeon Park, Myungwoo Oh, Hyung-Min Park

The latent variables allow us to convert the domain of speech according to its context and domain representation.

Automatic Speech Recognition (ASR) +6

BREN: Body Reflection Essence-Neuter Model for Separation of Reflection Components

no code implementations • 25 Aug 2015 • Changsoo Je, Hyung-Min Park

We propose a novel reflection color model consisting of body essence and (mixed) neuter, and present an effective method for separating dichromatic reflection components using a single image.
