Search Results for author: Han-Gyu Kim

Found 6 papers, 3 papers with code

DialogCC: An Automated Pipeline for Creating High-Quality Multi-Modal Dialogue Dataset

1 code implementation8 Dec 2022 Young-Jun Lee, Byungsoo Ko, Han-Gyu Kim, Jonghwan Hyeon, Ho-Jin Choi

Through this pipeline, we introduce DialogCC, a high-quality and diverse multi-modal dialogue dataset that surpasses existing datasets in terms of quality and diversity in human evaluation.

Retrieval Text Retrieval

Group Generalized Mean Pooling for Vision Transformer

no code implementations8 Dec 2022 Byungsoo Ko, Han-Gyu Kim, Byeongho Heo, Sangdoo Yun, Sanghyuk Chun, Geonmo Gu, Wonjae Kim

As ViT groups the channels via a multi-head attention mechanism, grouping the channels by GGeM leads to lower head-wise dependence while amplifying important channels on the activation maps.

Image Retrieval Representation Learning +1

Back from the future: bidirectional CTC decoding using future information in speech recognition

no code implementations7 Oct 2021 Namkyu Jung, Geonmin Kim, Han-Gyu Kim

In this paper, we propose a simple but effective method to decode the output of Connectionist Temporal Classifier (CTC) model using a bi-directional neural language model.

Language Modelling speech-recognition +1

Learning with Memory-based Virtual Classes for Deep Metric Learning

1 code implementation ICCV 2021 Byungsoo Ko, Geonmo Gu, Han-Gyu Kim

This can be undesirable for DML, where training and test data exhibit entirely different classes.

Metric Learning

Proxy Synthesis: Learning with Synthetic Classes for Deep Metric Learning

2 code implementations29 Mar 2021 Geonmo Gu, Byungsoo Ko, Han-Gyu Kim

One of the main purposes of deep metric learning is to construct an embedding space that has well-generalized embeddings on both seen (training) classes and unseen (test) classes.

Image Retrieval Metric Learning +1

Audio Source Separation Using a Deep Autoencoder

no code implementations22 Dec 2014 Giljin Jang, Han-Gyu Kim, Yung-Hwan Oh

This paper proposes a novel framework for unsupervised audio source separation using a deep autoencoder.

Audio Source Separation Clustering

Cannot find the paper you are looking for? You can Submit a new open access paper.