no code implementations • CVPR 2022 • Bumsoo Kim, Jonghwan Mun, Kyoung-Woon On, Minchul Shin, Junhyun Lee, Eun-Sol Kim
Human-Object Interaction (HOI) detection is the task of identifying a set of <human, object, interaction> triplets from an image.
1 code implementation • NeurIPS 2023 • Changdae Oh, Junhyuk So, Hoyoon Byun, Yongtaek Lim, Minchul Shin, Jong-June Jeon, Kyungwoo Song
Such a lack of alignment and uniformity might restrict the transferability and robustness of embeddings.
1 code implementation • 14 Jan 2022 • Jonghwan Mun, Minchul Shin, Gunsoo Han, Sangho Lee, Seongsu Ha, Joonseok Lee, Eun-Sol Kim
Inspired by this, we tackle video scene segmentation, the task of temporally localizing scene boundaries in a video, with a self-supervised learning framework in which we mainly focus on designing effective pretext tasks.
no code implementations • 13 Oct 2021 • Minchul Shin, Jonghwan Mun, Kyoung-Woon On, Woo-Young Kang, Gunsoo Han, Eun-Sol Kim
The VALUE (Video-And-Language Understanding Evaluation) benchmark is newly introduced to evaluate and analyze multi-modal representation learning algorithms on three video-and-language tasks: Retrieval, QA, and Captioning.
no code implementations • 29 Sep 2021 • Jonghwan Mun, Minchul Shin, Gunsoo Han, Sangho Lee, Seongsu Ha, Joonseok Lee, Eun-Sol Kim
Inspired by this, we tackle video scene segmentation, the task of temporally localizing scene boundaries in a video, with a self-supervised learning framework in which we mainly focus on designing effective pretext tasks.
2 code implementations • 1 Jun 2021 • Geonmo Gu, Byungsoo Ko, SeoungHyun Go, Sung-Hyun Lee, Jingeun Lee, Minchul Shin
In this paper, we propose Mobile LSD (M-LSD), a real-time, lightweight line segment detector for resource-constrained environments.
Ranked #6 on Line Segment Detection on York Urban Dataset
2 code implementations • 7 Apr 2021 • Minchul Shin, Yoonjae Cho, Byungsoo Ko, Geonmo Gu
In this paper, we study the compositional learning of images and texts for image retrieval.
Ranked #14 on Image Retrieval on Fashion IQ
no code implementations • 21 Dec 2020 • Francis X. Diebold, Minchul Shin, Boyuan Zhang
We propose methods for constructing regularized mixtures of density forecasts.
no code implementations • ECCV 2020 • Minchul Shin
This paper presents a study on semi-supervised learning to solve the visual attribute prediction problem.
no code implementations • 13 Jul 2020 • Minchul Shin, Yoonjae Cho, Seongwuk Hong
This paper describes team VAA's approach submitted to the Fashion-IQ challenge at CVPR 2020.
no code implementations • 27 Jul 2019 • Byungsoo Ko, Minchul Shin, Geonmo Gu, HeeJae Jun, Tae Kwan Lee, Youngjoon Kim
Many studies have been performed on metric learning, which has become a key ingredient in top-performing methods of instance-level image retrieval.
no code implementations • 11 Jul 2019 • Minchul Shin, Sanghyuk Park, Taeksoo Kim
FAM is a challenging task in that the attributes are hard to define and the unique characteristics of a query are hard to preserve.
no code implementations • 14 Nov 2016 • Minchul Shin, Munsang Kim, Dong-Soo Kwon
The experimental results showed that a three-layer structure consisting of a simple convolutional layer and a max-pooling layer, with histogram-equalized image input, was the most efficient.
Facial Expression Recognition (FER)