Search Results for author: Taekyung Kim

Found 28 papers, 5 papers with code

HYPE: Hyperbolic Entailment Filtering for Underspecified Images and Texts

no code implementations26 Apr 2024 Wonjae Kim, Sanghyuk Chun, Taekyung Kim, Dongyoon Han, Sangdoo Yun

In an era where the volume of data drives the effectiveness of self-supervised learning, the specificity and clarity of data semantics play a crucial role in model training.

Leveraging Temporal Contextualization for Video Action Recognition

no code implementations15 Apr 2024 Minji Kim, Dongyoon Han, Taekyung Kim, Bohyung Han

We propose Temporal Contextualization (TC), a novel layer-wise temporal information infusion mechanism for video that extracts core information from each frame, interconnects relevant information across the video to summarize into context tokens, and ultimately leverages the context tokens during the feature encoding process.

Action Recognition Temporal Action Localization +1

Masked Image Modeling via Dynamic Token Morphing

no code implementations30 Dec 2023 Taekyung Kim, Dongyoon Han, Byeongho Heo

Masked Image Modeling (MIM) arises as a promising option for Vision Transformers among various self-supervised learning (SSL) methods.

Fine-Grained Image Classification Self-Supervised Learning

Learning with Unmasked Tokens Drives Stronger Vision Learners

no code implementations20 Oct 2023 Taekyung Kim, Sanghyuk Chun, Byeongho Heo, Dongyoon Han

MIMs such as Masked Autoencoder (MAE) learn strong representations by randomly masking input tokens for the encoder to process, with the decoder reconstructing the masked tokens to the input.

Attribute Fine-Grained Image Classification +2

METAVerse: Meta-Learning Traversability Cost Map for Off-Road Navigation

no code implementations26 Jul 2023 Junwon Seo, Taekyung Kim, Seongyong Ahn, Kiho Kwak

To conduct a comprehensive evaluation, we collect driving data from various terrains and demonstrate that our method can obtain a global model that minimizes uncertainty.

Autonomous Navigation Meta-Learning

Safe Navigation in Unstructured Environments by Minimizing Uncertainty in Control and Perception

no code implementations26 Jun 2023 Junwon Seo, Jungwi Mun, Taekyung Kim

We train a vehicle dynamics model that can quantify the epistemic uncertainty of the model to perform active exploration, resulting in the efficient collection of training data and effective avoidance of uncertain state-action spaces.

Meta-Learning

Bridging Active Exploration and Uncertainty-Aware Deployment Using Probabilistic Ensemble Neural Network Dynamics

no code implementations20 May 2023 Taekyung Kim, Jungwi Mun, Junwon Seo, Beomsu Kim, Seongil Hong

Active exploration, in which a robot directs itself to states that yield the highest information gain, is essential for efficient data collection and minimizing human supervision.

Autonomous Vehicles Model-based Reinforcement Learning

What Do Self-Supervised Vision Transformers Learn?

1 code implementation1 May 2023 Namuk Park, Wonjae Kim, Byeongho Heo, Taekyung Kim, Sangdoo Yun

We present a comparative study on how and why contrastive learning (CL) and masked image modeling (MIM) differ in their representations and in their performance of downstream tasks.

Contrastive Learning

Learning Terrain-Aware Kinodynamic Model for Autonomous Off-Road Rally Driving With Model Predictive Path Integral Control

no code implementations1 May 2023 Hojin Lee, Taekyung Kim, Jungwi Mun, Wonsuk Lee

High-speed autonomous driving in off-road environments has immense potential for various applications, but it also presents challenges due to the complexity of vehicle-terrain interactions.

Autonomous Driving

Panoramic Image-to-Image Translation

no code implementations11 Apr 2023 Soohyun Kim, Junho Kim, Taekyung Kim, Hwan Heo, Seungryong Kim, Jiyoung Lee, Jin-Hwa Kim

This task is difficult due to the geometric distortion of panoramic images and the lack of a panoramic image dataset with diverse conditions, like weather or time.

Image-to-Image Translation Translation

Robust Camera Pose Refinement for Multi-Resolution Hash Encoding

no code implementations3 Feb 2023 Hwan Heo, Taekyung Kim, Jiyoung Lee, Jaewon Lee, Soohyun Kim, Hyunwoo J. Kim, Jin-Hwa Kim

Multi-resolution hash encoding has recently been proposed to reduce the computational cost of neural renderings, such as NeRF.

Neural Rendering Novel View Synthesis

Self-Supervised 3D Traversability Estimation with Proxy Bank Guidance

no code implementations21 Nov 2022 Jihwan Bae, Junwon Seo, Taekyung Kim, Hae-Gon Jeon, Kiho Kwak, Inwook Shim

To mitigate the uncertainty, we introduce a deep metric learning-based method to incorporate unlabeled data with a few positive and negative prototypes in order to leverage the uncertainty, which jointly learns using semantic segmentation and traversability regression.

Metric Learning regression +2

ScaTE: A Scalable Framework for Self-Supervised Traversability Estimation in Unstructured Environments

no code implementations14 Sep 2022 Junwon Seo, Taekyung Kim, Kiho Kwak, Jihong Min, Inwook Shim

By integrating our framework with a model predictive controller, we demonstrate that estimated traversability results in effective navigation that enables distinct maneuvers based on the driving characteristics of the vehicles.

Autonomous Vehicles

Residual-Guided Learning Representation for Self-Supervised Monocular Depth Estimation

no code implementations8 Nov 2021 Byeongjun Park, Taekyung Kim, Hyojun Go, Changick Kim

In this paper, we propose residual guidance loss that enables the depth estimation network to embed the discriminative feature by transferring the discriminability of auto-encoded features.

Monocular Depth Estimation Self-Supervised Learning

Multi-view Arbitrary Style Transfer

no code implementations1 Jan 2021 Taekyung Kim, Changick Kim

We propose a photometric consistency loss, which directly enforces the geometrically consistent style texture across the view, and a stroke consistency loss, which matches the characteristics and directions of the brushstrokes by aligning the local patches of the corresponding pixels before minimizing feature deviation.

Style Transfer

Just a Few Points Are All You Need for Multi-View Stereo: A Novel Semi-Supervised Learning Method for Multi-View Stereo

no code implementations ICCV 2021 Taekyung Kim, Jaehoon Choi, Seokeon Choi, Dongki Jung, Changick Kim

We generate the spare ground truth of the DTU dataset for evaluation and extensive experiments verify that our SGT-MVSNet outperforms the state-of-the-art MVS methods on the sparse ground truth setting.

3D Reconstruction

Meta Batch-Instance Normalization for Generalizable Person Re-Identification

1 code implementation CVPR 2021 Seokeon Choi, Taekyung Kim, Minki Jeong, Hyoungseob Park, Changick Kim

To this end, we combine learnable batch-instance normalization layers with meta-learning and investigate the challenging cases caused by both batch and instance normalization layers.

Data Augmentation Domain Generalization +2

Attract, Perturb, and Explore: Learning a Feature Alignment Network for Semi-supervised Domain Adaptation

2 code implementations ECCV 2020 Taekyung Kim, Changick Kim

Finally, the exploration scheme locally aligns features in a class-wise manner complementary to the attraction scheme by selectively aligning unlabeled target features complementary to the perturbation scheme.

Semi-supervised Domain Adaptation Unsupervised Domain Adaptation

Learning to Align Multi-Camera Domains using Part-Aware Clustering for Unsupervised Video Person Re-Identification

no code implementations29 Sep 2019 Youngeun Kim, Seokeon Choi, Taekyung Kim, Sumin Lee, Changick Kim

Since the cost of labeling increases dramatically as the number of cameras increases, it is difficult to apply the re-identification algorithm to a large camera network.

Clustering Metric Learning +2

RPM-Net: Robust Pixel-Level Matching Networks for Self-Supervised Video Object Segmentation

no code implementations29 Sep 2019 Youngeun Kim, Seokeon Choi, Hankyeol Lee, Taekyung Kim, Changick Kim

In this paper, we introduce a self-supervised approach for video object segmentation without human labeled data. Specifically, we present Robust Pixel-level Matching Net-works (RPM-Net), a novel deep architecture that matches pixels between adjacent frames, using only color information from unlabeled videos for training.

Object Segmentation +3

Self-Training and Adversarial Background Regularization for Unsupervised Domain Adaptive One-Stage Object Detection

no code implementations ICCV 2019 Seunghyeon Kim, Jaehoon Choi, Taekyung Kim, Changick Kim

Experimental results show that our approach effectively improves the performance of the one-stage object detection in unsupervised domain adaptation setting.

Object object-detection +2

Pseudo-Labeling Curriculum for Unsupervised Domain Adaptation

no code implementations1 Aug 2019 Jaehoon Choi, Minki Jeong, Taekyung Kim, Changick Kim

To learn target discriminative representations, using pseudo-labels is a simple yet effective approach for unsupervised domain adaptation.

Clustering Semi-Supervised Image Classification +1

Cannot find the paper you are looking for? You can Submit a new open access paper.