Search Results for author: Joon-Young Lee

Found 61 papers, 28 papers with code

MaGGIe: Masked Guided Gradual Human Instance Matting

no code implementations24 Apr 2024 Chuong Huynh, Seoung Wug Oh, Abhinav Shrivastava, Joon-Young Lee

Human matting is a foundation task in image and video processing, where human foreground pixels are extracted from the input.

IMIL: Interactive Medical Image Learning Framework

no code implementations17 Apr 2024 Adrit Rao, Andrea Fisher, Ken Chang, John Christopher Panagides, Katherine McNamara, Joon-Young Lee, Oliver Aalami

We propose the Interactive Medical Image Learning (IMIL) framework, a novel approach for improving the training of medical image analysis algorithms that enables clinician-guided intermediate training data augmentations on misprediction outliers, focusing the algorithm on relevant visual information.

Concept Weaver: Enabling Multi-Concept Fusion in Text-to-Image Models

no code implementations5 Apr 2024 Gihyun Kwon, Simon Jenni, DIngzeyu Li, Joon-Young Lee, Jong Chul Ye, Fabian Caba Heilbron

While there has been significant progress in customizing text-to-image generation models, generating images that combine multiple personalized concepts remains challenging.

Text-to-Image Generation

Putting the Object Back into Video Object Segmentation

1 code implementation19 Oct 2023 Ho Kei Cheng, Seoung Wug Oh, Brian Price, Joon-Young Lee, Alexander Schwing

We present Cutie, a video object segmentation (VOS) network with object-level memory reading, which puts the object representation from memory back into the video object segmentation result.

Object Segmentation +3

Tracking Anything with Decoupled Video Segmentation

1 code implementation ICCV 2023 Ho Kei Cheng, Seoung Wug Oh, Brian Price, Alexander Schwing, Joon-Young Lee

To 'track anything' without training on video data for every individual task, we develop a decoupled video segmentation approach (DEVA), composed of task-specific image-level segmentation and class/task-agnostic bi-directional temporal propagation.

 Ranked #1 on Unsupervised Video Object Segmentation on DAVIS 2016 val (using extra training data)

Open-Vocabulary Video Segmentation Open-World Video Segmentation +7

Studying the Impact of Augmentations on Medical Confidence Calibration

no code implementations23 Aug 2023 Adrit Rao, Joon-Young Lee, Oliver Aalami

In this paper, we evaluate the effects of three modern augmentation techniques, CutMix, MixUp, and CutOut on the calibration and performance of CNNs for medical tasks.

Decision Making

Long-range Multimodal Pretraining for Movie Understanding

no code implementations ICCV 2023 Dawit Mureja Argaw, Joon-Young Lee, Markus Woodson, In So Kweon, Fabian Caba Heilbron

While great progress has been attained, there is still a need for a pretrained multimodal model that can perform well in the ever-growing set of movie understanding tasks the community has been establishing.

XMem++: Production-level Video Segmentation From Few Annotated Frames

1 code implementation ICCV 2023 Maksym Bekuzarov, Ariana Bermudez, Joon-Young Lee, Hao Li

Despite advancements in user-guided video segmentation, extracting complex objects consistently for highly complex scenes is still a labor-intensive task, especially for production.

Segmentation Semantic Segmentation +3

INVE: Interactive Neural Video Editing

no code implementations15 Jul 2023 Jiahui Huang, Leonid Sigal, Kwang Moo Yi, Oliver Wang, Joon-Young Lee

We present Interactive Neural Video Editing (INVE), a real-time video editing solution, which can assist the video editing process by consistently propagating sparse frame edits to the entire video clip.

Video Editing

Tracking by Associating Clips

no code implementations20 Dec 2022 Sanghyun Woo, KwanYong Park, Seoung Wug Oh, In So Kweon, Joon-Young Lee

The tracking-by-detection paradigm today has become the dominant method for multi-object tracking and works by detecting objects in each frame and then performing data association across frames.

Chunking Management +2

Bridging Images and Videos: A Simple Learning Framework for Large Vocabulary Video Object Detection

no code implementations20 Dec 2022 Sanghyun Woo, KwanYong Park, Seoung Wug Oh, In So Kweon, Joon-Young Lee

First, no tracking supervisions are in LVIS, which leads to inconsistent learning of detection (with LVIS and TAO) and tracking (only with TAO).

Video Object Detection

Per-Clip Video Object Segmentation

1 code implementation CVPR 2022 KwanYong Park, Sanghyun Woo, Seoung Wug Oh, In So Kweon, Joon-Young Lee

In this per-clip inference scheme, we update the memory with an interval and simultaneously process a set of consecutive frames (i. e. clip) between the memory updates.

Object Segmentation +3

One-Trimap Video Matting

1 code implementation27 Jul 2022 Hongje Seong, Seoung Wug Oh, Brian Price, Euntai Kim, Joon-Young Lee

A key of OTVM is the joint modeling of trimap propagation and alpha prediction.

Image Matting Video Matting

VITA: Video Instance Segmentation via Object Token Association

1 code implementation9 Jun 2022 Miran Heo, Sukjun Hwang, Seoung Wug Oh, Joon-Young Lee, Seon Joo Kim

Specifically, we use an image object detector as a means of distilling object-specific contexts into object tokens.

Ranked #11 on Video Instance Segmentation on YouTube-VIS 2021 (using extra training data)

Instance Segmentation Object +2

Information-Theoretic Bias Reduction via Causal View of Spurious Correlation

no code implementations10 Jan 2022 Seonguk Seo, Joon-Young Lee, Bohyung Han

We propose an information-theoretic bias measurement technique through a causal interpretation of spurious correlation, which is effective to identify the feature-level algorithmic bias by taking advantage of conditional mutual information.

Face Recognition Fairness

Hierarchical Memory Matching Network for Video Object Segmentation

1 code implementation ICCV 2021 Hongje Seong, Seoung Wug Oh, Joon-Young Lee, Seongwon Lee, Suhyeon Lee, Euntai Kim

Based on a recent memory-based method [33], we propose two advanced memory read modules that enable us to perform memory reading in multiple scales while exploiting temporal smoothness.

Object Retrieval +3

Studying the Effects of Self-Attention for Medical Image Analysis

no code implementations2 Sep 2021 Adrit Rao, Jongchan Park, Sanghyun Woo, Joon-Young Lee, Oliver Aalami

The use of computer vision to automate the classification of medical images is widely studied.

Unsupervised Learning of Debiased Representations with Pseudo-Attributes

1 code implementation CVPR 2022 Seonguk Seo, Joon-Young Lee, Bohyung Han

Dataset bias is a critical challenge in machine learning since it often leads to a negative impact on a model due to the unintended decision rules captured by spurious correlations.

Attribute

Learning to Associate Every Segment for Video Panoptic Segmentation

no code implementations CVPR 2021 Sanghyun Woo, Dahun Kim, Joon-Young Lee, In So Kweon

Temporal correspondence - linking pixels or objects across frames - is a fundamental supervisory signal for the video models.

Ranked #6 on Video Panoptic Segmentation on Cityscapes-VPS (using extra training data)

Video Panoptic Segmentation

Polygonal Point Set Tracking

1 code implementation CVPR 2021 Gunhee Nam, Miran Heo, Seoung Wug Oh, Joon-Young Lee, Seon Joo Kim

Since the existing datasets are not suitable to validate our method, we build a new polygonal point set tracking dataset and demonstrate the superior performance of our method over the baselines and existing contour-based VOS methods.

Semantic Segmentation Video Object Segmentation +1

Video Panoptic Segmentation

1 code implementation CVPR 2020 Dahun Kim, Sanghyun Woo, Joon-Young Lee, In So Kweon

In this paper, we propose and explore a new video extension of this task, called video panoptic segmentation.

Ranked #7 on Video Panoptic Segmentation on Cityscapes-VPS (using extra training data)

Instance Segmentation Segmentation +5

History for Visual Dialog: Do we really need it?

2 code implementations ACL 2020 Shubham Agarwal, Trung Bui, Joon-Young Lee, Ioannis Konstas, Verena Rieser

Visual Dialog involves "understanding" the dialog history (what has been discussed previously) and the current question (what is asked), in addition to grounding information in the image, to generate the correct response.

Visual Dialog

Onion-Peel Networks for Deep Video Completion

1 code implementation ICCV 2019 Seoung Wug Oh, Sungho Lee, Joon-Young Lee, Seon Joo Kim

Given a set of reference images and a target image with holes, our network fills the hole by referring the contents in the reference images.

Video Inpainting

Learning Video Representations from Correspondence Proposals

2 code implementations CVPR 2019 Xingyu Liu, Joon-Young Lee, Hailin Jin

In particular, it can effectively learn representations for videos by mixing appearance and long-range motion with an RGB-only input.

Action Recognition In Videos

Deep Blind Video Decaptioning by Temporal Aggregation and Recurrence

1 code implementation CVPR 2019 Dahun Kim, Sanghyun Woo, Joon-Young Lee, In So Kweon

Blind video decaptioning is a problem of automatically removing text overlays and inpainting the occluded parts in videos without any input masks.

Video Denoising Video Inpainting +1

Fast User-Guided Video Object Segmentation by Interaction-and-Propagation Networks

1 code implementation CVPR 2019 Seoung Wug Oh, Joon-Young Lee, Ning Xu, Seon Joo Kim

We propose a new multi-round training scheme for the interactive video object segmentation so that the networks can learn how to understand the user's intention and update incorrect estimations during the training.

Interactive Video Object Segmentation Object +3

Video Object Segmentation using Space-Time Memory Networks

3 code implementations ICCV 2019 Seoung Wug Oh, Joon-Young Lee, Ning Xu, Seon Joo Kim

In our framework, the past frames with object masks form an external memory, and the current frame as the query is segmented using the mask information in the memory.

Ranked #4 on Interactive Video Object Segmentation on DAVIS 2017 (using extra training data)

Interactive Video Object Segmentation Object +3

GAPLE: Generalizable Approaching Policy LEarning for Robotic Object Searching in Indoor Environment

no code implementations21 Sep 2018 Xin Ye, Zhe Lin, Joon-Young Lee, Jianming Zhang, Shibin Zheng, Yezhou Yang

We study the problem of learning a generalizable action policy for an intelligent agent to actively approach an object of interest in an indoor environment solely from its visual inputs.

Semantic Segmentation Visual Navigation

Learning to Blend Photos

1 code implementation ECCV 2018 Wei-Chih Hung, Jianming Zhang, Xiaohui Shen, Zhe Lin, Joon-Young Lee, Ming-Hsuan Yang

Specifically, given a foreground image and a background image, our proposed method automatically generates a set of blending photos with scores that indicate the aesthetics quality with the proposed quality network and policy network.

Contemplating Visual Emotions: Understanding and Overcoming Dataset Bias

no code implementations ECCV 2018 Rameswar Panda, Jianming Zhang, Haoxiang Li, Joon-Young Lee, Xin Lu, Amit K. Roy-Chowdhury

While machine learning approaches to visual emotion recognition offer great promise, current methods consider training and testing models on small scale datasets covering limited visual emotion concepts.

Emotion Recognition

CBAM: Convolutional Block Attention Module

31 code implementations ECCV 2018 Sanghyun Woo, Jongchan Park, Joon-Young Lee, In So Kweon

We propose Convolutional Block Attention Module (CBAM), a simple yet effective attention module for feed-forward convolutional neural networks.

General Classification Image Classification

BAM: Bottleneck Attention Module

10 code implementations17 Jul 2018 Jongchan Park, Sanghyun Woo, Joon-Young Lee, In So Kweon

In this work, we focus on the effect of attention in general deep neural networks.

Neural Architecture Search

Distort-and-Recover: Color Enhancement using Deep Reinforcement Learning

no code implementations CVPR 2018 Jongchan Park, Joon-Young Lee, Donggeun Yoo, In So Kweon

In addition, we present a 'distort-and-recover' training scheme which only requires high-quality reference images for training instead of input and retouched image pairs.

reinforcement-learning Reinforcement Learning (RL)

The AdobeIndoorNav Dataset: Towards Deep Reinforcement Learning based Real-world Indoor Robot Visual Navigation

1 code implementation24 Feb 2018 Kaichun Mo, Haoxiang Li, Zhe Lin, Joon-Young Lee

Synthetic data suffers from domain gap to the real-world scenes while visual inputs rendered from 3D reconstructed scenes have undesired holes and artifacts.

Robotics

Gradient-based Camera Exposure Control for Outdoor Mobile Platforms

no code implementations24 Aug 2017 Inwook Shim, Tae-Hyun Oh, Joon-Young Lee, Jinwook Choi, Dong-Geol Choi, In So Kweon

We introduce a novel method to automatically adjust camera exposure for image processing and computer vision applications on mobile robot platforms.

Pedestrian Detection Stereo Matching +2

Physically-Based Rendering for Indoor Scene Understanding Using Convolutional Neural Networks

no code implementations CVPR 2017 Yinda Zhang, Shuran Song, Ersin Yumer, Manolis Savva, Joon-Young Lee, Hailin Jin, Thomas Funkhouser

One of the bottlenecks in training for better representations is the amount of available per-pixel ground truth data that is required for core scene understanding tasks such as semantic segmentation, normal prediction, and object edge detection.

Boundary Detection Edge Detection +4

Action-Driven Object Detection with Top-Down Visual Attentions

no code implementations20 Dec 2016 Donggeun Yoo, Sunggyun Park, Kyunghyun Paeng, Joon-Young Lee, In So Kweon

In this paper, we present an "action-driven" detection mechanism using our "top-down" visual attention model.

Object object-detection +1

Automatic Content-Aware Color and Tone Stylization

no code implementations CVPR 2016 Joon-Young Lee, Kalyan Sunkavalli, Zhe Lin, Xiaohui Shen, In So Kweon

We introduce a new technique that automatically generates diverse, visually compelling stylizations for a photograph in an unsupervised manner.

Style Transfer

Vision System and Depth Processing for DRC-HUBO+

no code implementations21 Sep 2015 Inwook Shim, Seunghak Shin, Yunsu Bok, Kyungdon Joo, Dong-Geol Choi, Joon-Young Lee, Jaesik Park, Jun-Ho Oh, In So Kweon

This paper presents a vision system and a depth processing algorithm for DRC-HUBO+, the winner of the DRC finals 2015.

object-detection Object Detection +1

One-Day Outdoor Photometric Stereo via Skylight Estimation

no code implementations CVPR 2015 Jiyoung Jung, Joon-Young Lee, In So Kweon

We present an outdoor photometric stereo method using images captured in a single day.

Fisher Kernel for Deep Neural Activations

no code implementations4 Dec 2014 Donggeun Yoo, Sunggyun Park, Joon-Young Lee, In So Kweon

In this paper, we present a straightforward framework for better image representation by combining the two approaches.

Color Transfer Using Probabilistic Moving Least Squares

no code implementations CVPR 2014 Youngbae Hwang, Joon-Young Lee, In So Kweon, Seon Joo Kim

This paper introduces a new color transfer method which is a process of transferring color of an image to match the color of another image of the same scene.

Cannot find the paper you are looking for? You can Submit a new open access paper.