Search Results for author: Seung Wook Kim

Found 20 papers, 6 papers with code

RefFusion: Reference Adapted Diffusion Models for 3D Scene Inpainting

no code implementations • 16 Apr 2024 • Ashkan Mirzaei, Riccardo de Lutio, Seung Wook Kim, David Acuna, Jonathan Kelly, Sanja Fidler, Igor Gilitschenski, Zan Gojcic

In this work, we propose an approach for 3D scene inpainting -- the task of coherently replacing parts of the reconstructed scene with desired content.

3D Inpainting Image Inpainting

Paper
Add Code

EmerDiff: Emerging Pixel-level Semantic Knowledge in Diffusion Models

no code implementations • 22 Jan 2024 • Koichi Namekata, Amirmojtaba Sabour, Sanja Fidler, Seung Wook Kim

Diffusion models have recently received increasing research attention for their remarkable transfer abilities in semantic segmentation tasks.

Segmentation Semantic Segmentation

Paper
Add Code

Align Your Gaussians: Text-to-4D with Dynamic 3D Gaussians and Composed Diffusion Models

no code implementations • 21 Dec 2023 • Huan Ling, Seung Wook Kim, Antonio Torralba, Sanja Fidler, Karsten Kreis

We also propose a motion amplification mechanism as well as a new autoregressive synthesis scheme to generate and combine multiple 4D sequences for longer generation.

Synthetic Data Generation Video Generation

Paper
Add Code

WildFusion: Learning 3D-Aware Latent Diffusion Models in View Space

no code implementations • 22 Nov 2023 • Katja Schwarz, Seung Wook Kim, Jun Gao, Sanja Fidler, Andreas Geiger, Karsten Kreis

Then, we train a diffusion model in the 3D-aware latent space, thereby enabling synthesis of high-quality 3D-consistent image samples, outperforming recent state-of-the-art GAN-based methods.

3D-Aware Image Synthesis Depth Estimation +2

Paper
Add Code

EmerNeRF: Emergent Spatial-Temporal Scene Decomposition via Self-Supervision

1 code implementation • 3 Nov 2023 • Jiawei Yang, Boris Ivanovic, Or Litany, Xinshuo Weng, Seung Wook Kim, Boyi Li, Tong Che, Danfei Xu, Sanja Fidler, Marco Pavone, Yue Wang

We present EmerNeRF, a simple yet powerful approach for learning spatial-temporal representations of dynamic driving scenes.

Optical Flow Estimation Semantic Segmentation

487

Paper
Code

DreamTeacher: Pretraining Image Backbones with Deep Generative Models

no code implementations • ICCV 2023 • Daiqing Li, Huan Ling, Amlan Kar, David Acuna, Seung Wook Kim, Karsten Kreis, Antonio Torralba, Sanja Fidler

In this work, we introduce a self-supervised feature representation learning framework DreamTeacher that utilizes generative networks for pre-training downstream image backbones.

Knowledge Distillation Representation Learning

Paper
Add Code

NeuralField-LDM: Scene Generation with Hierarchical Latent Diffusion Models

no code implementations • CVPR 2023 • Seung Wook Kim, Bradley Brown, Kangxue Yin, Karsten Kreis, Katja Schwarz, Daiqing Li, Robin Rombach, Antonio Torralba, Sanja Fidler

We first train a scene auto-encoder to express a set of image and pose pairs as a neural field, represented as density and feature voxel grids that can be projected to produce novel views of the scene.

Scene Generation

Paper
Add Code

Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models

2 code implementations • CVPR 2023 • Andreas Blattmann, Robin Rombach, Huan Ling, Tim Dockhorn, Seung Wook Kim, Sanja Fidler, Karsten Kreis

We first pre-train an LDM on images only; then, we turn the image generator into a video generator by introducing a temporal dimension to the latent space diffusion model and fine-tuning on encoded image sequences, i. e., videos.

Ranked #5 on Text-to-Video Generation on MSR-VTT (CLIP-FID metric)

Image Generation Text-to-Video Generation +3

22,262

Paper
Code

Polymorphic-GAN: Generating Aligned Samples across Multiple Domains with Learned Morph Maps

no code implementations • CVPR 2022 • Seung Wook Kim, Karsten Kreis, Daiqing Li, Antonio Torralba, Sanja Fidler

Modern image generative models show remarkable sample quality when trained on a single domain or class of objects.

Generative Adversarial Network Image-to-Image Translation +1

Paper
Add Code

BigDatasetGAN: Synthesizing ImageNet with Pixel-wise Annotations

no code implementations • CVPR 2022 • Daiqing Li, Huan Ling, Seung Wook Kim, Karsten Kreis, Adela Barriuso, Sanja Fidler, Antonio Torralba

By training an effective feature segmentation architecture on top of BigGAN, we turn BigGAN into a labeled dataset generator.

8k Generative Adversarial Network +1

Paper
Add Code

EditGAN: High-Precision Semantic Image Editing

1 code implementation • NeurIPS 2021 • Huan Ling, Karsten Kreis, Daiqing Li, Seung Wook Kim, Antonio Torralba, Sanja Fidler

EditGAN builds on a GAN framework that jointly models images and their semantic segmentations, requiring only a handful of labeled examples, making it a scalable tool for editing.

Segmentation Semantic Segmentation +1

623

Paper
Code

Visual TransforMatcher: Efficient Match-to-Match Attention for Visual Correspondence

no code implementations • 29 Sep 2021 • Seung Wook Kim, Juhong Min, Minsu Cho

Establishing correspondences between images remains a challenging task, especially under large appearance changes due to different viewpoints and intra-class variations.

Paper
Add Code

DriveGAN: Towards a Controllable High-Quality Neural Simulation

no code implementations • CVPR 2021 • Seung Wook Kim, Jonah Philion, Antonio Torralba, Sanja Fidler

Realistic simulators are critical for training and verifying robotics systems.

Vocal Bursts Intensity Prediction

Paper
Add Code

Self-supervised driven consistency training for annotation efficient histopathology image analysis

2 code implementations • 7 Feb 2021 • Chetan L. Srinidhi, Seung Wook Kim, Fu-Der Chen, Anne L. Martel

In this work, we overcome this challenge by leveraging both task-agnostic and task-specific unlabeled data based on two novel strategies: i) a self-supervised pretext task that harnesses the underlying multi-resolution contextual cues in histology whole-slide images to learn a powerful supervisory signal for unsupervised representation learning; ii) a new teacher-student semi-supervised consistency paradigm that learns to effectively transfer the pretrained representations to downstream tasks based on prediction consistency with the task-specific un-labeled data.

Histopathological Image Classification Representation Learning +1

Paper
Code

Variational Amodal Object Completion

no code implementations • NeurIPS 2020 • Huan Ling, David Acuna, Karsten Kreis, Seung Wook Kim, Sanja Fidler

In images of complex scenes, objects are often occluding each other which makes perception tasks such as object detection and tracking, or robotic control tasks such as planning, challenging.

Object object-detection +1

Paper
Add Code

Edge Network-Assisted Real-Time Object Detection Framework for Autonomous Driving

no code implementations • 17 Aug 2020 • Seung Wook Kim, Keunsoo Ko, Haneul Ko, Victor C. M. Leung

In an EODF, AVs extract the region of interests~(RoIs) of the captured image when the channel quality is not sufficiently good for supporting real-time OD.

Autonomous Driving object-detection +1

Paper
Add Code

Learning to Simulate Dynamic Environments with GameGAN

no code implementations • CVPR 2020 • Seung Wook Kim, Yuhao Zhou, Jonah Philion, Antonio Torralba, Sanja Fidler

Simulation is a crucial component of any robotic system.

Generative Adversarial Network

Paper
Add Code

The Shmoop Corpus: A Dataset of Stories with Loosely Aligned Summaries

1 code implementation • 30 Dec 2019 • Atef Chaudhury, Makarand Tapaswi, Seung Wook Kim, Sanja Fidler

Understanding stories is a challenging reading comprehension problem for machines as it requires reading a large volume of text and following long-range dependencies.

Abstractive Text Summarization Question Answering +1