no code implementations • 22 Mar 2024 • Geon Yeong Park, Hyeonho Jeong, Sang Wan Lee, Jong Chul Ye
The evolution of diffusion models has greatly impacted video generation and understanding.
no code implementations • 18 Mar 2024 • Hyeonho Jeong, Jinho Chang, Geon Yeong Park, Jong Chul Ye
Text-driven diffusion-based video editing presents a unique challenge not encountered in image editing literature: establishing real-world motion.
no code implementations • 18 Mar 2024 • Jeongsol Kim, Geon Yeong Park, Jong Chul Ye
Reverse sampling and score-distillation have emerged as main workhorses in recent years for image manipulation using latent diffusion models (LDMs).
no code implementations • 1 Dec 2023 • Hyeonho Jeong, Geon Yeong Park, Jong Chul Ye
Text-to-video diffusion models have advanced video generation significantly.
no code implementations • 30 Nov 2023 • Hyelin Nam, Gihyun Kwon, Geon Yeong Park, Jong Chul Ye
A promising recent approach in this realm is Delta Denoising Score (DDS) - an image editing technique based on Score Distillation Sampling (SDS) framework that leverages the rich generative prior of text-to-image diffusion models.
no code implementations • 27 Nov 2023 • Jeongsol Kim, Geon Yeong Park, Hyungjin Chung, Jong Chul Ye
The recent advent of diffusion models has led to significant progress in solving inverse problems, leveraging these models as effective generative priors.
1 code implementation • NeurIPS 2023 • Geon Yeong Park, Jeongsol Kim, Beomsu Kim, Sang Wan Lee, Jong Chul Ye
Despite the remarkable performance of text-to-image diffusion models in image generation tasks, recent studies have raised the issue that generated images sometimes cannot capture the intended semantic contents of the text prompts, which phenomenon is often called semantic misalignment.
no code implementations • 11 Oct 2022 • Geon Yeong Park, Chanyong Jung, Sangmin Lee, Jong Chul Ye, Sang Wan Lee
Specifically, we first pretrain a biased encoder in a self-supervised manner with the rank regularization, serving as a semantic bottleneck to enforce the encoder to learn the spuriously correlated attributes.
1 code implementation • CVPR 2023 • Geon Yeong Park, Sangmin Lee, Sang Wan Lee, Jong Chul Ye
Neural networks are often biased to spuriously correlated features that provide misleading statistical evidence that does not generalize.
Ranked #2 on Facial Attribute Classification on bFFHQ
no code implementations • ICCV 2021 • Geon Yeong Park, Sang Wan Lee
Here we adopt an information-theoretic approach to identify and resolve the potential adverse effect of the multiple domain discriminators on MDA: disintegration of domain-discriminative information, limited computational scalability, and a large variance in the gradient of the loss during training.
no code implementations • ICCV 2021 • Geon Yeong Park, Sang Wan Lee
To overcome such limitations, we deviate from the existing input-space-based adversarial training regime and propose a single-step latent adversarial training method (SLAT), which leverages the gradients of latent representation as the latent adversarial perturbation.
no code implementations • 1 Jan 2021 • Geon Yeong Park, Sang Wan Lee
Our framework shows that the information shared across domains cannot be gleaned with multiple discriminators.