Search Results for author: Zheng Ding

Found 11 papers, 5 papers with code

Explorative Inbetweening of Time and Space

no code implementations21 Mar 2024 Haiwen Feng, Zheng Ding, Zhihao Xia, Simon Niklaus, Victoria Abrevaya, Michael J. Black, Xuaner Zhang

We introduce bounded generation as a generalized task to control video generation to synthesize arbitrary camera and subject motion based only on a given start and end frame.

Denoising Video Generation

Restoration by Generation with Constrained Priors

no code implementations28 Dec 2023 Zheng Ding, Xuaner Zhang, Zhuowen Tu, Zhihao Xia

We propose a method to adapt a pretrained diffusion model for image restoration by simply adding noise to the input image to be restored and then denoise.

Denoising Image Restoration

TokenCompose: Grounding Diffusion with Token-level Supervision

1 code implementation6 Dec 2023 ZiRui Wang, Zhizhou Sha, Zheng Ding, Yilin Wang, Zhuowen Tu

We present TokenCompose, a Latent Diffusion Model for text-to-image generation that achieves enhanced consistency between user-specified text prompts and model-generated images.

Denoising Object +1

Dolfin: Diffusion Layout Transformers without Autoencoder

no code implementations25 Oct 2023 Yilin Wang, Zeyuan Chen, Liangjun Zhong, Zheng Ding, Zhizhou Sha, Zhuowen Tu

In this paper, we introduce a novel generative model, Diffusion Layout Transformers without Autoencoder (Dolfin), which significantly improves the modeling capability with reduced complexity compared to existing methods.

Patched Denoising Diffusion Models For High-Resolution Image Synthesis

1 code implementation2 Aug 2023 Zheng Ding, Mengqi Zhang, Jiajun Wu, Zhuowen Tu

Feature collage systematically crops and combines partial features of the neighboring patches to predict the features of a shifted image patch, allowing the seamless generation of the entire image due to the overlap in the patch feature space.

Denoising Image Generation

DiffusionRig: Learning Personalized Priors for Facial Appearance Editing

1 code implementation CVPR 2023 Zheng Ding, Xuaner Zhang, Zhihao Xia, Lars Jebe, Zhuowen Tu, Xiuming Zhang

On a high level, DiffusionRig learns to map simplistic renderings of 3D face models to realistic photos of a given person.

MasQCLIP for Open-Vocabulary Universal Image Segmentation

1 code implementation ICCV 2023 Xin Xu, Tianyi Xiong, Zheng Ding, Zhuowen Tu

We present a new method for open-vocabulary universal image segmentation, which is capable of performing instance, semantic, and panoptic segmentation under a unified framework.

Image Segmentation Panoptic Segmentation +1

Point Cloud Recognition with Position-to-Structure Attention Transformers

no code implementations5 Oct 2022 Zheng Ding, James Hou, Zhuowen Tu

In this paper, we present Position-to-Structure Attention Transformers (PS-Former), a Transformer-based algorithm for 3D point cloud recognition.

Feature Engineering Position +2

Open-Vocabulary Universal Image Segmentation with MaskCLIP

1 code implementation18 Aug 2022 Zheng Ding, Jieke Wang, Zhuowen Tu

In this paper, we tackle an emerging computer vision task, open-vocabulary universal image segmentation, that aims to perform semantic/instance/panoptic segmentation (background semantic labeling + foreground instance segmentation) for arbitrary categories of text-based descriptions in inference time.

Image Segmentation Instance Segmentation +4

Guided Variational Autoencoder for Disentanglement Learning

no code implementations CVPR 2020 Zheng Ding, Yifan Xu, Weijian Xu, Gaurav Parmar, Yang Yang, Max Welling, Zhuowen Tu

We propose an algorithm, guided variational autoencoder (Guided-VAE), that is able to learn a controllable generative model by performing latent representation disentanglement learning.

Disentanglement General Classification +1

Cannot find the paper you are looking for? You can Submit a new open access paper.