Search Results for author: Haomiao Ni

Found 11 papers, 7 papers with code

TI2V-Zero: Zero-Shot Image Conditioning for Text-to-Video Diffusion Models

no code implementations25 Apr 2024 Haomiao Ni, Bernhard Egger, Suhas Lohit, Anoop Cherian, Ye Wang, Toshiaki Koike-Akino, Sharon X. Huang, Tim K. Marks

To guide video generation with the additional image input, we propose a "repeat-and-slide" strategy that modulates the reverse denoising process, allowing the frozen diffusion model to synthesize a video frame-by-frame starting from the provided image.

Denoising Image to Video Generation

3D-Aware Talking-Head Video Motion Transfer

no code implementations5 Nov 2023 Haomiao Ni, Jiachen Liu, Yuan Xue, Sharon X. Huang

In this paper, we propose a novel 3D-aware talking-head video motion transfer network, Head3D, which fully exploits the subject appearance information by generating a visually-interpretable 3D canonical head from the 2D subject frames with a recurrent network.

Novel View Synthesis

Synthetic Augmentation with Large-scale Unconditional Pre-training

1 code implementation8 Aug 2023 Jiarong Ye, Haomiao Ni, Peng Jin, Sharon X. Huang, Yuan Xue

To further reduce the dependency on annotated data, we propose a synthetic augmentation method called HistoDiffusion, which can be pre-trained on large-scale unlabeled datasets and later applied to a small-scale labeled dataset for augmented training.

Exploring Compositional Visual Generation with Latent Classifier Guidance

no code implementations25 Apr 2023 Changhao Shi, Haomiao Ni, Kai Li, Shaobo Han, Mingfu Liang, Martin Renqiang Min

We show that this paradigm based on latent classifier guidance is agnostic to pre-trained generative models, and present competitive results for both image generation and sequential manipulation of real and synthetic images.

Image Generation

Conditional Image-to-Video Generation with Latent Flow Diffusion Models

1 code implementation CVPR 2023 Haomiao Ni, Changhao Shi, Kai Li, Sharon X. Huang, Martin Renqiang Min

In this paper, we propose an approach for cI2V using novel latent flow diffusion models (LFDM) that synthesize an optical flow sequence in the latent space based on the given condition to warp the given image.

Image to Video Generation Optical Flow Estimation

Semi-supervised Body Parsing and Pose Estimation for Enhancing Infant General Movement Assessment

2 code implementations14 Oct 2022 Haomiao Ni, Yuan Xue, Liya Ma, Qian Zhang, Xiaoye Li, Xiaolei Huang

We collected a new clinical IMV dataset with GMA annotations, and our experiments show that SPN models for body parsing and pose estimation trained on the first two datasets generalize well to the new clinical dataset and their results can significantly boost the CRNN-based GMA prediction performance.

Data Augmentation Generative Adversarial Network +1

Cross-identity Video Motion Retargeting with Joint Transformation and Synthesis

1 code implementation2 Oct 2022 Haomiao Ni, Yihao Liu, Sharon X. Huang, Yuan Xue

The novel design of dual branches combines the strengths of deformation-grid-based transformation and warp-free generation for better identity preservation and robustness to occlusion in the synthesized videos.

motion retargeting

Asymmetry Disentanglement Network for Interpretable Acute Ischemic Stroke Infarct Segmentation in Non-Contrast CT Scans

1 code implementation30 Jun 2022 Haomiao Ni, Yuan Xue, Kelvin Wong, John Volpi, Stephen T. C. Wong, James Z. Wang, Xiaolei Huang

In this paper, we propose a novel Asymmetry Disentanglement Network (ADN) to automatically separate pathological asymmetries and intrinsic anatomical asymmetries in NCCTs for more effective and interpretable AIS segmentation.

Disentanglement Segmentation

SiamParseNet: Joint Body Parsing and Label Propagation in Infant Movement Videos

1 code implementation16 Jul 2020 Haomiao Ni, Yuan Xue, Qian Zhang, Xiaolei Huang

In this paper, we propose a semi-supervised body parsing model, termed SiamParseNet (SPN), to jointly learn single frame body parsing and label propagation between frames in a semi-supervised fashion.

Cannot find the paper you are looking for? You can Submit a new open access paper.