Search Results for author: Yanhong Zeng

Found 11 papers, 8 papers with code

Make-It-Vivid: Dressing Your Animatable Biped Cartoon Characters from Text

no code implementations • 25 Mar 2024 • Junshu Tang, Yanhong Zeng, Ke Fan, Xuheng Wang, Bo Dai, Kai Chen, Lizhuang Ma

Creating and animating 3D biped cartoon characters is crucial and valuable in various applications.

Paper
Add Code

PIA: Your Personalized Image Animator via Plug-and-Play Modules in Text-to-Image Models

1 code implementation • 21 Dec 2023 • Yiming Zhang, Zhening Xing, Yanhong Zeng, Youqing Fang, Kai Chen

Recent advancements in personalized text-to-image (T2I) models have revolutionized content creation, empowering non-experts to generate stunning images with unique styles.

Image Animation

724

Paper
Code

A Task is Worth One Word: Learning with Task Prompts for High-Quality Versatile Image Inpainting

1 code implementation • 6 Dec 2023 • Junhao Zhuang, Yanhong Zeng, Wenran Liu, Chun Yuan, Kai Chen

This enables PowerPaint to accomplish various inpainting tasks by utilizing different task prompts, resulting in state-of-the-art performance.

Image Inpainting Object

6,578

Paper
Code

Degradation-Guided Meta-Restoration Network for Blind Super-Resolution

no code implementations • 3 Jul 2022 • Fuzhi Yang, Huan Yang, Yanhong Zeng, Jianlong Fu, Hongtao Lu

The extractor estimates the degradations in LR inputs and guides the meta-restoration modules to predict restoration parameters for different degradations on-the-fly.

Blind Super-Resolution Image Restoration +1

Paper
Add Code

Advancing High-Resolution Video-Language Representation with Large-Scale Video Transcriptions

1 code implementation • CVPR 2022 • Hongwei Xue, Tiankai Hang, Yanhong Zeng, Yuchong Sun, Bei Liu, Huan Yang, Jianlong Fu, Baining Guo

To enable VL pre-training, we jointly optimize the HD-VILA model by a hybrid Transformer that learns rich spatiotemporal features, and a multimodal Transformer that enforces interactions of the learned video features with diversified texts.

Ranked #16 on Video Retrieval on MSR-VTT

Retrieval Super-Resolution +4

437

Paper
Code

Improving Visual Quality of Image Synthesis by A Token-based Generator with Transformers

no code implementations • NeurIPS 2021 • Yanhong Zeng, Huan Yang, Hongyang Chao, Jianbo Wang, Jianlong Fu

Given a sequence of style tokens, the TokenGAN is able to control the image synthesis by assigning the styles to the content tokens by attention mechanism with a Transformer.

Image Generation

Paper
Add Code

3D Human Body Reshaping with Anthropometric Modeling

1 code implementation • 5 Apr 2021 • Yanhong Zeng, Jianlong Fu, Hongyang Chao

First, we calculate full-body anthropometric parameters from limited user inputs by imputation technique, and thus essential anthropometric parameters for 3D body reshaping can be obtained.

feature selection Imputation +1

333

Paper
Code

Aggregated Contextual Transformations for High-Resolution Image Inpainting

2 code implementations • 3 Apr 2021 • Yanhong Zeng, Jianlong Fu, Hongyang Chao, Baining Guo

For improving texture synthesis, we enhance the discriminator of AOT-GAN by training it with a tailored mask-prediction task.

Ranked #9 on Image Inpainting on Places2

Image Inpainting Texture Synthesis +1

4,219

Paper
Code

Learning Semantic-aware Normalization for Generative Adversarial Networks

1 code implementation • NeurIPS 2020 • Heliang Zheng, Jianlong Fu, Yanhong Zeng, Jiebo Luo, Zheng-Jun Zha

Such a model disentangles latent factors according to the semantic of feature channels by channel-/group- wise fusion of latent codes and feature channels.

Image Inpainting Unconditional Image Generation

Paper
Code

Learning Joint Spatial-Temporal Transformations for Video Inpainting

2 code implementations • ECCV 2020 • Yanhong Zeng, Jianlong Fu, Hongyang Chao

In this paper, we propose to learn a joint Spatial-Temporal Transformer Network (STTN) for video inpainting.

Ranked #5 on Seeing Beyond the Visible on KITTI360-EX

Seeing Beyond the Visible Video Inpainting

434

Paper
Code

Learning Pyramid-Context Encoder Network for High-Quality Image Inpainting

2 code implementations • CVPR 2019 • Yanhong Zeng, Jianlong Fu, Hongyang Chao, Baining Guo

As the missing content can be filled by attention transfer from deep to shallow in a pyramid fashion, both visual and semantic coherence for image inpainting can be ensured.

Image Inpainting Vocal Bursts Intensity Prediction

349

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.