Search Results for author: Yichun Shi

Found 32 papers, 17 papers with code

Enhancing 3D Fidelity of Text-to-3D using Cross-View Correspondences

no code implementations • 16 Apr 2024 • SeungWook Kim, Kejie Li, Xueqing Deng, Yichun Shi, Minsu Cho, Peng Wang

Leveraging multi-view diffusion models as priors for 3D optimization have alleviated the problem of 3D consistency, e. g., the Janus face problem or the content drift problem, in zero-shot text-to-3D models.

Common Sense Reasoning Text to 3D

Paper
Add Code

HQ-Edit: A High-Quality Dataset for Instruction-based Image Editing

no code implementations • 15 Apr 2024 • Mude Hui, Siwei Yang, Bingchen Zhao, Yichun Shi, Heng Wang, Peng Wang, Yuyin Zhou, Cihang Xie

This study introduces HQ-Edit, a high-quality instruction-based image editing dataset with around 200, 000 edits.

Attribute

Paper
Add Code

Magic-Boost: Boost 3D Generation with Mutli-View Conditioned Diffusion

no code implementations • 9 Apr 2024 • Fan Yang, Jianfeng Zhang, Yichun Shi, Bowen Chen, Chenxu Zhang, Huichao Zhang, Xiaofeng Yang, Jiashi Feng, Guosheng Lin

Benefiting from the rapid development of 2D diffusion models, 3D content creation has made significant progress recently.

3D Generation

Paper
Add Code

X-Portrait: Expressive Portrait Animation with Hierarchical Motion Attention

no code implementations • 23 Mar 2024 • You Xie, Hongyi Xu, Guoxian Song, Chao Wang, Yichun Shi, Linjie Luo

We propose X-Portrait, an innovative conditional diffusion model tailored for generating expressive and temporally coherent portrait animation.

Disentanglement

Paper
Add Code

DiffPortrait3D: Controllable Diffusion for Zero-Shot Portrait View Synthesis

1 code implementation • 20 Dec 2023 • Yuming Gu, You Xie, Hongyi Xu, Guoxian Song, Yichun Shi, Di Chang, Jing Yang, Linjie Luo

The rendering view is then manipulated with a novel conditional control module that interprets the camera pose by watching a condition image of a crossed subject from the same view.

Denoising

Paper
Code

ImageDream: Image-Prompt Multi-view Diffusion for 3D Generation

no code implementations • 2 Dec 2023 • Peng Wang, Yichun Shi

We introduce "ImageDream," an innovative image-prompt, multi-view diffusion model for 3D object generation.

3D Generation Object

Paper
Add Code

MagicPose: Realistic Human Poses and Facial Expressions Retargeting with Identity-aware Diffusion

2 code implementations • 18 Nov 2023 • Di Chang, Yichun Shi, Quankai Gao, Jessica Fu, Hongyi Xu, Guoxian Song, Qing Yan, Yizhe Zhu, Xiao Yang, Mohammad Soleymani

In this work, we propose MagicPose, a diffusion-based model for 2D human pose and facial expression retargeting.

Video Generation

903

Paper
Code

Consistent-1-to-3: Consistent Image to 3D View Synthesis via Geometry-aware Diffusion Models

no code implementations • 4 Oct 2023 • Jianglong Ye, Peng Wang, Kejie Li, Yichun Shi, Heng Wang

Specifically, we decompose the NVS task into two stages: (i) transforming observed regions to a novel view, and (ii) hallucinating unseen regions.

Image to 3D Novel View Synthesis

Paper
Add Code

MVDream: Multi-view Diffusion for 3D Generation

2 code implementations • 31 Aug 2023 • Yichun Shi, Peng Wang, Jianglong Ye, Mai Long, Kejie Li, Xiao Yang

We introduce MVDream, a diffusion model that is able to generate consistent multi-view images from a given text prompt.

3D Generation

647

Paper
Code

OmniAvatar: Geometry-Guided Controllable 3D Head Synthesis

no code implementations • CVPR 2023 • Hongyi Xu, Guoxian Song, Zihang Jiang, Jianfeng Zhang, Yichun Shi, Jing Liu, WanChun Ma, Jiashi Feng, Linjie Luo

We present OmniAvatar, a novel geometry-guided 3D head synthesis model trained from in-the-wild unstructured images that is capable of synthesizing diverse identity-preserved 3D heads with compelling dynamic details under full disentangled control over camera poses, facial expressions, head shapes, articulated neck and jaw poses.

Paper
Add Code

PAniC-3D: Stylized Single-view 3D Reconstruction from Portraits of Anime Characters

1 code implementation • CVPR 2023 • Shuhong Chen, Kevin Zhang, Yichun Shi, Heng Wang, Yiheng Zhu, Guoxian Song, Sizhe An, Janus Kristjansson, Xiao Yang, Matthias Zwicker

We propose PAniC-3D, a system to reconstruct stylized 3D character heads directly from illustrated (p)ortraits of (ani)me (c)haracters.

3D Architecture 3D Reconstruction +1

695

Paper
Code

AgileGAN3D: Few-Shot 3D Portrait Stylization by Augmented Transfer Learning

no code implementations • 24 Mar 2023 • Guoxian Song, Hongyi Xu, Jing Liu, Tiancheng Zhi, Yichun Shi, Jianfeng Zhang, Zihang Jiang, Jiashi Feng, Shen Sang, Linjie Luo

Capitalizing on the recent advancement of 3D-aware GAN models, we perform \emph{guided transfer learning} on a pretrained 3D GAN generator to produce multi-view-consistent stylized renderings.

Transfer Learning

Paper
Add Code

PanoHead: Geometry-Aware 3D Full-Head Synthesis in 360$^{\circ}$

1 code implementation • 23 Mar 2023 • Sizhe An, Hongyi Xu, Yichun Shi, Guoxian Song, Umit Ogras, Linjie Luo

We propose PanoHead, the first 3D-aware generative model that enables high-quality view-consistent image synthesis of full heads in $360^\circ$ with diverse appearance and detailed geometry using only in-the-wild unstructured images for training.

Image Generation Image Segmentation +1

1,848

Paper
Code

PanoHead: Geometry-Aware 3D Full-Head Synthesis in 360deg

1 code implementation • CVPR 2023 • Sizhe An, Hongyi Xu, Yichun Shi, Guoxian Song, Umit Y. Ogras, Linjie Luo

We propose PanoHead, the first 3D-aware generative model that enables high-quality view-consistent image synthesis of full heads in 360deg with diverse appearance and detailed geometry using only in-the-wild unstructured images for training.

Image Generation Image Segmentation +1

1,848

Paper
Code

AvatarGen: A 3D Generative Model for Animatable Human Avatars

1 code implementation • 26 Nov 2022 • Jianfeng Zhang, Zihang Jiang, Dingdong Yang, Hongyi Xu, Yichun Shi, Guoxian Song, Zhongcong Xu, Xinchao Wang, Jiashi Feng

Specifically, we decompose the generative 3D human synthesis into pose-guided mapping and canonical representation with predefined human pose and shape, such that the canonical representation can be explicitly driven to different poses and shapes with the guidance of a 3D parametric human model SMPL.

241

Paper
Code

AvatarGen: a 3D Generative Model for Animatable Human Avatars

1 code implementation • 1 Aug 2022 • Jianfeng Zhang, Zihang Jiang, Dingdong Yang, Hongyi Xu, Yichun Shi, Guoxian Song, Zhongcong Xu, Xinchao Wang, Jiashi Feng

Unsupervised generation of clothed virtual humans with various appearance and animatable poses is important for creating 3D human avatars and other AR/VR applications.

3D Human Reconstruction

241

Paper
Code

Handling Data Heterogeneity in Federated Learning via Knowledge Distillation and Fusion

1 code implementation • 23 Jul 2022 • Xu Zhou, Xinyu Lei, Cong Yang, Yichun Shi, Xiao Zhang, Jingwen Shi

The key idea in FedKF is to let the server return the global knowledge to be fused with the local knowledge in each training round so that the local model can be regularized towards the global optima.

Data-free Knowledge Distillation Fairness +2

Paper
Code

IDE-3D: Interactive Disentangled Editing for High-Resolution 3D-aware Portrait Synthesis

1 code implementation • 31 May 2022 • Jingxiang Sun, Xuan Wang, Yichun Shi, Lizhen Wang, Jue Wang, Yebin Liu

Existing 3D-aware facial generation methods face a dilemma in quality versus editability: they either generate editable results in low resolution or high-quality ones with no editing flexibility.

Ranked #1 on 3D-Aware Image Synthesis on FFHQ 512 x 512 - 4x upscaling

3D-Aware Image Synthesis

465

Paper
Code

SemanticStyleGAN: Learning Compositional Generative Priors for Controllable Image Synthesis and Editing

1 code implementation • CVPR 2022 • Yichun Shi, Xiao Yang, Yangyue Wan, Xiaohui Shen

When combined with editing methods designed for StyleGANs, it can achieve a more fine-grained control to edit synthesized or real images.

Disentanglement Facial Editing +2

249

Paper
Code

Federated Learning with Domain Generalization

no code implementations • 20 Nov 2021 • Liling Zhang, Xinyu Lei, Yichun Shi, Hongyu Huang, Chao Chen

Federated Learning (FL) enables a group of clients to jointly train a machine learning model with the help of a centralized server.

Domain Generalization Federated Learning

Paper
Add Code

Semantic StyleGAN

no code implementations • arXiv:2112.02236v2 [cs.CV] 7 Dec 2021 2021 • Researchers at ByteDance Inc, Yichun Shi, Xiao Yang, Yangyue Wan, Xiaohui Shen

SemanticStyleGAN presents a method where a generator is trained to model local semantic parts separately and synthesizes images in a compositional way.

Disentanglement

Paper
Add Code

Lifting 2D StyleGAN for 3D-Aware Face Generation

1 code implementation • CVPR 2021 • Yichun Shi, Divyansh Aggarwal, Anil K. Jain

We propose a framework, called LiftedGAN, that disentangles and lifts a pre-trained StyleGAN2 for 3D-aware face generation.

Face Generation

Paper
Code

Boosting Unconstrained Face Recognition with Auxiliary Unlabeled Data

no code implementations • 17 Mar 2020 • Yichun Shi, Anil K. Jain

In recent years, significant progress has been made in face recognition, which can be partially attributed to the availability of large-scale labeled face datasets.

Domain Generalization Face Recognition

Paper
Add Code

Towards Universal Representation Learning for Deep Face Recognition

no code implementations • CVPR 2020 • Yichun Shi, Xiang Yu, Kihyuk Sohn, Manmohan Chandraker, Anil K. Jain

Recognizing wild faces is extremely hard as they appear with all kinds of variations.

Face Recognition Representation Learning

Paper
Add Code

Recurrent Embedding Aggregation Network for Video Face Recognition

no code implementations • 26 Apr 2019 • Sixue Gong, Yichun Shi, Anil K. Jain

Recurrent networks have been successful in analyzing temporal data and have been widely used for video analysis.

Face Recognition

Paper
Add Code

Probabilistic Face Embeddings

1 code implementation • ICCV 2019 • Yichun Shi, Anil K. Jain

Embedding methods have achieved success in face recognition by comparing facial features in a latent semantic space.

Ranked #1 on Face Verification on IJB-C (training dataset metric)

Face Recognition Face Verification

335

Paper
Code

Video Face Recognition: Component-wise Feature Aggregation Network (C-FAN)

no code implementations • 19 Feb 2019 • Sixue Gong, Yichun Shi, Anil K. Jain

We propose a new approach to video face recognition.

Face Recognition

Paper
Add Code

WarpGAN: Automatic Caricature Generation

3 code implementations • CVPR 2019 • Yichun Shi, Debayan Deb, Anil K. Jain

We propose, WarpGAN, a fully automatic network that can generate caricatures given an input face photo.

Photo-To-Caricature Translation

262

Paper
Code

DocFace+: ID Document to Selfie Matching

1 code implementation • 15 Sep 2018 • Yichun Shi, Anil K. Jain

Numerous activities in our daily life require us to verify who we are by showing our ID documents containing face images, such as passports and driver licenses, to human operators.

TAR

364

Paper
Code

DocFace: Matching ID Document Photos to Selfies

1 code implementation • 6 May 2018 • Yichun Shi, Anil K. Jain

Numerous activities in our daily life, including transactions, access to services and transportation, require us to verify who we are by showing our ID documents containing face images, e. g. passports and driver licenses.

Face Recognition TAR +1

364

Paper
Code

Face Recognition: Primates in the Wild

1 code implementation • 24 Apr 2018 • Debayan Deb, Susan Wiper, Alexandra Russo, Sixue Gong, Yichun Shi, Cori Tymoszek, Anil Jain

We present a new method of primate face recognition, and evaluate this method on several endangered primates, including golden monkeys, lemurs, and chimpanzees.

Face Recognition

Paper
Code

Face Clustering: Representation and Pairwise Constraints

no code implementations • 15 Jun 2017 • Yichun Shi, Charles Otto, Anil K. Jain

Given this representation, we design a clustering algorithm, Conditional Pairwise Clustering (ConPaC), which directly estimates the adjacency matrix only based on the similarity between face images.

Clustering Face Clustering +2

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.