Search Results for author: Yichun Shi

Found 32 papers, 17 papers with code

Enhancing 3D Fidelity of Text-to-3D using Cross-View Correspondences

no code implementations 16 Apr 2024 SeungWook Kim, Kejie Li, Xueqing Deng, Yichun Shi, Minsu Cho, Peng Wang

Leveraging multi-view diffusion models as priors for 3D optimization has alleviated the problem of 3D consistency, e.g., the Janus face problem or the content drift problem, in zero-shot text-to-3D models.

Common Sense Reasoning Text to 3D

HQ-Edit: A High-Quality Dataset for Instruction-based Image Editing

no code implementations 15 Apr 2024 Mude Hui, Siwei Yang, Bingchen Zhao, Yichun Shi, Heng Wang, Peng Wang, Yuyin Zhou, Cihang Xie

This study introduces HQ-Edit, a high-quality instruction-based image editing dataset with around 200,000 edits.

Attribute

Magic-Boost: Boost 3D Generation with Multi-View Conditioned Diffusion

no code implementations 9 Apr 2024 Fan Yang, Jianfeng Zhang, Yichun Shi, Bowen Chen, Chenxu Zhang, Huichao Zhang, Xiaofeng Yang, Jiashi Feng, Guosheng Lin

Benefiting from the rapid development of 2D diffusion models, 3D content creation has made significant progress recently.

3D Generation

X-Portrait: Expressive Portrait Animation with Hierarchical Motion Attention

no code implementations 23 Mar 2024 You Xie, Hongyi Xu, Guoxian Song, Chao Wang, Yichun Shi, Linjie Luo

We propose X-Portrait, an innovative conditional diffusion model tailored for generating expressive and temporally coherent portrait animation.

Disentanglement

DiffPortrait3D: Controllable Diffusion for Zero-Shot Portrait View Synthesis

1 code implementation 20 Dec 2023 Yuming Gu, You Xie, Hongyi Xu, Guoxian Song, Yichun Shi, Di Chang, Jing Yang, Linjie Luo

The rendering view is then manipulated with a novel conditional control module that interprets the camera pose by watching a condition image of a crossed subject from the same view.

Denoising

ImageDream: Image-Prompt Multi-view Diffusion for 3D Generation

no code implementations 2 Dec 2023 Peng Wang, Yichun Shi

We introduce "ImageDream," an innovative image-prompt, multi-view diffusion model for 3D object generation.

3D Generation Object

Consistent-1-to-3: Consistent Image to 3D View Synthesis via Geometry-aware Diffusion Models

no code implementations 4 Oct 2023 Jianglong Ye, Peng Wang, Kejie Li, Yichun Shi, Heng Wang

Specifically, we decompose the NVS task into two stages: (i) transforming observed regions to a novel view, and (ii) hallucinating unseen regions.

Image to 3D Novel View Synthesis

MVDream: Multi-view Diffusion for 3D Generation

2 code implementations 31 Aug 2023 Yichun Shi, Peng Wang, Jianglong Ye, Mai Long, Kejie Li, Xiao Yang

We introduce MVDream, a diffusion model that is able to generate consistent multi-view images from a given text prompt.

3D Generation

OmniAvatar: Geometry-Guided Controllable 3D Head Synthesis

no code implementations CVPR 2023 Hongyi Xu, Guoxian Song, Zihang Jiang, Jianfeng Zhang, Yichun Shi, Jing Liu, WanChun Ma, Jiashi Feng, Linjie Luo

We present OmniAvatar, a novel geometry-guided 3D head synthesis model trained on in-the-wild unstructured images that is capable of synthesizing diverse identity-preserved 3D heads with compelling dynamic details under fully disentangled control over camera poses, facial expressions, head shapes, and articulated neck and jaw poses.

AgileGAN3D: Few-Shot 3D Portrait Stylization by Augmented Transfer Learning

no code implementations 24 Mar 2023 Guoxian Song, Hongyi Xu, Jing Liu, Tiancheng Zhi, Yichun Shi, Jianfeng Zhang, Zihang Jiang, Jiashi Feng, Shen Sang, Linjie Luo

Capitalizing on the recent advancement of 3D-aware GAN models, we perform guided transfer learning on a pretrained 3D GAN generator to produce multi-view-consistent stylized renderings.

Transfer Learning

PanoHead: Geometry-Aware 3D Full-Head Synthesis in 360°

1 code implementation 23 Mar 2023 Sizhe An, Hongyi Xu, Yichun Shi, Guoxian Song, Umit Ogras, Linjie Luo

We propose PanoHead, the first 3D-aware generative model that enables high-quality view-consistent image synthesis of full heads in 360° with diverse appearance and detailed geometry using only in-the-wild unstructured images for training.

Image Generation Image Segmentation +1

PanoHead: Geometry-Aware 3D Full-Head Synthesis in 360°

1 code implementation CVPR 2023 Sizhe An, Hongyi Xu, Yichun Shi, Guoxian Song, Umit Y. Ogras, Linjie Luo

We propose PanoHead, the first 3D-aware generative model that enables high-quality view-consistent image synthesis of full heads in 360° with diverse appearance and detailed geometry using only in-the-wild unstructured images for training.

Image Generation Image Segmentation +1

AvatarGen: A 3D Generative Model for Animatable Human Avatars

1 code implementation 26 Nov 2022 Jianfeng Zhang, Zihang Jiang, Dingdong Yang, Hongyi Xu, Yichun Shi, Guoxian Song, Zhongcong Xu, Xinchao Wang, Jiashi Feng

Specifically, we decompose the generative 3D human synthesis into pose-guided mapping and canonical representation with predefined human pose and shape, such that the canonical representation can be explicitly driven to different poses and shapes with the guidance of a 3D parametric human model SMPL.

AvatarGen: a 3D Generative Model for Animatable Human Avatars

1 code implementation 1 Aug 2022 Jianfeng Zhang, Zihang Jiang, Dingdong Yang, Hongyi Xu, Yichun Shi, Guoxian Song, Zhongcong Xu, Xinchao Wang, Jiashi Feng

Unsupervised generation of clothed virtual humans with various appearance and animatable poses is important for creating 3D human avatars and other AR/VR applications.

3D Human Reconstruction

Handling Data Heterogeneity in Federated Learning via Knowledge Distillation and Fusion

1 code implementation 23 Jul 2022 Xu Zhou, Xinyu Lei, Cong Yang, Yichun Shi, Xiao Zhang, Jingwen Shi

The key idea in FedKF is to let the server return the global knowledge to be fused with the local knowledge in each training round so that the local model can be regularized towards the global optima.

Data-free Knowledge Distillation Fairness +2

IDE-3D: Interactive Disentangled Editing for High-Resolution 3D-aware Portrait Synthesis

1 code implementation 31 May 2022 Jingxiang Sun, Xuan Wang, Yichun Shi, Lizhen Wang, Jue Wang, Yebin Liu

Existing 3D-aware facial generation methods face a dilemma in quality versus editability: they either generate editable results in low resolution or high-quality ones with no editing flexibility.

3D-Aware Image Synthesis

SemanticStyleGAN: Learning Compositional Generative Priors for Controllable Image Synthesis and Editing

1 code implementation CVPR 2022 Yichun Shi, Xiao Yang, Yangyue Wan, Xiaohui Shen

When combined with editing methods designed for StyleGANs, it can achieve more fine-grained control when editing synthesized or real images.

Disentanglement Facial Editing +2

Federated Learning with Domain Generalization

no code implementations 20 Nov 2021 Liling Zhang, Xinyu Lei, Yichun Shi, Hongyu Huang, Chao Chen

Federated Learning (FL) enables a group of clients to jointly train a machine learning model with the help of a centralized server.

Domain Generalization Federated Learning

Semantic StyleGAN

no code implementations arXiv:2112.02236v2 [cs.CV] 7 Dec 2021 Researchers at ByteDance Inc, Yichun Shi, Xiao Yang, Yangyue Wan, Xiaohui Shen

SemanticStyleGAN presents a method where a generator is trained to model local semantic parts separately and synthesizes images in a compositional way.

Disentanglement

Lifting 2D StyleGAN for 3D-Aware Face Generation

1 code implementation CVPR 2021 Yichun Shi, Divyansh Aggarwal, Anil K. Jain

We propose a framework, called LiftedGAN, that disentangles and lifts a pre-trained StyleGAN2 for 3D-aware face generation.

Face Generation

Boosting Unconstrained Face Recognition with Auxiliary Unlabeled Data

no code implementations 17 Mar 2020 Yichun Shi, Anil K. Jain

In recent years, significant progress has been made in face recognition, which can be partially attributed to the availability of large-scale labeled face datasets.

Domain Generalization Face Recognition

Recurrent Embedding Aggregation Network for Video Face Recognition

no code implementations 26 Apr 2019 Sixue Gong, Yichun Shi, Anil K. Jain

Recurrent networks have been successful in analyzing temporal data and have been widely used for video analysis.

Face Recognition

Probabilistic Face Embeddings

1 code implementation ICCV 2019 Yichun Shi, Anil K. Jain

Embedding methods have achieved success in face recognition by comparing facial features in a latent semantic space.

Ranked #1 on Face Verification on IJB-C (training dataset metric)

Face Recognition Face Verification

WarpGAN: Automatic Caricature Generation

3 code implementations CVPR 2019 Yichun Shi, Debayan Deb, Anil K. Jain

We propose WarpGAN, a fully automatic network that can generate caricatures given an input face photo.

Photo-To-Caricature Translation

DocFace+: ID Document to Selfie Matching

1 code implementation 15 Sep 2018 Yichun Shi, Anil K. Jain

Numerous activities in our daily life require us to verify who we are by showing our ID documents containing face images, such as passports and driver licenses, to human operators.

TAR

DocFace: Matching ID Document Photos to Selfies

1 code implementation 6 May 2018 Yichun Shi, Anil K. Jain

Numerous activities in our daily life, including transactions, access to services, and transportation, require us to verify who we are by showing our ID documents containing face images, e.g., passports and driver licenses.

Face Recognition TAR +1

Face Recognition: Primates in the Wild

1 code implementation 24 Apr 2018 Debayan Deb, Susan Wiper, Alexandra Russo, Sixue Gong, Yichun Shi, Cori Tymoszek, Anil Jain

We present a new method of primate face recognition, and evaluate this method on several endangered primates, including golden monkeys, lemurs, and chimpanzees.

Face Recognition

Face Clustering: Representation and Pairwise Constraints

no code implementations 15 Jun 2017 Yichun Shi, Charles Otto, Anil K. Jain

Given this representation, we design a clustering algorithm, Conditional Pairwise Clustering (ConPaC), which directly estimates the adjacency matrix only based on the similarity between face images.

Clustering Face Clustering +2
