Search Results for author: Zhen Xing

Found 15 papers, 5 papers with code

FDGaussian: Fast Gaussian Splatting from Single Image via Geometric-aware Diffusion Model

no code implementations15 Mar 2024 Qijun Feng, Zhen Xing, Zuxuan Wu, Yu-Gang Jiang

Reconstructing detailed 3D objects from single-view images remains a challenging task due to the limited information available.

3D Reconstruction

AdaDiff: Adaptive Step Selection for Fast Diffusion

no code implementations24 Nov 2023 HUI ZHANG, Zuxuan Wu, Zhen Xing, Jie Shao, Yu-Gang Jiang

Diffusion models, as a type of generative models, have achieved impressive results in generating images and videos conditioned on textual conditions.

Denoising Image Generation +1

A Survey on Video Diffusion Models

1 code implementation16 Oct 2023 Zhen Xing, Qijun Feng, Haoran Chen, Qi Dai, Han Hu, Hang Xu, Zuxuan Wu, Yu-Gang Jiang

However, existing surveys mainly focus on diffusion models in the context of image generation, with few up-to-date reviews on their application in the video domain.

Image Generation Video Editing +2

Unsupervised Disentangling of Facial Representations with 3D-aware Latent Diffusion Models

1 code implementation15 Sep 2023 Ruian He, Zhen Xing, Weimin Tan, Bo Yan

Second, we propose a novel representation diffusion model (RDM) to disentangle 3D latent into facial identity and expression.

Face Verification Facial Expression Recognition +1

PanoSwin: a Pano-style Swin Transformer for Panorama Understanding

no code implementations CVPR 2023 Zhixin Ling, Zhen Xing, Xiangdong Zhou, Manliang Cao, Guichun Zhou

In panorama understanding, the widely used equirectangular projection (ERP) entails boundary discontinuity and spatial distortion.

ERP object-detection +2

SimDA: Simple Diffusion Adapter for Efficient Video Generation

no code implementations18 Aug 2023 Zhen Xing, Qi Dai, Han Hu, Zuxuan Wu, Yu-Gang Jiang

In this work, we propose a Simple Diffusion Adapter (SimDA) that fine-tunes only 24M out of 1. 1B parameters of a strong T2I model, adapting it to video generation in a parameter-efficient way.

Transfer Learning Video Editing +2

FlexDTI: Flexible diffusion gradient encoding scheme-based highly efficient diffusion tensor imaging using deep learning

no code implementations2 Aug 2023 Zejun Wu, Jiechao Wang, Zunquan Chen, Qinqin Yang, Zhen Xing, Dairong Cao, Jianfeng Bao, Taishan Kang, Jianzhong Lin, Shuhui Cai, Zhong Chen, Congbo Cai

Significance: FlexDTI can well learn diffusion gradient direction information to achieve generalized DTI reconstruction with flexible diffusion gradient scheme.

TranSFormer: Slow-Fast Transformer for Machine Translation

no code implementations26 May 2023 Bei Li, Yi Jing, Xu Tan, Zhen Xing, Tong Xiao, Jingbo Zhu

Learning multiscale Transformer models has been evidenced as a viable approach to augmenting machine translation systems.

Machine Translation Translation

Semi-Supervised Single-View 3D Reconstruction via Prototype Shape Priors

1 code implementation30 Sep 2022 Zhen Xing, Hengduo Li, Zuxuan Wu, Yu-Gang Jiang

In particular, we introduce an attention-guided prototype shape prior module for guiding realistic object reconstruction.

3D Reconstruction Object Reconstruction +2

Few-shot Single-view 3D Reconstruction with Memory Prior Contrastive Network

no code implementations30 Jul 2022 Zhen Xing, Yijiang Chen, Zhixin Ling, Xiangdong Zhou, Yu Xiang

In this paper, we present a Memory Prior Contrastive Network (MPCN) that can store shape prior knowledge in a few-shot learning based 3D reconstruction framework.

3D Reconstruction Contrastive Learning +3

3D-Augmented Contrastive Knowledge Distillation for Image-based Object Pose Estimation

no code implementations2 Jun 2022 Zhidan Liu, Zhen Xing, Xiangdong Zhou, Yijiang Chen, Guichun Zhou

We enhance the performance of image-based methods for category-agnostic object pose estimation by exploiting 3D knowledge learned by a multi-modal method.

Contrastive Learning Knowledge Distillation +2

CaSS: A Channel-aware Self-supervised Representation Learning Framework for Multivariate Time Series Classification

no code implementations8 Mar 2022 Yijiang Chen, Xiangdong Zhou, Zhen Xing, Zhidan Liu, Minyang Xu

Many previous works focus on the pretext task of self-supervised learning and usually neglect the complex problem of MTS encoding, leading to unpromising results.

Representation Learning Self-Supervised Learning +3

Feature Pyramid Network for Multi-task Affective Analysis

1 code implementation8 Jul 2021 Ruian He, Zhen Xing, Weimin Tan, Bo Yan

Affective Analysis is not a single task, and the valence-arousal value, expression class, and action unit can be predicted at the same time.

Cannot find the paper you are looking for? You can Submit a new open access paper.