Search Results for author: Zhen Xing

Found 15 papers, 5 papers with code

FDGaussian: Fast Gaussian Splatting from Single Image via Geometric-aware Diffusion Model

no code implementations • 15 Mar 2024 • Qijun Feng, Zhen Xing, Zuxuan Wu, Yu-Gang Jiang

Reconstructing detailed 3D objects from single-view images remains a challenging task due to the limited information available.

3D Reconstruction

Paper
Add Code

VIDiff: Translating Videos via Multi-Modal Instructions with Diffusion Models

no code implementations • 30 Nov 2023 • Zhen Xing, Qi Dai, Zihao Zhang, HUI ZHANG, Han Hu, Zuxuan Wu, Yu-Gang Jiang

Our model can edit and translate the desired results within seconds based on user instructions.

Semantic Segmentation Video Editing +3

Paper
Add Code

AdaDiff: Adaptive Step Selection for Fast Diffusion

no code implementations • 24 Nov 2023 • HUI ZHANG, Zuxuan Wu, Zhen Xing, Jie Shao, Yu-Gang Jiang

Diffusion models, as a type of generative models, have achieved impressive results in generating images and videos conditioned on textual conditions.

Denoising Image Generation +1

Paper
Add Code

A Survey on Video Diffusion Models

1 code implementation • 16 Oct 2023 • Zhen Xing, Qijun Feng, Haoran Chen, Qi Dai, Han Hu, Hang Xu, Zuxuan Wu, Yu-Gang Jiang

However, existing surveys mainly focus on diffusion models in the context of image generation, with few up-to-date reviews on their application in the video domain.

Image Generation Video Editing +2

1,289

Paper
Code

Unsupervised Disentangling of Facial Representations with 3D-aware Latent Diffusion Models

1 code implementation • 15 Sep 2023 • Ruian He, Zhen Xing, Weimin Tan, Bo Yan

Second, we propose a novel representation diffusion model (RDM) to disentangle 3D latent into facial identity and expression.

Face Verification Facial Expression Recognition +1

Paper
Code

PanoSwin: a Pano-style Swin Transformer for Panorama Understanding

no code implementations • CVPR 2023 • Zhixin Ling, Zhen Xing, Xiangdong Zhou, Manliang Cao, Guichun Zhou

In panorama understanding, the widely used equirectangular projection (ERP) entails boundary discontinuity and spatial distortion.

ERP object-detection +2

Paper
Add Code

SimDA: Simple Diffusion Adapter for Efficient Video Generation

no code implementations • 18 Aug 2023 • Zhen Xing, Qi Dai, Han Hu, Zuxuan Wu, Yu-Gang Jiang

In this work, we propose a Simple Diffusion Adapter (SimDA) that fine-tunes only 24M out of 1. 1B parameters of a strong T2I model, adapting it to video generation in a parameter-efficient way.

Transfer Learning Video Editing +2

Paper
Add Code

FlexDTI: Flexible diffusion gradient encoding scheme-based highly efficient diffusion tensor imaging using deep learning

no code implementations • 2 Aug 2023 • Zejun Wu, Jiechao Wang, Zunquan Chen, Qinqin Yang, Zhen Xing, Dairong Cao, Jianfeng Bao, Taishan Kang, Jianzhong Lin, Shuhui Cai, Zhong Chen, Congbo Cai

Significance: FlexDTI can well learn diffusion gradient direction information to achieve generalized DTI reconstruction with flexible diffusion gradient scheme.

Paper
Add Code

TranSFormer: Slow-Fast Transformer for Machine Translation

no code implementations • 26 May 2023 • Bei Li, Yi Jing, Xu Tan, Zhen Xing, Tong Xiao, Jingbo Zhu

Learning multiscale Transformer models has been evidenced as a viable approach to augmenting machine translation systems.

Machine Translation Translation

Paper
Add Code

SVFormer: Semi-supervised Video Transformer for Action Recognition

1 code implementation • CVPR 2023 • Zhen Xing, Qi Dai, Han Hu, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang

In this paper, we investigate the use of transformer models under the SSL setting for action recognition.

Action Recognition Semi-Supervised Image Classification +1

Paper
Code

Semi-Supervised Single-View 3D Reconstruction via Prototype Shape Priors

1 code implementation • 30 Sep 2022 • Zhen Xing, Hengduo Li, Zuxuan Wu, Yu-Gang Jiang

In particular, we introduce an attention-guided prototype shape prior module for guiding realistic object reconstruction.

3D Reconstruction Object Reconstruction +2

Paper
Code

Few-shot Single-view 3D Reconstruction with Memory Prior Contrastive Network

no code implementations • 30 Jul 2022 • Zhen Xing, Yijiang Chen, Zhixin Ling, Xiangdong Zhou, Yu Xiang

In this paper, we present a Memory Prior Contrastive Network (MPCN) that can store shape prior knowledge in a few-shot learning based 3D reconstruction framework.

3D Reconstruction Contrastive Learning +3

Paper
Add Code

3D-Augmented Contrastive Knowledge Distillation for Image-based Object Pose Estimation

no code implementations • 2 Jun 2022 • Zhidan Liu, Zhen Xing, Xiangdong Zhou, Yijiang Chen, Guichun Zhou

We enhance the performance of image-based methods for category-agnostic object pose estimation by exploiting 3D knowledge learned by a multi-modal method.

Contrastive Learning Knowledge Distillation +2

Paper
Add Code

CaSS: A Channel-aware Self-supervised Representation Learning Framework for Multivariate Time Series Classification

no code implementations • 8 Mar 2022 • Yijiang Chen, Xiangdong Zhou, Zhen Xing, Zhidan Liu, Minyang Xu

Many previous works focus on the pretext task of self-supervised learning and usually neglect the complex problem of MTS encoding, leading to unpromising results.

Representation Learning Self-Supervised Learning +3

Paper
Add Code

Feature Pyramid Network for Multi-task Affective Analysis

1 code implementation • 8 Jul 2021 • Ruian He, Zhen Xing, Weimin Tan, Bo Yan

Affective Analysis is not a single task, and the valence-arousal value, expression class, and action unit can be predicted at the same time.

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.