Search Results for author: Yachao Zhang

Found 17 papers, 7 papers with code

Lodge: A Coarse to Fine Diffusion Network for Long Dance Generation Guided by the Characteristic Dance Primitives

1 code implementation • 15 Mar 2024 • Ronghui Li, Yuxiang Zhang, Yachao Zhang, Hongwen Zhang, Jie Guo, Yan Zhang, Yebin Liu, Xiu Li

In contrast, the second stage is the local diffusion, which generates detailed motion sequences in parallel under the guidance of the dance primitives and choreographic rules.

Ranked #1 on Motion Synthesis on FineDance

Motion Synthesis

MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space Models

no code implementations • 14 Mar 2024 • Zunnan Xu, Yukang Lin, Haonan Han, Sicheng Yang, Ronghui Li, Yachao Zhang, Xiu Li

Gesture synthesis is a vital realm of human-computer interaction, with wide-ranging applications across various fields like film, robotics, and virtual reality.

Multi-Memory Matching for Unsupervised Visible-Infrared Person Re-Identification

no code implementations • 12 Jan 2024 • Jiangming Shi, Xiangbo Yin, Yeyun Chen, Yachao Zhang, Zhizhong Zhang, Yuan Xie, Yanyun Qu

To associate cross-modality clustered pseudo-labels, we design a Multi-Memory Learning and Matching (MMLM) module, ensuring that optimization explicitly focuses on the nuances of individual perspectives and establishes reliable cross-modality correspondences.

Clustering • Person Re-Identification • +1

Text2Avatar: Text to 3D Human Avatar Generation with Codebook-Driven Body Controllable Attribute

no code implementations • 1 Jan 2024 • Chaoqun Gong, Yuqin Dai, Ronghui Li, Achun Bao, Jun Li, Jian Yang, Yachao Zhang, Xiu Li

Generating 3D human models directly from text helps reduce the cost and time of character modeling.

Attribute Disentanglement +2

Exploring Multi-Modal Control in Music-Driven Dance Generation

no code implementations • 1 Jan 2024 • Ronghui Li, Yuqin Dai, Yachao Zhang, Jun Li, Jian Yang, Jie Guo, Xiu Li

Existing music-driven 3D dance generation methods mainly concentrate on high-quality dance generation, but lack sufficient control during the generation process.

Chain of Generation: Multi-Modal Gesture Synthesis via Cascaded Conditional Control

no code implementations • 26 Dec 2023 • Zunnan Xu, Yachao Zhang, Sicheng Yang, Ronghui Li, Xiu Li

We introduce a novel method that separates priors from speech and employs multimodal priors as constraints for generating gestures.

Gesture Generation

Consistent123: One Image to Highly Consistent 3D Asset Using Case-Aware Diffusion Priors

no code implementations • 29 Sep 2023 • Yukang Lin, Haonan Han, Chaoqun Gong, Zunnan Xu, Yachao Zhang, Xiu Li

However, due to utilizing the case-agnostic rigid strategy, their generalization ability to arbitrary cases and the 3D consistency of reconstruction are still poor.

Image to 3D

UniHead: Unifying Multi-Perception for Detection Heads

1 code implementation • 23 Sep 2023 • Hantao Zhou, Rui Yang, Yachao Zhang, Haoran Duan, Yawen Huang, Runze Hu, Xiu Li, Yefeng Zheng

More precisely, our approach (1) introduces deformation perception, enabling the model to adaptively sample object features; (2) proposes a Dual-axial Aggregation Transformer (DAT) to adeptly model long-range dependencies, thereby achieving global perception; and (3) devises a Cross-task Interaction Transformer (CIT) that facilitates interaction between the classification and localization branches, thus aligning the two tasks.

BEV-DG: Cross-Modal Learning under Bird's-Eye View for Domain Generalization of 3D Semantic Segmentation

no code implementations • ICCV 2023 • Miaoyu Li, Yachao Zhang, Xu Ma, Yanyun Qu, Yun Fu

In light of this, we propose cross-modal learning under bird's-eye view for Domain Generalization (DG) of 3D semantic segmentation, called BEV-DG.

3D Semantic Segmentation • Contrastive Learning • +2

Strategic Preys Make Acute Predators: Enhancing Camouflaged Object Detectors by Generating Camouflaged Objects

1 code implementation • 6 Aug 2023 • Chunming He, Kai Li, Yachao Zhang, Yulun Zhang, Zhenhua Guo, Xiu Li, Martin Danelljan, Fisher Yu

On the prey side, we propose an adversarial training framework, Camouflageator, which introduces an auxiliary generator to generate more camouflaged objects that are harder for a COD method to detect.

Object Detection

Weakly-Supervised Concealed Object Segmentation with SAM-based Pseudo Labeling and Multi-scale Feature Grouping

no code implementations • NeurIPS 2023 • Chunming He, Kai Li, Yachao Zhang, Guoxia Xu, Longxiang Tang, Yulun Zhang, Zhenhua Guo, Xiu Li

It remains a challenging task since (1) it is hard to distinguish concealed objects from the background due to the intrinsic similarity and (2) the sparsely-annotated training data only provide weak supervision for model learning.

Segmentation Semantic Segmentation

Efficient Converted Spiking Neural Network for 3D and 2D Classification

no code implementations • ICCV 2023 • Yuxiang Lan, Yachao Zhang, Xu Ma, Yanyun Qu, Yun Fu

Spiking Neural Networks (SNNs) have attracted enormous research interest due to their low-power and biologically plausible nature.

Image Classification • Point Cloud Classification

Dual Pseudo-Labels Interactive Self-Training for Semi-Supervised Visible-Infrared Person Re-Identification

1 code implementation • ICCV 2023 • Jiangming Shi, Yachao Zhang, Xiangbo Yin, Yuan Xie, Zhizhong Zhang, Jianping Fan, Zhongchao Shi, Yanyun Qu

Visible-infrared person re-identification (VI-ReID) aims to match a specific person from a gallery of images captured from non-overlapping visible and infrared cameras.

Person Re-Identification • Pseudo Label

Camouflaged Object Detection With Feature Decomposition and Edge Reconstruction

no code implementations • CVPR 2023 • Chunming He, Kai Li, Yachao Zhang, Longxiang Tang, Yulun Zhang, Zhenhua Guo, Xiu Li

COD is a challenging task due to the intrinsic similarity of camouflaged objects with the background, as well as their ambiguous boundaries.

Object Detection

Weakly Supervised Semantic Segmentation for Large-Scale Point Cloud

4 code implementations • AAAI 2021 • Yachao Zhang, Zonghao Li, Yuan Xie, Yanyun Qu, Cuihua Li, Tao Mei

Firstly, we construct a pretext task, i.e., point cloud colorization, with self-supervised learning to transfer the learned prior knowledge from a large amount of unlabeled point cloud data to a weakly supervised network.

Colorization • Pseudo Label • +3

FineDance: A Fine-grained Choreography Dataset for 3D Full Body Dance Generation

1 code implementation • ICCV 2023 • Ronghui Li, Junfan Zhao, Yachao Zhang, Mingyang Su, Zeping Ren, Han Zhang, Yansong Tang, Xiu Li

To address these problems, we propose FineDance, which contains 14.6 hours of music-dance paired data, with fine-grained hand motions, fine-grained genres (22 dance genres), and accurate posture.

Motion Synthesis • Retrieval

Perturbed Self-Distillation: Weakly Supervised Large-Scale Point Cloud Semantic Segmentation

3 code implementations • ICCV 2021 • Yachao Zhang, Yanyun Qu, Yuan Xie, Zonghao Li, Shanshan Zheng, Cuihua Li

In this way, the graph topology of the whole point cloud can be effectively established by the introduced auxiliary supervision, such that the information propagation between the labeled and unlabeled points will be realized.

Self-Supervised Learning • Semantic Segmentation • +1
