Search Results for author: Zunnan Xu

Found 6 papers, 1 papers with code

MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space Models

no code implementations • 14 Mar 2024 • Zunnan Xu, Yukang Lin, Haonan Han, Sicheng Yang, Ronghui Li, Yachao Zhang, Xiu Li

Gesture synthesis is a vital realm of human-computer interaction, with wide-ranging applications across various fields like film, robotics, and virtual reality.

Paper
Add Code

BATON: Aligning Text-to-Audio Model with Human Preference Feedback

no code implementations • 1 Feb 2024 • Huan Liao, Haonan Han, Kai Yang, Tianjiao Du, Rui Yang, Zunnan Xu, Qinmei Xu, Jingquan Liu, Jiasheng Lu, Xiu Li

With the development of AI-Generated Content (AIGC), text-to-audio models are gaining widespread attention.

Paper
Add Code

Freetalker: Controllable Speech and Text-Driven Gesture Generation Based on Diffusion Models for Enhanced Speaker Naturalness

no code implementations • 7 Jan 2024 • Sicheng Yang, Zunnan Xu, Haiwei Xue, Yongkang Cheng, Shaoli Huang, Mingming Gong, Zhiyong Wu

To tackle these issues, we introduce FreeTalker, which, to the best of our knowledge, is the first framework for the generation of both spontaneous (e. g., co-speech gesture) and non-spontaneous (e. g., moving around the podium) speaker motions.

Gesture Generation

Paper
Add Code

Chain of Generation: Multi-Modal Gesture Synthesis via Cascaded Conditional Control

no code implementations • 26 Dec 2023 • Zunnan Xu, Yachao Zhang, Sicheng Yang, Ronghui Li, Xiu Li

We introduce a novel method that separates priors from speech and employs multimodal priors as constraints for generating gestures.

Gesture Generation

Paper
Add Code

Consistent123: One Image to Highly Consistent 3D Asset Using Case-Aware Diffusion Priors

no code implementations • 29 Sep 2023 • Yukang Lin, Haonan Han, Chaoqun Gong, Zunnan Xu, Yachao Zhang, Xiu Li

However, due to utilizing the case-agnostic rigid strategy, their generalization ability to arbitrary cases and the 3D consistency of reconstruction are still poor.

Image to 3D

Paper
Add Code

Bridging Vision and Language Encoders: Parameter-Efficient Tuning for Referring Image Segmentation

1 code implementation • ICCV 2023 • Zunnan Xu, Zhihong Chen, Yong Zhang, Yibing Song, Xiang Wan, Guanbin Li

Parameter Efficient Tuning (PET) has gained attention for reducing the number of parameters while maintaining performance and providing better hardware resource savings, but few studies investigate dense prediction tasks and interaction between modalities.

Ranked #2 on Referring Expression Segmentation on RefCOCO

Image Segmentation Referring Expression Segmentation +2

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.