Search Results for author: Xingqun Qi

Found 13 papers, 4 papers with code

M$^{2}$Chat: Empowering VLM for Multimodal LLM Interleaved Text-Image Generation

1 code implementation29 Nov 2023 Xiaowei Chi, Rongyu Zhang, Zhengkai Jiang, Yijiang Liu, Yatian Wang, Xingqun Qi, Wenhan Luo, Peng Gao, Shanghang Zhang, Qifeng Liu, Yike Guo

Moreover, to further enhance the effectiveness of $M^{3}Adapter$ while preserving the coherence of semantic context comprehension, we introduce a two-stage $M^{3}FT$ fine-tuning strategy.

Image Generation Language Modelling +1

Weakly-Supervised Emotion Transition Learning for Diverse 3D Co-speech Gesture Generation

no code implementations29 Nov 2023 Xingqun Qi, Jiahao Pan, Peng Li, Ruibin Yuan, Xiaowei Chi, Mengfei Li, Wenhan Luo, Wei Xue, Shanghang Zhang, Qifeng Liu, Yike Guo

In addition, the lack of large-scale available datasets with emotional transition speech and corresponding 3D human gestures also limits the addressing of this task.

Audio inpainting Gesture Generation

Audio-Visual Segmentation by Exploring Cross-Modal Mutual Semantics

no code implementations31 Jul 2023 Chen Liu, Peike Li, Xingqun Qi, Hu Zhang, Lincheng Li, Dadong Wang, Xin Yu

However, we observed that prior arts are prone to segment a certain salient object in a video regardless of the audio information.

Object Segmentation +1

EmotionGesture: Audio-Driven Diverse Emotional Co-Speech 3D Gesture Generation

1 code implementation30 May 2023 Xingqun Qi, Chen Liu, Lincheng Li, Jie Hou, Haoran Xin, Xin Yu

In this work, we propose EmotionGesture, a novel framework for synthesizing vivid and diverse emotional co-speech 3D gestures from audio.

Gesture Generation

Diverse 3D Hand Gesture Prediction from Body Dynamics by Bilateral Hand Disentanglement

1 code implementation CVPR 2023 Xingqun Qi, Chen Liu, Muyi Sun, Lincheng Li, Changjie Fan, Xin Yu

Considering the asymmetric gestures and motions of two hands, we introduce a Spatial-Residual Memory (SRM) module to model spatial interaction between the body and each hand by residual learning.

Disentanglement

LightVessel: Exploring Lightweight Coronary Artery Vessel Segmentation via Similarity Knowledge Distillation

no code implementations2 Nov 2022 Hao Dang, Yuekai Zhang, Xingqun Qi, Wanting Zhou, Muyi Sun

To tackle this problem, we propose \textbf{LightVessel}, a Similarity Knowledge Distillation Framework, for lightweight coronary artery vessel segmentation.

Knowledge Distillation

Exploring Generalizable Distillation for Efficient Medical Image Segmentation

1 code implementation26 Jul 2022 Xingqun Qi, Zhuojie Wu, Min Ren, Muyi Sun, Caifeng Shan, Zhenan Sun

Considering the domain-invariant representative vectors in MSAN, we propose two generalizable knowledge distillation schemes for cross-domain distillation, Dual Contrastive Graph Distillation (DCGD) and Domain-Invariant Cross Distillation (DICD).

Image Segmentation Knowledge Distillation +3

ShowFace: Coordinated Face Inpainting with Memory-Disentangled Refinement Networks

no code implementations6 Apr 2022 Zhuojie Wu, Xingqun Qi, Zijian Wang, Wanting Zhou, Kun Yuan, Muyi Sun, Zhenan Sun

Furthermore, to better improve the inter-coordination between the corrupted and non-corrupted regions and enhance the intra-coordination in corrupted regions, we design InCo2 Loss, a pair of similarity based losses to constrain the feature consistency.

Disentanglement Facial Inpainting

MOST-Net: A Memory Oriented Style Transfer Network for Face Sketch Synthesis

no code implementations8 Feb 2022 Fan Ji, Muyi Sun, Xingqun Qi, Qi Li, Zhenan Sun

Furthermore, we design a novel Memory Refinement Loss (MR Loss) for feature alignment in the memory module, which enhances the accuracy of memory slots in an unsupervised manner.

Face Sketch Synthesis Image-to-Image Translation +2

Biphasic Face Photo-Sketch Synthesis via Semantic-Driven Generative Adversarial Network with Graph Representation Learning

no code implementations5 Jan 2022 Xingqun Qi, Muyi Sun, Zijian Wang, Jiaming Liu, Qi Li, Fang Zhao, Shanghang Zhang, Caifeng Shan

To preserve the generated faces being more structure-coordinated, the IRSG models inter-class structural relations among every facial component by graph representation learning.

Generative Adversarial Network Graph Representation Learning +1

Face Sketch Synthesis via Semantic-Driven Generative Adversarial Network

no code implementations29 Jun 2021 Xingqun Qi, Muyi Sun, Weining Wang, Xiaoxiao Dong, Qi Li, Caifeng Shan

To tackle these challenges, we propose a novel Semantic-Driven Generative Adversarial Network (SDGAN) which embeds global structure-level style injection and local class-level knowledge re-weighting.

Face Parsing Face Sketch Synthesis +2

Cannot find the paper you are looking for? You can Submit a new open access paper.