Search Results for author: Jiazheng Xing

Found 8 papers, 5 papers with code

TryOn-Adapter: Efficient Fine-Grained Clothing Identity Adaptation for High-Fidelity Virtual Try-On

1 code implementation • 1 Apr 2024 • Jiazheng Xing, Chao Xu, Yijie Qian, Yang Liu, Guang Dai, Baigui Sun, Yong liu, Jingdong Wang

However, the clothing identity uncontrollability and training inefficiency of existing diffusion-based methods, which struggle to maintain the identity even with full parameter training, are significant limitations that hinder the widespread applications.

Virtual Try-on

Paper
Code

SDSTrack: Self-Distillation Symmetric Adapter Learning for Multi-Modal Visual Object Tracking

1 code implementation • 24 Mar 2024 • Xiaojun Hou, Jiazheng Xing, Yijie Qian, Yaowei Guo, Shuo Xin, JunHao Chen, Kai Tang, Mengmeng Wang, Zhengkai Jiang, Liang Liu, Yong liu

Multimodal Visual Object Tracking (VOT) has recently gained significant attention due to its robustness.

Ranked #17 on Rgb-T Tracking on RGBT234

Rgb-T Tracking Visual Object Tracking

Paper
Code

FaceChain-ImagineID: Freely Crafting High-Fidelity Diverse Talking Faces from Disentangled Audio

1 code implementation • 4 Mar 2024 • Chao Xu, Yang Liu, Jiazheng Xing, Weida Wang, Mingze Sun, Jun Dan, Tianxin Huang, Siyuan Li, Zhi-Qi Cheng, Ying Tai, Baigui Sun

In this paper, we abstract the process of people hearing speech, extracting meaningful cues, and creating various dynamically audio-consistent talking faces, termed Listening and Imagining, into the task of high-fidelity diverse talking faces generation from a single audio.

Disentanglement

8,393

Paper
Code

M2-CLIP: A Multimodal, Multi-task Adapting Framework for Video Action Recognition

no code implementations • 22 Jan 2024 • Mengmeng Wang, Jiazheng Xing, Boyuan Jiang, Jun Chen, Jianbiao Mei, Xingxing Zuo, Guang Dai, Jingdong Wang, Yong liu

In this paper, we introduce a novel Multimodal, Multi-task CLIP adapting framework named \name to address these challenges, preserving both high supervised performance and robust transferability.

Action Recognition Decoder +1

Paper
Add Code

Boosting Few-shot Action Recognition with Graph-guided Hybrid Matching

1 code implementation • ICCV 2023 • Jiazheng Xing, Mengmeng Wang, Yudi Ruan, Bofan Chen, Yaowei Guo, Boyu Mu, Guang Dai, Jingdong Wang, Yong liu

Class prototype construction and matching are core aspects of few-shot action recognition.

Feature Correlation Few-Shot action recognition +1

Paper
Code

Multimodal Adaptation of CLIP for Few-Shot Action Recognition

no code implementations • 3 Aug 2023 • Jiazheng Xing, Mengmeng Wang, Xiaojun Hou, Guang Dai, Jingdong Wang, Yong liu

The adapters we design can combine information from video-text multimodal sources for task-oriented spatiotemporal modeling, which is fast, efficient, and has low training costs.

Few-Shot action recognition Few Shot Action Recognition

Paper
Add Code

Revisiting the Spatial and Temporal Modeling for Few-shot Action Recognition

no code implementations • 19 Jan 2023 • Jiazheng Xing, Mengmeng Wang, Yong liu, Boyu Mu

In this paper, we propose SloshNet, a new framework that revisits the spatial and temporal modeling for few-shot action recognition in a finer manner.

Few-Shot action recognition Few Shot Action Recognition

Paper
Add Code

ActionCLIP: A New Paradigm for Video Action Recognition

2 code implementations • 17 Sep 2021 • Mengmeng Wang, Jiazheng Xing, Yong liu

Moreover, to handle the deficiency of label texts and make use of tremendous web data, we propose a new paradigm based on this multimodal learning framework for action recognition, which we dub "pre-train, prompt and fine-tune".

Ranked #2 on Action Recognition In Videos on Kinetics-400

Action Classification Action Recognition In Videos +4

3,013

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.