Search Results for author: Chengyou Jia

Found 8 papers, 0 papers with code

Noisy Correspondence Learning with Self-Reinforcing Errors Mitigation

no code implementations • 27 Dec 2023 • Zhuohang Dang, Minnan Luo, Chengyou Jia, Guang Dai, Xiaojun Chang, Jingdong Wang

Cross-modal retrieval relies on well-matched large-scale datasets that are laborious in practice.

Cross-Modal Retrieval Memorization +2

Paper
Add Code

Generating Action-conditioned Prompts for Open-vocabulary Video Action Recognition

no code implementations • 4 Dec 2023 • Chengyou Jia, Minnan Luo, Xiaojun Chang, Zhuohang Dang, Mingfei Han, Mengmeng Wang, Guang Dai, Sizhe Dang, Jingdong Wang

To realize this, we innovatively blend video models with Large Language Models (LLMs) to devise Action-conditioned Prompts.

Action Recognition Descriptive +1

Paper
Add Code

Disentangled Representation Learning with Transmitted Information Bottleneck

no code implementations • 3 Nov 2023 • Zhuohang Dang, Minnan Luo, Chengyou Jia, Guang Dai, Jihong Wang, Xiaojun Chang, Jingdong Wang, Qinghua Zheng

Encoding only the task-related information from the raw data, \ie, disentangled representation learning, can greatly contribute to the robustness and generalizability of models.

Disentanglement Variational Inference

Paper
Add Code

PSDiff: Diffusion Model for Person Search with Iterative and Collaborative Refinement

no code implementations • 20 Sep 2023 • Chengyou Jia, Minnan Luo, Zhuohang Dang, Guang Dai, Xiaojun Chang, Jingdong Wang

Dominant Person Search methods aim to localize and recognize query persons in a unified network, which jointly optimizes two sub-tasks, \ie, pedestrian detection and Re-IDentification (ReID).

Denoising Pedestrian Detection +2

Paper
Add Code

SSMG: Spatial-Semantic Map Guided Diffusion Model for Free-form Layout-to-Image Generation

no code implementations • 20 Aug 2023 • Chengyou Jia, Minnan Luo, Zhuohang Dang, Guang Dai, Xiaojun Chang, Mengmeng Wang, Jingdong Wang

Despite significant progress in Text-to-Image (T2I) generative models, even lengthy and complex text descriptions still struggle to convey detailed controls.

Layout-to-Image Generation

Paper
Add Code

Multi-Modality Multi-Scale Cardiovascular Disease Subtypes Classification Using Raman Image and Medical History

no code implementations • 18 Apr 2023 • Bo Yu, Hechang Chen, Chengyou Jia, Hongren Zhou, Lele Cong, Xiankai Li, Jianhui Zhuang, Xianling Cong

Second, a probability matrix and a weight matrix are used to enhance the classification capacity by combining the RS and medical history data in the multi-modality data fusion module.

Specificity

Paper
Add Code

Disentangled Generation with Information Bottleneck for Few-Shot Learning

no code implementations • 29 Nov 2022 • Zhuohang Dang, Jihong Wang, Minnan Luo, Chengyou Jia, Caixia Yan, Qinghua Zheng

To these challenges, we propose a novel Information Bottleneck (IB) based Disentangled Generation Framework for FSL, termed as DisGenIB, that can simultaneously guarantee the discrimination and diversity of generated samples.

Disentanglement Few-Shot Learning

Paper
Add Code

CGUA: Context-Guided and Unpaired-Assisted Weakly Supervised Person Search

no code implementations • 27 Mar 2022 • Chengyou Jia, Minnan Luo, Caixia Yan, Xiaojun Chang, Qinghua Zheng

On the other hand, there are numerous unpaired persons in real-world scene images.

Person Search

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.