Search Results for author: Zhichao Wei

Found 3 papers, 0 papers with code

MM-Diff: High-Fidelity Image Personalization via Multi-Modal Condition Integration

no code implementations22 Mar 2024 Zhichao Wei, Qingkun Su, Long Qin, Weizhi Wang

CLS embeddings are used on the one hand to augment the text embeddings, and on the other hand together with patch embeddings to derive a small number of detail-rich subject embeddings, both of which are efficiently integrated into the diffusion model through the well-designed multimodal cross-attention mechanism.

Image Generation

UVOSAM: A Mask-free Paradigm for Unsupervised Video Object Segmentation via Segment Anything Model

no code implementations22 May 2023 Zhenghao Zhang, Zhichao Wei, Shengfan Zhang, Zuozhuo Dai, Siyu Zhu

Unsupervised video object segmentation has made significant progress in recent years, but the manual annotation of video mask datasets is expensive and limits the diversity of available datasets.

Image Segmentation Object +5

Linguistic Query-Guided Mask Generation for Referring Image Segmentation

no code implementations16 Jan 2023 Zhichao Wei, Xiaohao Chen, Mingqiang Chen, Siyu Zhu

Referring image segmentation aims to segment the image region of interest according to the given language expression, which is a typical multi-modal task.

Contrastive Learning Decoder +3

Cannot find the paper you are looking for? You can Submit a new open access paper.