no code implementations • 19 Apr 2024 • Longfei Huang, Shupeng Zhong, Xiangyu Wu, Ruoxuan Li, QingGuo Chen, Yang Yang
Subsequently, we propose a caption-level strategy for the high-quality caption data generated by image caption models and integrate them with a retrieval augmentation strategy into the template, compelling the model to generate higher-quality, better-matching, and semantically enriched captions based on the retrieval augmentation prompts.
no code implementations • 13 Mar 2024 • Xuanpu Zhang, Dan Song, Pengxin Zhan, QingGuo Chen, Kuilong Liu, AnAn Liu
Image-based virtual try-on aims to transfer target in-shop clothing onto an image of a dressed model. Its objectives are to completely remove the original clothing while preserving the content outside the try-on area, to fit the target clothing naturally, and to correctly inpaint the gap between the target clothing and the original clothing.
no code implementations • 10 Oct 2023 • Xiangyu Wu, Yang Yang, Shengdong Xu, Yifeng Wu, QingGuo Chen, Jianfeng Lu
At the data level, inspired by the challenge paper, we categorized all questions into eight types and used the llama-2-chat model to directly generate the type of each question in a zero-shot manner.