Search Results for author: Weili Guan

Found 14 papers, 5 papers with code

MMGRec: Multimodal Generative Recommendation with Transformer Model

no code implementations25 Apr 2024 Han Liu, Yinwei Wei, Xuemeng Song, Weili Guan, Yuan-Fang Li, Liqiang Nie

Multimodal recommendation aims to recommend user-preferred candidates based on her/his historically interacted items and associated multimodal information.

Multimodal Recommendation Quantization +1

UniAV: Unified Audio-Visual Perception for Multi-Task Video Localization

1 code implementation4 Apr 2024 Tiantian Geng, Teng Wang, yanfu Zhang, Jinming Duan, Weili Guan, Feng Zheng

Video localization tasks aim to temporally locate specific instances in videos, including temporal action localization (TAL), sound event detection (SED) and audio-visual event localization (AVEL).

audio-visual event localization Event Detection +2

Prompt-based Multi-interest Learning Method for Sequential Recommendation

1 code implementation9 Jan 2024 Xue Dong, Xuemeng Song, Tongliang Liu, Weili Guan

Multi-interest learning method for sequential recommendation aims to predict the next item according to user multi-faceted interests given the user historical interactions.

Sequential Recommendation

Uncovering Hidden Connections: Iterative Tracking and Reasoning for Video-grounded Dialog

no code implementations11 Oct 2023 Haoyu Zhang, Meng Liu, YaoWei Wang, Da Cao, Weili Guan, Liqiang Nie

In response to this gap, we present an iterative tracking and reasoning strategy that amalgamates a textual encoder, a visual encoder, and a generator.

Question Answering Response Generation +1

Knowledge-Aware Prompt Tuning for Generalizable Vision-Language Models

no code implementations ICCV 2023 Baoshuo Kan, Teng Wang, Wenpeng Lu, XianTong Zhen, Weili Guan, Feng Zheng

Pre-trained vision-language models, e. g., CLIP, working with manually designed prompts have demonstrated great capacity of transfer learning.

Few-Shot Image Classification Transfer Learning

Contrast-augmented Diffusion Model with Fine-grained Sequence Alignment for Markup-to-Image Generation

1 code implementation2 Aug 2023 Guojin Zhong, Jin Yuan, Pan Wang, Kailun Yang, Weili Guan, Zhiyong Li

The recently rising markup-to-image generation poses greater challenges as compared to natural image generation, due to its low tolerance for errors as well as the complex sequence and context correlations between markup and rendered image.

Denoising Image Generation

Identity-Guided Collaborative Learning for Cloth-Changing Person Reidentification

no code implementations10 Apr 2023 Zan Gao, Shenxun Wei, Weili Guan, Lei Zhu, Meng Wang, Shenyong Chen

Moreover, human semantic information and pedestrian identity information are not fully explored.

A Semantic-aware Attention and Visual Shielding Network for Cloth-changing Person Re-identification

no code implementations18 Jul 2022 Zan Gao, Hongwei Wei, Weili Guan, Jie Nie, Meng Wang, Shenyong Chen

In addition, a visual clothes shielding module (VCS) is also designed to extract a more robust feature representation for the cloth-changing task by covering the clothing regions and focusing the model on the visual semantic information unrelated to the clothes.

Cloth-Changing Person Re-Identification Semantic Segmentation

Disentangled Graph Neural Networks for Session-based Recommendation

1 code implementation10 Jan 2022 Ansong Li, Zhiyong Cheng, Fan Liu, Zan Gao, Weili Guan, Yuxin Peng

The session embedding is then generated by aggregating the item embeddings with attention weights of each item's factors.

Session-Based Recommendations

A Novel Patch Convolutional Neural Network for View-based 3D Model Retrieval

no code implementations25 Sep 2021 Zan Gao, Yuxiang Shao, Weili Guan, Meng Liu, Zhiyong Cheng, ShengYong Chen

Thus, we tackle this problem from the perspective of exploiting the relationships between patch features to capture long-range associations among multi-view images.

Retrieval

Multigranular Visual-Semantic Embedding for Cloth-Changing Person Re-identification

no code implementations10 Aug 2021 Zan Gao, Hongwei Wei, Weili Guan, Weizhi Nie, Meng Liu, Meng Wang

To solve these issues, in this work, a novel multigranular visual-semantic embedding algorithm (MVSE) is proposed for cloth-changing person ReID, where visual semantic information and human attributes are embedded into the network, and the generalized features of human appearance can be well learned to effectively solve the problem of clothing changes.

Cloth-Changing Person Re-Identification

TBNet:Two-Stream Boundary-aware Network for Generic Image Manipulation Localization

no code implementations10 Aug 2021 Zan Gao, Chao Sun, Zhiyong Cheng, Weili Guan, AnAn Liu, Meng Wang

In this work, a novel end-to-end two-stream boundary-aware network (abbreviated as TBNet) is proposed for generic image manipulation localization in which the RGB stream, the frequency stream, and the boundary artifact location are explored in a unified framework.

Image Manipulation Image Manipulation Localization

Cannot find the paper you are looking for? You can Submit a new open access paper.