Search Results for author: Huixuan Zhang

Found 3 papers, 1 papers with code

Quantity Matters: Towards Assessing and Mitigating Number Hallucination in Large Vision-Language Models

no code implementations3 Mar 2024 Huixuan Zhang, Junzhe Zhang, Xiaojun Wan

Large-scale vision-language models have demonstrated impressive skill in handling tasks that involve both areas.

Hallucination

EAMA : Entity-Aware Multimodal Alignment Based Approach for News Image Captioning

no code implementations29 Feb 2024 Junzhe Zhang, Huixuan Zhang, Xunjian Yin, Xiaojun Wan

News image captioning requires model to generate an informative caption rich in entities, with the news image and the associated news article.

Image Captioning Sentence

Image Matters: A New Dataset and Empirical Study for Multimodal Hyperbole Detection

1 code implementation1 Jul 2023 Huixuan Zhang, Xiaojun Wan

We create a multimodal detection dataset from Weibo (a Chinese social media) and carry out some studies on it.

Cannot find the paper you are looking for? You can Submit a new open access paper.