Search Results for author: Chenhang Cui

Found 6 papers, 5 papers with code

Aligning Modalities in Vision Large Language Models via Preference Fine-tuning

1 code implementation • 18 Feb 2024 • Yiyang Zhou, Chenhang Cui, Rafael Rafailov, Chelsea Finn, Huaxiu Yao

This procedure is not perfect and can cause the model to hallucinate - provide answers that do not accurately reflect the image, even when the core LLM is highly factual and the vision backbone has sufficiently complete representations.

Hallucination Instruction Following +1

Paper
Code

How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs

1 code implementation • 27 Nov 2023 • Haoqin Tu, Chenhang Cui, Zijun Wang, Yiyang Zhou, Bingchen Zhao, Junlin Han, Wangchunshu Zhou, Huaxiu Yao, Cihang Xie

Different from prior studies, we shift our focus from evaluating standard performance to introducing a comprehensive safety evaluation suite, covering both out-of-distribution (OOD) generalization and adversarial robustness.

Adversarial Robustness Visual Question Answering (VQA) +1

Paper
Code

Holistic Analysis of Hallucination in GPT-4V(ision): Bias and Interference Challenges

1 code implementation • 6 Nov 2023 • Chenhang Cui, Yiyang Zhou, Xinyu Yang, Shirley Wu, Linjun Zhang, James Zou, Huaxiu Yao

To bridge this gap, we introduce a new benchmark, namely, the Bias and Interference Challenges in Visual Language Models (Bingo).

Hallucination

Paper
Code

Analyzing and Mitigating Object Hallucination in Large Vision-Language Models

1 code implementation • 1 Oct 2023 • Yiyang Zhou, Chenhang Cui, Jaehong Yoon, Linjun Zhang, Zhun Deng, Chelsea Finn, Mohit Bansal, Huaxiu Yao

Large vision-language models (LVLMs) have shown remarkable abilities in understanding visual information with human languages.

Hallucination Hallucination Evaluation +1

101

Paper
Code

Bright Channel Prior Attention for Multispectral Pedestrian Detection

no code implementations • 22 May 2023 • Chenhang Cui, Jinyu Xie, Yechenhao Yang

The method uses the V-channel of the HSV image of the thermal image as an attention map to trigger the unsupervised auto-encoder for visible light images, which gradually emphasizes pedestrian features across layers.

Image Enhancement object-detection +2

Paper
Add Code

Deep Multi-View Subspace Clustering with Anchor Graph

1 code implementation • 11 May 2023 • Chenhang Cui, Yazhou Ren, Jingyu Pu, Xiaorong Pu, Lifang He

To significantly reduce the complexity, we construct an anchor graph with small size for each view.

Clustering Contrastive Learning +1

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.