Search Results for author: Qirui Jiao

Found 1 papers, 0 papers with code

Enhancing Multimodal Large Language Models with Vision Detection Models: An Empirical Study

no code implementations31 Jan 2024 Qirui Jiao, Daoyuan Chen, Yilun Huang, Yaliang Li, Ying Shen

Despite the impressive capabilities of Multimodal Large Language Models (MLLMs) in integrating text and image modalities, challenges remain in accurately interpreting detailed visual elements.

Hallucination object-detection +3

Cannot find the paper you are looking for? You can Submit a new open access paper.