Search Results for author: Shan Dong

Found 1 papers, 1 papers with code

All in an Aggregated Image for In-Image Learning

1 code implementation • 28 Feb 2024 • Lei Wang, Wanyu Xu, Zhiqiang Hu, Yihuai Lan, Shan Dong, Hao Wang, Roy Ka-Wei Lee, Ee-Peng Lim

This paper introduces a new in-context learning (ICL) mechanism called In-Image Learning (I$^2$L) that combines demonstration examples, visual cues, and chain-of-thought reasoning into an aggregated image to enhance the capabilities of Large Multimodal Models (e. g., GPT-4V) in multimodal reasoning tasks.

Hallucination In-Context Learning +1

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.