Search Results for author: Vernon Toh Yan Han

Found 2 papers, 2 papers with code

PuzzleVQA: Diagnosing Multimodal Reasoning Challenges of Language Models with Abstract Visual Patterns

2 code implementations • 20 Mar 2024 • Yew Ken Chia, Vernon Toh Yan Han, Deepanway Ghosal, Lidong Bing, Soujanya Poria

As recognizing patterns and abstracting concepts are key to general intelligence, we introduce PuzzleVQA, a collection of puzzles based on abstract patterns.

Multimodal Reasoning

Are Language Models Puzzle Prodigies? Algorithmic Puzzles Unveil Serious Challenges in Multimodal Reasoning

2 code implementations • 6 Mar 2024 • Deepanway Ghosal, Vernon Toh Yan Han, Chia Yew Ken, Soujanya Poria

We present a new dataset, AlgoPuzzleVQA, designed to challenge and evaluate the capabilities of multimodal language models in solving algorithmic puzzles that necessitate visual understanding, language understanding, and complex algorithmic reasoning.

Multimodal Reasoning • Question Answering • +1
