Common Sense Reasoning

245 papers with code • 24 benchmarks • 52 datasets

Common sense reasoning tasks are intended to require the model to go beyond pattern recognition. Instead, the model should use "common sense" or world knowledge to make inferences.

Large Language Models Need Consultants for Reasoning: Becoming an Expert in a Complex Human System Through Behavior Simulation

hakys-a/meow 27 Mar 2024

In the MEOW framework, simulated data are utilized to train an expert model concentrating ``experience'' about a specific task in each independent time of simulation.

0
27 Mar 2024

Common Sense Enhanced Knowledge-based Recommendation with Large Language Model

ysh-1998/csrec 27 Mar 2024

Knowledge-based recommendation models effectively alleviate the data sparsity issue leveraging the side information in the knowledge graph, and have achieved considerable performance.

0
27 Mar 2024

IllusionVQA: A Challenging Optical Illusion Dataset for Vision Language Models

csebuetnlp/illusionvqa 23 Mar 2024

GPT4V, the best-performing VLM, achieves 62. 99% accuracy (4-shot) on the comprehension task and 49. 7% on the localization task (4-shot and Chain-of-Thought).

1
23 Mar 2024

Hierarchical Spatial Proximity Reasoning for Vision-and-Language Navigation

18979705623/hspr 18 Mar 2024

Most Vision-and-Language Navigation (VLN) algorithms tend to make decision errors, primarily due to a lack of visual common sense and insufficient reasoning capabilities.

3
18 Mar 2024

Hybrid Reasoning Based on Large Language Models for Autonomous Car Driving

mehdiazarafza/hybrid-reasoning 21 Feb 2024

Large Language Models (LLMs) have garnered significant attention for their ability to understand text and images, generate human-like text, and perform complex reasoning tasks.

1
21 Feb 2024

G-Retriever: Retrieval-Augmented Generation for Textual Graph Understanding and Question Answering

xiaoxinhe/g-retriever 12 Feb 2024

Given a graph with textual attributes, we enable users to `chat with their graph': that is, to ask questions about the graph using a conversational interface.

117
12 Feb 2024

HAZARD Challenge: Embodied Decision Making in Dynamically Changing Environments

umass-foundation-model/hazard 23 Jan 2024

Recent advances in high-fidelity virtual environments serve as one of the major driving forces for building intelligent embodied agents to perceive, reason and interact with the physical world.

16
23 Jan 2024

CBVS: A Large-Scale Chinese Image-Text Benchmark for Real-World Short Video Search Scenarios

QQBrowserVideoSearch/CBVS-UniCLIP 19 Jan 2024

Differently, video covers in short video search scenarios are presented as user-originated contents that provide important visual summaries of videos.

5
19 Jan 2024

Large Language Models Are Neurosymbolic Reasoners

hyintell/llmsymbolic 17 Jan 2024

A wide range of real-world applications is characterized by their symbolic nature, necessitating a strong capability for symbolic reasoning.

4
17 Jan 2024

Mixtral of Experts

hit-scir/chinese-mixtral-8x7b 8 Jan 2024

In particular, Mixtral vastly outperforms Llama 2 70B on mathematics, code generation, and multilingual benchmarks.

565
08 Jan 2024