Search Results for author: Jingzhou Luo

Found 2 papers, 1 paper with code

MEIA: Towards Realistic Multimodal Interaction and Manipulation for Embodied Robots

1 code implementation · 1 Feb 2024 · Yang Liu, Xinshuai Song, Kaixuan Jiang, Weixing Chen, Jingzhou Luo, Guanbin Li, Liang Lin

To overcome this limitation, we introduce the Multimodal Embodied Interactive Agent (MEIA), capable of translating high-level tasks expressed in natural language into a sequence of executable actions.

Embodied Question Answering · Language Modelling · +3

VCD: Visual Causality Discovery for Cross-Modal Question Reasoning

No code implementations · 17 Apr 2023 · Yang Liu, Ying Tan, Jingzhou Luo, Weixing Chen

Existing visual question reasoning methods usually fail to explicitly discover the inherent causal mechanism and ignore jointly modeling cross-modal event temporality and causality.
