Search Results for author: Jingzhou Luo

Found 2 papers, 1 papers with code

MEIA: Towards Realistic Multimodal Interaction and Manipulation for Embodied Robots

1 code implementation • 1 Feb 2024 • Yang Liu, Xinshuai Song, Kaixuan Jiang, Weixing Chen, Jingzhou Luo, Guanbin Li, Liang Lin

To overcome this limitation, we introduce the Multimodal Embodied Interactive Agent (MEIA), capable of translating high-level tasks expressed in natural language into a sequence of executable actions.

Embodied Question Answering Language Modelling +3

110

Paper
Code

VCD: Visual Causality Discovery for Cross-Modal Question Reasoning

no code implementations • 17 Apr 2023 • Yang Liu, Ying Tan, Jingzhou Luo, Weixing Chen

Existing visual question reasoning methods usually fail to explicitly discover the inherent causal mechanism and ignore jointly modeling cross-modal event temporality and causality.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.