Search Results for author: Ruifei Ma

Found 1 paper, 1 paper with code

3DMIT: 3D Multi-modal Instruction Tuning for Scene Understanding

1 code implementation • 6 Jan 2024 • Zeju Li, Chao Zhang, Xiaoyan Wang, Ruilong Ren, Yifan Xu, Ruifei Ma, Xiangde Liu

The remarkable potential of multi-modal large language models (MLLMs) in comprehending both vision and language information has been widely acknowledged.

Scene Understanding • Visual Question Answering (VQA)

