6 Jan 2024 • Zeju Li, Chao Zhang, Xiaoyan Wang, Ruilong Ren, Yifan Xu, Ruifei Ma, Xiangde Liu
The remarkable potential of multi-modal large language models (MLLMs) in comprehending both visual and language information has been widely acknowledged.