Search Results for author: Songtao Jiang

Found 2 papers, 2 papers with code

MoE-TinyMed: Mixture of Experts for Tiny Medical Large Vision-Language Models

1 code implementation • 16 Apr 2024 • Songtao Jiang, Tuo Zheng, Yan Zhang, Yeying Jin, Zuozhu Liu

Mixture of Expert Tuning (MoE-Tuning) has effectively enhanced the performance of general MLLMs with fewer parameters, yet its application in resource-limited medical settings has not been fully explored.

Visual Question Answering (VQA)

Paper
Code

Joint Visual and Text Prompting for Improved Object-Centric Perception with Multimodal Large Language Models

2 code implementations • 6 Apr 2024 • Songtao Jiang, Yan Zhang, Chenyi Zhou, Yeying Jin, Yang Feng, Jian Wu, Zuozhu Liu

In this paper, we present a novel approach, Joint Visual and Text Prompting (VTPrompt), that employs fine-grained visual information to enhance the capability of MLLMs in VQA, especially for object-oriented perception.

Object Question Answering +1

189

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.