BiomedGPT: A Unified and Generalist Biomedical Generative Pre-trained Transformer for Vision, Language, and Multimodal Tasks

Conventional task- and modality-specific artificial intelligence (AI) models are inflexible in real-world deployment and maintenance for biomedicine. At the same time, the growing availability of biomedical data, coupled with the advancements in modern multi-modal multi-task AI techniques, has paved the way for the emergence of generalist biomedical AI solutions. These solutions hold the potential to interpret different medical modalities and produce expressive outputs such as free-text reports or disease diagnosis. Here, we propose BiomedGPT, the first open-source and generalist visual language AI for diverse biomedical tasks. BiomedGPT achieved 16 state-of-the-art results across five clinically significant tasks on 26 datasets. Notably, it outperformed OpenAI's GPT-4 with vision (GPT-4V) in radiology human evaluation and surpassed Google's Med-PaLM M (12B) in breast cancer diagnosis and medical visual question answering. Moreover, BiomedGPT facilitates zero-shot transfer learning, greatly enhancing its utility as a biomedical assistant, similar to ChatGPT. Our method demonstrates effective training with diverse datasets can lead to more practical biomedical AI.

PDF Abstract

Results from the Paper


Task Dataset Model Metric Name Metric Value Global Rank Uses Extra
Training Data
Result Benchmark
Image Captioning IU X-Ray BiomedGPT CIDEr 36.0 # 1
Natural Language Inference MedNLI BiomedGPT-B Accuracy 83.83 # 4
Text Summarization MeQSum BiomedGPT RougeL 52.3 # 1
Medical Visual Question Answering PathVQA BiomedGPT Free-form Accuracy 28.0 # 4
Yes/No Accuracy 88.0 # 2
Overall Accuracy 58.1 # 4
Image Captioning Peir Gross BiomedGPT CIDEr 122.7 # 1
METEOR 15.4 # 1
ROUGE-L 30.2 # 1
Medical Visual Question Answering SLAKE-English BiomedGPT Overall Accuracy 86.1 # 1
Close-ended Accuracy 89.9 # 2
Open-ended Accuracy 84.3 # 1
Medical Visual Question Answering VQA-RAD BiomedGPT Close-ended Accuracy 81.3 # 9
Open-ended Accuracy 60.9 # 7
Overall Accuracy 73.2 # 8

Methods