Medical Report Generation

29 papers with code • 2 benchmarks • 3 datasets

Medical report generation (MRG) is a task which focus on training AI to automatically generate professional report according the input image data. This can help clinicians make faster and more accurate decision since the task itself is both time consuming and error prone even for experienced doctors.

Deep neural network and transformer based architecture are currently the most popular methods for this certain task, however, when we try to transfer out pre-trained model into this certain domain, their performance always degrade.

The following are some of the reasons why RSG is hard for pre-trained models:

Language datasets in a particular domain can sometimes be quite different from the large number of datasets available on the Internet
During the fine-tuning phase, datasets in the medical field are often unevenly distributed

More recently, multi-modal learning and contrastive learning have shown some inspiring results in this field, but it's still challenging and requires further attention.

Here are some additional readings to go deeper on the task:

On the Automatic Generation of Medical Imaging Reports

https://doi.org/10.48550/arXiv.1711.08195

A scoping review of transfer learning research on medical image analysis using ImageNet

https://arxiv.org/abs/2004.13175

A Survey on Incorporating Domain Knowledge into Deep Learning for Medical Image Analysis

https://arxiv.org/abs/2004.12150

(Image credit : Transformers in Medical Imaging: A Survey)

Benchmarks

Add a Result

These leaderboards are used to track progress in Medical Report Generation

Trend	Dataset	Best Model	Paper	Code	Compare
	MIMIC-CXR	RGRG			See all
	HistGen WSI-Report Dataset	HistGen			See all

Libraries

Use these libraries to find Medical Report Generation models and implementations

hcplab-sysu/causalvlr

2 papers

110

Datasets

Latest papers

Most implemented Social Latest No code

Multi-modal Pre-training for Medical Vision-language Understanding and Generation: An Empirical Study with A New Benchmark

control-xl/medical-vision-langauge-transformer • • 10 Jun 2023

With the availability of large-scale, comprehensive, and general-purpose vision-language (VL) datasets such as MSCOCO, vision-language pre-training (VLP) has become an active area of research and proven to be effective for various VL tasks such as visual-question answering.

10 Jun 2023

Paper
Code

Automatic Radiology Report Generation by Learning with Increasingly Hard Negatives

bhanu068/ithn • • 11 May 2023

At each iteration, conditioned on a given set of hard negative reports, image and report features are learned as usual by minimising the loss functions related to report generation.

11 May 2023

Paper
Code

Interactive and Explainable Region-guided Radiology Report Generation

ttanida/rgrg • • CVPR 2023

While previous methods generate reports without the possibility of human intervention and with limited explainability, our method opens up novel clinical use cases through additional interactive capabilities and introduces a high degree of transparency and explainability.

107

17 Apr 2023

Paper
Code

Dynamic Graph Enhanced Contrastive Learning for Chest X-ray Report Generation

mlii0117/dcl • • CVPR 2023

To address the limitation, we propose a knowledge graph with Dynamic structure and nodes to facilitate medical report generation with Contrastive Learning, named DCL.

18 Mar 2023

Paper
Code

Cross-Modal Causal Intervention for Medical Report Generation

hcplab-sysu/causalvlr • • 16 Mar 2023

Medical report generation (MRG) is essential for computer-aided diagnosis and medication guidance, which can relieve the heavy burden of radiologists by automatically generating the corresponding medical reports according to the given radiology image.

110

16 Mar 2023

Paper
Code

Lesion Guided Explainable Few Weak-shot Medical Report Generation

jinghansunn/few-weak-shot-rg • • 16 Nov 2022

To this end, we propose a lesion guided explainable few weak-shot medical report generation framework that learns correlation between seen and novel classes through visual and semantic feature alignment, aiming to generate medical reports for diseases not observed in training.

16 Nov 2022

Paper
Code

DeltaNet:Conditional Medical Report Generation for COVID-19 Diagnosis

lx-doctorai1/deltanet • • 12 Nov 2022

To reduce the workload of radiologists, we propose DeltaNet to generate medical reports automatically.

12 Nov 2022

Paper
Code

M^4I: Multi-modal Models Membership Inference

multimodalmi/multimodal-membership-inference • • 15 Sep 2022

To achieve this, we propose Multi-modal Models Membership Inference (M^4I) with two attack methods to infer the membership status, named metric-based (MB) M^4I and feature-based (FB) M^4I, respectively.

15 Sep 2022

Paper
Code

A Benchmark for Automatic Medical Consultation System: Frameworks, Tasks and Datasets

lemuria-wchen/imcs21 • • 19 Apr 2022

In recent years, interest has arisen in using machine learning to improve the efficiency of automatic medical consultation and enhance patient experience.

19 Apr 2022

Paper
Code

Transformers in Medical Imaging: A Survey

fahadshamshad/awesome-transformers-in-medical-imaging • • 24 Jan 2022

Following unprecedented success on the natural language tasks, Transformers have been successfully applied to several computer vision problems, achieving state-of-the-art results and prompting researchers to reconsider the supremacy of convolutional neural networks (CNNs) as {de facto} operators.

1,111

24 Jan 2022

Paper
Code

Medical Report Generation

Benchmarks Add a Result

Libraries

Datasets

Latest papers

Content

Benchmarks

Add a Result