A Comprehensive Study on Colorectal Polyp Segmentation with ResUNet++, Conditional Random Field and Test-Time Augmentation

Colonoscopy is considered the gold standard for detection of colorectal cancer and its precursors. Existing examination methods are, however, hampered by high overall miss-rate, and many abnormalities are left undetected. Computer-Aided Diagnosis systems based on advanced machine learning algorithms are touted as a game-changer that can identify regions in the colon overlooked by the physicians during endoscopic examinations, and help detect and characterize lesions. In previous work, we have proposed the ResUNet++ architecture and demonstrated that it produces more efficient results compared with its counterparts U-Net and ResUNet. In this paper, we demonstrate that further improvements to the overall prediction performance of the ResUNet++ architecture can be achieved by using conditional random field and test-time augmentation. We have performed extensive evaluations and validated the improvements using six publicly available datasets: Kvasir-SEG, CVC-ClinicDB, CVC-ColonDB, ETIS-Larib Polyp DB, ASU-Mayo Clinic Colonoscopy Video Database, and CVC-VideoClinicDB. Moreover, we compare our proposed architecture and resulting model with other State-of-the-art methods. To explore the generalization capability of ResUNet++ on different publicly available polyp datasets, so that it could be used in a real-world setting, we performed an extensive cross-dataset evaluation. The experimental results show that applying CRF and TTA improves the performance on various polyp segmentation datasets both on the same dataset and cross-dataset.

PDF Abstract
Task Dataset Model Metric Name Metric Value Global Rank Uses Extra
Training Data
Result Benchmark
Medical Image Segmentation CVC-ClinicDB ResUNet++ + TTA mean Dice 0.9020 # 30
Medical Image Segmentation CVC-ClinicDB ResUNet++ + CRF+ TTA mean Dice 0.9017 # 31
Medical Image Segmentation CVC-ColonDB ResUNet++ + TTA mean Dice 0.8474 # 4
mIoU 0.8466 # 2
Medical Image Segmentation CVC-VideoClinicDB ResUNet++ + TTA Dice 0.8125 # 1
mIoU 0.8467 # 5
Recall 0.6896 # 3
precision 0.6421 # 3
Medical Image Segmentation CVC-VideoClinicDB ResUNet++ + CRF Dice 0.8811 # 4
mIoU 0.8739 # 1
Recall 0.7743 # 2
precision 0.6706 # 1
Medical Image Segmentation CVC-VideoClinicDB ResUNet++ + TTA + CRF Dice 0.8130 # 2
mIoU 0.8477 # 4
Recall 0.6875 # 4
precision 0.6276 # 4
Medical Image Segmentation ETIS-LARIBPOLYPDB ResUNet++ + TTA mIoU 0.7458 # 6
mean Dice 0.6136 # 18
Medical Image Segmentation Kvasir-SEG ResUNet++ + TTA + CRF mean Dice 0.8508 # 39
mIoU 0.7800 # 36
FPS 69.59 # 4

Methods