Machine Learning Algorithms for Breast Cancer Detection in Mammography Images: A Comparative Study

Breast tumor is the most common type of cancer in women worldwide, representing approximately 12% of reported new cases and 6.5% of cancer deaths in 2018. Mammography screening are extremely important for early detection of breast cancer. The assessment of mammograms is a complex task with significant variability due to professional experience and human errors, an opportunity for assisting tools to improve both reliability and accuracy. The usage of deep learning in medical image analysis have increased, assisting specialists in early detection, diagnosis, treatment or prognosis of diseases. In this article, we compare the performance of XGBoost and VGG16 in the task of breast cancer detection by using digital mammograms from CBIS-DDSM dataset. In addition, we perform a comparison of prediction accuracy between full mammogram images and patches extracted from original images based on ROI annotated by experts. Moreover, we also perform experiments with transfer learning and data augmentation to exploit data diversity, and the ability to extract features and learn from raw unprocessed data. Experimental results show that XGBoost achieves 68.29% in AUC, while VGG16 achieves approximately the same performance of 68.24% in AUC

PDF

Datasets


Task Dataset Model Metric Name Metric Value Global Rank Benchmark
Cancer-no cancer per image classification CBIS-DDSM VGG16 AUC 0.6822 # 18
Cancer-no cancer per image classification CBIS-DDSM XGBoost AUC 0.6849 # 17

Methods


No methods listed for this paper. Add relevant methods here