Analysis and Visualization of Breast Cancer Prediction through Machine Learning Models

Olukayode Felix Ayepeku

Abstract


This research presents an in-depth exploration of breast cancer prediction through the application of machine learning models, specifically focusing on Logistic Regression, K-Nearest Neighbors, Support Vector Classifier, 'Decision Tree Classifier, Random Forest Classifier, Gradient Boosting Classifier, AdaBoost Classifier, and XGBoost Classifier. The study utilizes a comprehensive dataset comprising clinical features extracted from Kaggle. Various algorithms are employed, and a meticulous analysis of precision, recall, F1-score, and accuracy is conducted to assess model performance. Through advanced visualization techniques and statistical analysis, the research provides insights into the effectiveness of machine learning models in predicting breast cancer. The outcomes of this study aim to contribute valuable knowledge to the field of medical diagnostics, emphasizing the importance of machine learning methodologies in enhancing breast cancer prediction and classification.

Full Text:

PDF

References


reddy, Anuradha. 2022. “Support Vector Machine Classifier For Prediction Of Breast

Malignancy Using Wisconsin Breast Cancer Dataset.” Journal of Artificial Intelligence, Machine Learning and Neural Network, no. 21 (January): 1–8. https://doi.org/10.55529/jaimlnn.21.1.8.

Rautalin, Mervi, Tiina Jahkola, and Risto P. Roine. 2022. “Breast Reconstruction–

Prospective Follow up on Breast Cancer Patients’ Health-Related Quality of Life.” World Journal of Surgery 46, no. 4 (January): 836–44. https://doi.org/10.1007/s00268-021-06426-4.

Gupta, Puja, and Shruti Garg. 2020. “Breast Cancer Prediction Using Varying

Parameters of Machine Learning Models.” Procedia Computer Science 171: 593–601. https://doi.org/10.1016/j.procs.2020.04.064.

Mutasa, Simukayi, Peter Chang, John Nemer, Eduardo Pascual Van Sant, Mary Sun, Alison McIlvride, Maham Siddique, and Richard Ha. 2020. “Prospective Analysis Using a Novel CNN Algorithm to Distinguish Atypical Ductal Hyperplasia From Ductal Carcinoma in Situ in Breast.” Clinical Breast Cancer 20, no. 6 (December):e757–60. https://doi.org/10.1016/j.clbc.2020.06.001.

Dong, Lin, and Kohei Inoue. 2020. “Diagnosis of Breast Cancer from Mammogram,Images Based on CNN.” Journal of the Institute of Industrial Applications Engineers 8, no. 4 (October): 117–21. https://doi.org/10.12792/jiiae.8.117.

Sunardi, Sunardi, Anton Yudhana, and Anggi Rizky WindraPutri. 2022. “Mass

Classification of Breast Cancer Using CNN and Faster R-CNN Model Comparison.” Kinetik: Game Technology, Information System, Computer Network, Computing, Electronics, and Control, August (August). https://doi.org/10.22219/kinetik.v7i3.1462.

Fatima, Noreen, Li Liu, Sha Hong, and Haroon Ahmed. 2020. “Prediction of Breast

Cancer, Comparative Review of Machine Learning Techniques, and Their Analysis.” IEEE Access 8: 150360–76. https://doi.org/10.1109/access.2020.3016715.

Wang, Yuqun, Lei Tang, Pingping Chen, and Man Chen. 2023. “The Role of a Deep

Learning-Based Computer-Aided Diagnosis System and Elastography in Reducing Unnecessary Breast Lesion Biopsies.” Clinical Breast Cancer 23, no. 3 (April): e112–21. https://doi.org/10.1016/j.clbc.2022.12.016.

Khan, Farrukh, Muhammad Adnan Khan, Sagheer Abbas, Atifa Athar, Shahan Yamin

Siddiqui, Abdul Hannan Khan, Muhammad Anwaar Saeed, and Muhammad Hussain. 2020. “Cloud-Based Breast Cancer Prediction Empowered with Soft Computing Approaches.” Journal of Healthcare Engineering 2020, no. May (May):1–16. https://doi.org/10.1155/2020/8017496.

Matsuo, Koji, Kazuhiro Tanabe, Masaru Hayashi, Masae Ikeda, Miwa Yasaka, Hiroko Machida, Masako Shida, et al. 2020. “Utility of Comprehensive Serum Glycopeptide Spectra Analysis (CSGSA) for the Detection of Early Stage Epithelial Ovarian Cancer.” Cancers 12, no. 9 (August): 2374. https://doi.org/10.3390/cancers12092374.

Mallika, Madasu, and K. Suresh Babu. 2023. “Breast Cancer Prediction Using Machine Learning Algorithms.” International Journal of Science and Research (IJSR) 12, no. 10 (October): 1235–38. https://doi.org/10.21275/sr231015173828.

Allugunti, Viswanatha Reddy. 2022. “Breast Cancer Detection Based on Thermographic Images Using Machine Learning and Deep Learning Algorithms.”

International Journal of Engineering in Computer Science 4, no. 1 (January): 49–56. https://doi.org/10.33545/26633582.2022.v4.i1a.68.

Meenalochini, G., and S. Ramkumar. 2021. “Survey of Machine Learning Algorithms

for Breast Cancer Detection Using Mammogram Images.” Materials Today: Proceedings 37: 2738–43. https://doi.org/10.1016/j.matpr.2020.08.543.

Twala, Bhekisipho, and Eamon Molloy. 2022. “On Effectively Predicting Autism

Spectrum Disorder Using an Ensemble of Classifiers.” SSRN Electronic Journal. https://doi.org/10.2139/ssrn.4215300.

Altinok, Oya, and Albert Guvenis. 2023. “Interpretable Radiomics Method for

Predicting Human Papillomavirus Status in Oropharyngeal Cancer Using Bayesian Networks.” Physica Medica 114, no. October (October): 102671. https://doi.org/10.1016/j.ejmp.2023.102671.

Jinbo, Kazumi, Takaaki Fujita, Ryuichi Kasahara, Ryohei Jinbo, Sayaka Kisara, Jun

Onobe, Izo Kimijima, Mitsuhiko Yasuda, and Yuichi Yamamoto. 2023. “The Effect of Combined Risk Factors on Breast Cancer-Related Lymphedema: A Study Using Decision Trees.” Breast Cancer 30, no. 4 (March): 685–88. https://doi.org/10.1007/s12282-023-01450-9.

Al-Azzam, Nosayba, and Ibrahem Shatnawi. 2021. “Comparing Supervised and Semi-Supervised Machine Learning Models on Diagnosing Breast Cancer.” Annals of Medicine and Surgery 62, no. February (February): 53–64. https://doi.org/10.1016/j.amsu.2020.12.043.

Huang, Zexian, and Daqi Chen. 2022. “A Breast Cancer Diagnosis Method Based on

VIM Feature Selection and Hierarchical Clustering Random Forest Algorithm.” IEEE Access 10: 3284–93. https://doi.org/10.1109/access.2021.3139595.

Kim, Jongmo, Jaewoong Kang, and Mye Sohn. 2021. “Ensemble Learning-Based

Filter-Centric Hybrid Feature Selection Framework for High-Dimensional Imbalanced Data.” Knowledge-Based Systems 220, no. May (May): 106901. https://doi.org/10.1016/j.knosys.2021.106901.




DOI: https://doi.org/10.32520/stmsi.v13i3.4100

Article Metrics

Abstract view : 77 times
PDF - 33 times

Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.