A Supervised Classification Phenotyping Approach using Machine Learning for Patients Diagnosed with Primary Breast Cancer

Bashir Ahmad*, Burhan Ullah**, Fouzia Sardar***, Hazrat Junaid****, Gul Zaman Khan*****
*-** Department of Computer Science, Ghazi Umara Khan Degree College, University of Malakand, Dir Lower, Pakistan.
*** Department of Zoology, University of Malakand, Dir lower, Pakistan.
**** Department of Computer Science and Information Technology, University of Malakand Dir lower, Pakistan.
***** Department of Software Engineering, University of Engineering and Technology, Mardan, Pakistan.
Periodicity:April - June'2023
DOI : https://doi.org/10.26634/jcom.11.1.19374

Abstract

This paper presents a methodology for the early detection and diagnosis of breast cancer using the Wisconsin dataset. The methodology involves four main steps, including data collection, preprocessing, feature selection, and classification. Fine needle aspiration technique is used to extract the ultrasound image features of breast cancer, and preprocessing is performed to eliminate outliers, null values, and noise. Redundant parameters are removed during the feature selection process to improve accuracy. Six machine learning algorithms, including Logistic Regression, Support Vector Machine, K-Nearest Neighbor, Random Forest, Decision Tree, and Gaussian Naive Bayes, are employed for the classification of the breast cancer dataset. Support Vector Machine and K-Nearest Neighbor achieved the highest accuracy, with Logistic Regression, Gaussian Naive Bayes, Random Forest, and Decision Tree having lower accuracy scores. The proposed methodology could aid in the timely detection and diagnosis of breast cancer, and help doctors in selecting the optimal clinical treatment plan for their patients. Further work will be carried out to investigate the effectiveness of additional preprocessing algorithms in improving the classification accuracy of the breast cancer dataset.

Keywords

Machine Learning, Breast Cancer, Supervised Algorithm, Preprocessing.

How to Cite this Article?

Ahmad, B., Ullah, B., Sardar, F., Junaid, H., and Khan, G. Z. (2023). A Supervised Classification Phenotyping Approach using Machine Learning for Patients Diagnosed with Primary Breast Cancer. i-manager’s Journal on Computer Science, 11(1), 1-11. https://doi.org/10.26634/jcom.11.1.19374

References

[19]. Sinha, N. K., Khulal, M., Gurung, M., & Lal, A. (2020). Developing a web based system for breast cancer prediction using xgboost classifier. International Journal of Engineering Research Technology (IJERT), 9(6), 852-856.
[20]. Sivapriya, J., Kumar, A., Sai, S. S., & Sriram, S. (2019). Breast cancer prediction using machine learning. International Journal of Recent Technology and Engineering (IJRTE), 8(4), 4879-4881.
If you have access to this article please login to view the article or kindly login to purchase the article

Purchase Instant Access

Single Article

North Americas,UK,
Middle East,Europe
India Rest of world
USD EUR INR USD-ROW
Pdf 35 35 200 20
Online 35 35 200 15
Pdf & Online 35 35 400 25

Options for accessing this content:
  • If you would like institutional access to this content, please recommend the title to your librarian.
    Library Recommendation Form
  • If you already have i-manager's user account: Login above and proceed to purchase the article.
  • New Users: Please register, then proceed to purchase the article.