A precise machine learning model: Detecting cervical cancer using feature selection and explainable AI

Cervical cancer is a cancer that remains a significant global health challenge all over the world. Due to improper screening in the early stages, and healthcare disparities, a large number of women are suffering from this disease, and the mortality rate increases day by day. Hence, in these studies,...

Full description

Saved in:
Bibliographic Details
Main Authors: Rashiduzzaman Shakil (Author), Sadia Islam (Author), Bonna Akter (Author)
Format: Book
Published: Elsevier, 2024-12-01T00:00:00Z.
Subjects:
Online Access:Connect to this object online.
Tags: Add Tag
No Tags, Be the first to tag this record!

MARC

LEADER 00000 am a22000003u 4500
001 doaj_4e7b8f18f34c48d992eb3ebab085ce24
042 |a dc 
100 1 0 |a Rashiduzzaman Shakil  |e author 
700 1 0 |a Sadia Islam  |e author 
700 1 0 |a Bonna Akter  |e author 
245 0 0 |a A precise machine learning model: Detecting cervical cancer using feature selection and explainable AI 
260 |b Elsevier,   |c 2024-12-01T00:00:00Z. 
500 |a 2153-3539 
500 |a 10.1016/j.jpi.2024.100398 
520 |a Cervical cancer is a cancer that remains a significant global health challenge all over the world. Due to improper screening in the early stages, and healthcare disparities, a large number of women are suffering from this disease, and the mortality rate increases day by day. Hence, in these studies, we presented a precise approach utilizing six different machine learning models (decision tree, logistic regression, naïve bayes, random forest, k nearest neighbors, support vector machine), which can predict the early stage of cervical cancer by analysing 36 risk factor attributes of 858 individuals. In addition, two data balancing techniques-Synthetic Minority Oversampling Technique and Adaptive Synthetic Sampling-were used to mitigate the data imbalance issues. Furthermore, Chi-square and Least Absolute Shrinkage and Selection Operator are two distinct feature selection processes that have been applied to evaluate the feature rank, which are mostly correlated to identify the particular disease, and also integrate an explainable artificial intelligence technique, namely Shapley Additive Explanations, for clarifying the model outcome. The applied machine learning model outcome is evaluated by performance evaluation matrices, namely accuracy, sensitivity, specificity, precision, f1-score, false-positive rate and false-negative rate, and area under the Receiver operating characteristic curve score. The decision tree outperformed in Chi-square feature selection with outstanding accuracy with 97.60%, 98.73% sensitivity, 80% specificity, and 98.73% precision, respectively. During the data imbalance, DT performed 97% accuracy, 99.35% sensitivity, 69.23% specificity, and 97.45% precision. This research is focused on developing diagnostic frameworks with automated tools to improve the detection and management of cervical cancer, as well as on helping healthcare professionals deliver more efficient and personalized care to their patients. 
546 |a EN 
690 |a Cervical cancer 
690 |a SMOTE 
690 |a ADASYN 
690 |a Chi-square 
690 |a LASSO 
690 |a Machine learning 
690 |a Computer applications to medicine. Medical informatics 
690 |a R858-859.7 
690 |a Pathology 
690 |a RB1-214 
655 7 |a article  |2 local 
786 0 |n Journal of Pathology Informatics, Vol 15, Iss , Pp 100398- (2024) 
787 0 |n http://www.sciencedirect.com/science/article/pii/S2153353924000373 
787 0 |n https://doaj.org/toc/2153-3539 
856 4 1 |u https://doaj.org/article/4e7b8f18f34c48d992eb3ebab085ce24  |z Connect to this object online.