☆ 4.5 Article

Hepatitis C Virus prediction based on machine learning framework: a real-world case study in Egypt

KNOWLEDGE AND INFORMATION SYSTEMS (2023)

期刊

KNOWLEDGE AND INFORMATION SYSTEMS

卷 65, 期 6, 页码 2595-2617

出版社

SPRINGER LONDON LTD

DOI: 10.1007/s10115-023-01851-4

关键词

Machine learning; Classification; Feature selection; Hepatitis C Virus

类别

Computer Science, Artificial Intelligence Computer Science, Information Systems

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

This study proposes a prediction framework based on machine learning methods to predict Hepatitis C Virus among healthcare workers in Egypt. By utilizing real-world data and performing feature selection, the framework effectively predicts virus infection and improves classification accuracy.

Prediction and classification of diseases are essential in medical science, as it attempts to immune the spread of the disease and discover the infected regions from the early stages. Machine learning (ML) approaches are commonly used for predicting and classifying diseases that are precisely utilized as an efficient tool for doctors and specialists. This paper proposes a prediction framework based on ML approaches to predict Hepatitis C Virus among healthcare workers in Egypt. We utilized real-world data from the National Liver Institute, founded at Menoufiya University (Menoufiya, Egypt). The collected dataset consists of 859 patients with 12 different features. To ensure the robustness and reliability of the proposed framework, we performed two scenarios: the first without feature selection and the second after the features are selected based on sequential forward selection (SFS). Furthermore, the feature subset selected based on the generated features from SFS is evaluated. Naive Bayes, random forest (RF), K-nearest neighbor, and logistic regression are utilized as induction algorithms and classifiers for model evaluation. Then, the effect of parameter tuning on learning techniques is measured. The experimental results indicated that the proposed framework achieved higher accuracies after SFS selection than without feature selection. Moreover, the RF classifier achieved 94.06% accuracy with a minimum learning elapsed time of 0.54 s. Finally, after adjusting the hyperparameter values of the RF classifier, the classification accuracy is improved to 94.88% using only four features.

作者

我是这篇论文的作者

点击您的名字以认领此论文并将其添加到您的个人资料中。

主要评分

4.5

评分不足

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

Explainable Machine Learning Approach for Hepatitis C Diagnosis Using SFS Feature Selection

Ali Mohd Ali, Mohammad R. Hassan, Faisal Aburub, Mohammad Alauthman, Amjad Aldweesh, Ahmad Al-Qerem, Issam Jebreen, Ahmad Nabot

Summary: Hepatitis C is a significant public health concern, and machine learning algorithms have been used to improve the diagnostic process. However, there is a concern about their interpretability.

MACHINES (2023)