4.7 Article

Cytokine gene variants and socio-demographic characteristics as predictors of cervical cancer: A machine learning approach

Journal

COMPUTERS IN BIOLOGY AND MEDICINE
Volume 134, Issue -, Pages -

Publisher

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.compbiomed.2021.104559

Keywords

Artificial intelligence; Bioinformatics; Cervical cancer; Computational biology; Cytokine gene polymorphisms; Machine learning

Funding

  1. Department of Science and Technology (DST), New Delhi, India
  2. Centre of Excellence, Higher Education, Government of Uttar Pradesh, Lucknow, India
  3. Indian Council of Medical Research (ICMR)

This study used machine learning models to analyze risk factors for cervical cancer, focusing on cytokine gene variants and socio-demographic characteristics, with the aim of better prognosis and prediction. Among the machine learning approaches evaluated, the logistic regression technique achieved the highest accuracy and F1-score, while ridge classifiers and the Gaussian Naïve Bayes classifier demonstrated the highest sensitivity. The study suggests that machine learning analysis of cytokine gene variants and socio-demographic characteristics can effectively predict the risk of developing cervical cancer.
Cervical cancer is still one of the most prevalent cancers in women and a significant cause of mortality. Cytokine gene variants and socio-demographic characteristics have been reported as biomarkers for determining the cervical cancer risk in the Indian population. This study was designed to apply a machine learning-based model using these risk factors for better prognosis and prediction of cervical cancer. This study includes the dataset of cytokine gene variants, clinical and socio-demographic characteristics of normal healthy control subjects, and cervical cancer cases. Different risk factors, including demographic details and cytokine gene variants, were analysed using different machine learning approaches. Various statistical parameters were used for evaluating the proposed method. After multi-step data processing and random splitting of the dataset, machine learning methods were applied and evaluated with 5-fold cross-validation and also tested on the unseen data records of a collected dataset for proper evaluation and analysis. The proposed approaches were verified after analysing various performance metrics. The logistic regression technique achieved the highest average accuracy of 82.25% and the highest average F1-score of 82.58% among all the methods. Ridge classifiers and the Gaussian Naïve Bayes classifier achieved the highest sensitivity of 85%. The ridge classifier surpasses most of the machine learning classifiers with 84.78% accuracy and 97.83% sensitivity. The risk factors analysed in this study can be taken as biomarkers in developing a cervical cancer diagnosis system. The outcomes demonstrate that the machine learning assisted analysis of cytokine gene variants and socio-demographic characteristics can be utilised effectively for predicting the risk of developing cervical cancer.
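
The abstract reports accuracy, sensitivity, and F1-score under 5-fold cross-validation but does not give the evaluation code. The following is a minimal sketch of how such metrics and fold splits could be computed, using only the Python standard library. It is not the authors' implementation: the function names (`confusion_counts`, `k_fold_indices`, and so on), the choice of label 1 for a cervical cancer case, and the toy labels in the demo are all assumptions made for illustration.

```python
import random
from typing import List, Sequence, Tuple


def confusion_counts(y_true: Sequence[int], y_pred: Sequence[int]) -> Tuple[int, int, int, int]:
    """Return (tp, fp, fn, tn); label 1 is assumed to mean 'case', 0 'control'."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    return tp, fp, fn, tn


def accuracy(tp: int, fp: int, fn: int, tn: int) -> float:
    """Fraction of all subjects classified correctly."""
    total = tp + fp + fn + tn
    return (tp + tn) / total if total else 0.0


def sensitivity(tp: int, fn: int) -> float:
    """Fraction of true cases that the classifier flags (recall)."""
    return tp / (tp + fn) if (tp + fn) else 0.0


def f1_score(tp: int, fp: int, fn: int) -> float:
    """Harmonic mean of precision and recall, written as 2TP / (2TP + FP + FN)."""
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def k_fold_indices(n_samples: int, k: int = 5, seed: int = 0) -> List[Tuple[List[int], List[int]]]:
    """Shuffle indices once, then return k (train, test) index pairs."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    folds = [indices[i::k] for i in range(k)]  # k nearly equal-sized folds
    splits = []
    for i in range(k):
        test_idx = folds[i]
        train_idx = [j for m, fold in enumerate(folds) if m != i for j in fold]
        splits.append((train_idx, test_idx))
    return splits


if __name__ == "__main__":
    # Toy labels, invented for illustration only (not data from the study).
    y_true = [1, 1, 1, 1, 0, 0, 0, 0, 1, 0]
    y_pred = [1, 1, 1, 0, 0, 0, 1, 0, 1, 0]
    tp, fp, fn, tn = confusion_counts(y_true, y_pred)
    print("accuracy:", accuracy(tp, fp, fn, tn))
    print("sensitivity:", sensitivity(tp, fn))
    print("F1-score:", f1_score(tp, fp, fn))
    for train_idx, test_idx in k_fold_indices(len(y_true), k=5):
        print("test fold:", sorted(test_idx))
```

In a cross-validated evaluation like the one described, a classifier would be fitted on each `train_idx` subset and scored on the matching `test_idx` subset, and the per-fold metrics would then be averaged to give figures comparable to the "average accuracy" and "average F1-score" quoted above.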