4.4 Review

Comparison of Multivariable Logistic Regression and Other Machine Learning Algorithms for Prognostic Prediction Studies in Pregnancy Care: Systematic Review and Meta-Analysis

Journal

JMIR MEDICAL INFORMATICS
Volume 8, Issue 11, Pages -

Publisher

JMIR PUBLICATIONS, INC
DOI: 10.2196/16503

Keywords

machine learning; pregnancy complications; prognosis; clinical prediction rule; meta-analysis; systematic review

Funding

  1. Ministry of Science and Technology of Taiwan [MOST108-2221-E-038-018, MOST109-2221-E-038-018]

Ask authors/readers for more resources

Background: Predictions in pregnancy care are complex because of interactions among multiple factors. Hence, pregnancy outcomes are not easily predicted by a single predictor using only one algorithm or modeling method. Objective: This study aims to review and compare the predictive performances between logistic regression (LR) and other machine learning algorithms for developing or validating a multivariable prognostic prediction model for pregnancy care to inform clinicians' decision making. Methods: Research articles from MEDLINE, Scopus, Web of Science, and Google Scholar were reviewed following several guidelines for a prognostic prediction study, including a risk of bias (ROB) assessment. We report the results based on the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) guidelines. Studies were primarily framed as PICOTS (population, index, comparator, outcomes, timing, and setting): Population: men or women in procreative management, pregnant women, and fetuses or newborns; Index: multivariable prognostic prediction models using non-LR algorithms for risk classification to inform clinicians' decision making; Comparator: the models applying an LR; Outcomes: pregnancy-related outcomes of procreation or pregnancy outcomes for pregnant women and fetuses or newborns; Timing: pre-, inter-, and peripregnancy periods (predictors), at the pregnancy, delivery, and either puerperal or neonatal period (outcome), and either short- or long-term prognoses (time interval); and Setting: primary care or hospital. The results were synthesized by reporting study characteristics and ROBs and by random effects modeling of the difference of the logit area under the receiver operating characteristic curve of each non-LR model compared with the LR model for the same pregnancy outcomes. We also reported between-study heterogeneity by using tau(2) and I-2. Results: Of the 2093 records, we included 142 studies for the systematic review and 62 studies for a meta-analysis. Most prediction models used LR (92/142, 64.8%) and artificial neural networks (20/142, 14.1%) among non-LR algorithms. Only 16.9% (24/142) of studies had a low ROB. A total of 2 non-LR algorithms from low ROB studies significantly outperformed LR. The first algorithm was a random forest for preterm delivery (logit AUROC 2.51, 95% CI 1.49-3.53; I-2=86%; tau(2)=0.77) and pre-eclampsia (logit AUROC 1.2, 95% CI 0.72-1.67; I-2=75%; tau(2)=0.09). The second algorithm was gradient boosting for cesarean section (logit AUROC 2.26, 95% CI 1.39-3.13; I-2=75%; tau(2)=0.43) and gestational diabetes (logit AUROC 1.03, 95% CI 0.69-1.37; I-2=83%; tau(2)=0.07). Conclusions: Prediction models with the best performances across studies were not necessarily those that used LR but also used random forest and gradient boosting that also performed well. We recommend a reanalysis of existing LR models for several pregnancy outcomes by comparing them with those algorithms that apply standard guidelines.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.4
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

Article Oncology

Developing an International Classification of Functioning, Disability and Health Core Set for Pediatric Brain Tumor Survivors in Chinese Clinical Settings

Shu-Tai Shen Hsiao, Chao-Yang Kuo, Tsan-Hon Liou, Tai-Ton Wang, Yen-Lin Liu, Sung-Hui Tseng

Summary: This study developed an ICF core set for interviewing pediatric brain tumor survivors in Taiwan, aiming to assist healthcare professionals in implementing disability assessment and management measures.

CANCER NURSING (2023)

Article Multidisciplinary Sciences

Questionnaire-free machine-learning method to predict depressive symptoms among community-dwelling older adults

Sri Susanty, Herdiantri A. Sufriyana, Emily Chia-Yu A. Su, Yeu-Hui A. Chuang

Summary: This study aimed to develop and validate a questionnaire-free, machine-learning model as an alternative triage test for the GDS-15 among community-dwelling older adults. The best models were the random forest (RF) and deep-insight visible neural network by internal validation, but both performances were undifferentiated by external validation. The AUROC of the RF model was 0.619 (95% CI 0.610 to 0.627) for the external validation set with a non-local ethnic group.

PLOS ONE (2023)

Article Microbiology

Genome-centered metagenomics illuminates adaptations of core members to a partial Nitritation-Anammox bioreactor under periodic microaeration

Yung-Hsien Shao, Yu-Wei Wu, Muhammad Naufal, Jer-Horng Wu

Summary: In this study, a metagenomic approach was used to analyze the genomic contents of core members in a partial nitritation-anaerobic ammonium oxidation (PN-A) reactor treating low-strength ammonium wastewater. The analysis showed the metabolic traits and predicted microbial interactions of 18 core species, providing new insights into microbial adaptation to the intermittent microaeration specific to the PN-A reactor and its application to low-strength ammonium wastewater.

FRONTIERS IN MICROBIOLOGY (2023)

Article Biochemistry & Molecular Biology

A Cross-Validated Feature Selection (CVFS) approach for extracting the most parsimonious feature sets and discovering potential antimicrobial resistance (AMR) biomarkers

Ming-Ren Yang, Yu-Wei Wu

Summary: Understanding the mechanisms of genes is crucial in studying antimicrobial-resistant bacteria. Genes related to antimicrobial resistance can be used as biomarkers for predicting bacterial resistance to antibiotics. We developed a Cross-Validated Feature Selection approach for selecting the most predictive gene sets from bacterial genomes. By testing this approach on bacterial pan-genome datasets, we found that it successfully identified both known and novel antimicrobial resistance genes, demonstrating its potential as a tool for expanding resistance gene databases.

COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL (2023)

Article Biochemistry & Molecular Biology

Bioinformatic Analysis Reveals both Oversampled and Underexplored Biosynthetic Diversity in Nonribosomal Peptides

Bo-Siyuan Jian, Shao-Lun Chiou, Chun-Chia Hsu, Josh Ho, Yu-Wei Wu, John Chu

Summary: The traditional natural product discovery approach has only explored a small portion of nature's chemical diversity. By using bioinformatic tools, we can interpret the instructions encoded in microbial biosynthetic genes and overcome methodological limitations, thus expanding the scope of discovery. The evaluation of prediction algorithms for nonribosomal peptides (NRPs) synthesis shows that there is still a vast unexplored biosynthetic diversity in nature, suggesting the possibility of discovering unknown NRPs with functional roles other than siderophores.

ACS CHEMICAL BIOLOGY (2023)

Article Computer Science, Artificial Intelligence

Human-guided deep learning with ante-hoc explainability by convolutional network from non-image data for pregnancy prognostication

Herdiantri Sufriyana, Yu-Wei Wu, Emily Chia-Yu Su

Summary: This study demonstrates a human-guided deep learning approach that develops, validates, and deploys prognostic prediction models for PROM and an estimator of time of delivery from non-image data using convolutional neural networks. The developed model outperforms previous models and can be explained using knowledge-based diagrams and model representation.

NEURAL NETWORKS (2023)

Article Genetics & Heredity

Using bacterial pan-genome-based feature selection approach to improve the prediction of minimum inhibitory concentration (MIC)

Ming-Ren Yang, Shun-Feng Su, Yu-Wei Wu

Summary: By using a machine learning feature selection approach, we accurately predicted the minimum inhibitory concentration (MIC) values of antimicrobial drugs, and found that half of the selected features were annotated as hypothetical proteins with unknown functions, indicating the potential of uncovering novel genes associated with pathogenic antimicrobial resistance through feature selection on the entire gene set.

FRONTIERS IN GENETICS (2023)

Article Medicine, General & Internal

XGBoost-Based Simple Three-Item Model Accurately Predicts Outcomes of Acute Ischemic Stroke

Chen-Chih Chung, Emily Chia-Yu Su, Jia-Hung Chen, Yi-Tui Chen, Chao-Yang Kuo

Summary: This study developed an extreme gradient boosting (XGBoost)-based model using age, fasting glucose, and National Institutes of Health Stroke Scale (NIHSS) scores as predictors to accurately predict three-month functional outcomes after acute ischemic stroke (AIS). The model showed reliable predictive power and identified factors associated with unfavorable prognoses, such as NIHSS score > 5, age over 64 years, and fasting blood glucose > 86 mg/dL. Furthermore, it demonstrated the importance of different predictors for patients receiving different AIS treatments.

DIAGNOSTICS (2023)

Article Health Care Sciences & Services

Constructing a Learning Curve to Discuss the Medical Treatments and the Effect of Vaccination of COVID-19

Yi-Tui Chen, Emily Chia-Yu Su, Fang Ming Hung, Tomoru Hiramatsu, Tzu-Jen Hung, Chao-Yang Kuo

Summary: This paper aimed to analyze and compare case fatality rates of COVID-19, examine the existence of learning curves for medical treatments, and explore the impact of vaccination on fatality rate reduction. The results showed that low registration and viral test rates led to low fatality rates, and there was a significant learning curve for all countries except China. Treatment for COVID-19 can be improved through repeated experience. Vaccinations in the U.K. and U.S.A. effectively reduced fatality rates, possibly due to higher vaccination rates. Learning curves for medical treatment were identified, explaining the effect of vaccination rates on fatalities.

HEALTHCARE (2023)

Article Computer Science, Interdisciplinary Applications

Ability of machine-learning based clinical decision support system to reduce alert fatigue, wrong-drug errors, and alert users about look alike, sound alike medication

Chun-You Chen, Ya-Lin Chen, Jeremiah Scholl, Hsuan-Chia Yang, Yu-Chuan (Jack) Li

Summary: This study evaluated the overall performance of a machine learning-based CDSS (MedGuard) in triggering clinically relevant alerts and intercepting inappropriate drug errors and LASA drug errors. The results showed that MedGuard has the ability to improve patients' safety by triggering clinically valid alerts.

COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE (2024)

Article Zoology

Long-Read Genome Sequencing of Abscondita cerata (Coleoptera: Lampyridae), the Endemic Firefly of Taiwan

Tzi-Yuan Wang, King-Siang Goh, Liang-Jong Wang, Li-Ling Wu, Feng-Yu Wang, Yu-Wei Wu

Summary: Abscondita cerata is an endemic firefly species in Taiwan, which has a high abundance and wide distribution. The draft genome of Abs. cerata, sequenced using Nanopore technology, was found to be 967 Mb in size, assembled into 4,855 contigs with an N50 length of 325.269 kb. It was predicted to contain 55,206 protein-coding genes, with functional annotations for 37.78% of them. Repeat elements accounted for 47.11% of the genome, with DNA transposons being the most common type. The completeness of the genome and genes was evaluated to be 84.8% and 79%, respectively. The comparative transcriptome analysis revealed insights into the vision, humidity sensing, and luminescence of Abs. cerata.

ZOOLOGICAL STUDIES (2023)

Article Multidisciplinary Sciences

Yeast cell detection using fuzzy automatic contrast enhancement (FACE) and you only look once (YOLO)

Zheng-Jie Huang, Brijesh Patel, Wei-Hao Lu, Tz-Yu Yang, Wei-Cheng Tung, Vytautas Bucinskas, Modris Greitans, Yu-Wei Wu, Po Ting Lin

Summary: This study presents a novel approach using deep learning techniques for automatic cell detection. By optimizing image contrast and introducing a universal contrast enhancement variable, the proposed method achieves high accuracy in yeast cell detection. Comparative experiments demonstrate the superior performance of this method in cell detection, with significant improvements compared to conventional methods.

SCIENTIFIC REPORTS (2023)

No Data Available