4.7 Review

Use of Electronic Health Data for Disease Prediction: A Comprehensive Literature Review

出版社

IEEE COMPUTER SOC
DOI: 10.1109/TCBB.2019.2937862

关键词

Diseases; Predictive models; Data models; Computational modeling; Logistics; Consumer electronics; Disease prediction; electronic health data; healthcare informatics; machine learning; data mining; social network analysis

向作者/读者索取更多资源

Disease prediction using electronic health data can benefit stakeholders by identifying at-risk patients and improving quality of care to avoid hospital admissions. Various models have been proposed utilizing large-scale electronic health databases, different methods, and healthcare variables, with the goal to discuss different risk prediction models based on electronic health data.
Disease prediction has the potential to benefit stakeholders such as the government and health insurance companies. It can identify patients at risk of disease or health conditions. Clinicians can then take appropriate measures to avoid or minimize the risk and in turn, improve quality of care and avoid potential hospital admissions. Due to the recent advancement of tools and techniques for data analytics, disease risk prediction can leverage large amounts of semantic information, such as demographics, clinical diagnosis and measurements, health behaviours, laboratory results, prescriptions and care utilisation. In this regard, electronic health data can be a potential choice for developing disease prediction models. A significant number of such disease prediction models have been proposed in the literature over time utilizing large-scale electronic health databases, different methods, and healthcare variables. The goal of this comprehensive literature review was to discuss different risk prediction models that have been proposed based on electronic health data. Search terms were designed to find relevant research articles that utilized electronic health data to predict disease risks. Online scholarly databases were searched to retrieve results, which were then reviewed and compared in terms of the method used, disease type, and prediction accuracy. This paper provides a comprehensive review of the use of electronic health data for risk prediction models. A comparison of the results from different techniques for three frequently modelled diseases using electronic health data was also discussed in this study. In addition, the advantages and disadvantages of different risk prediction models, as well as their performance, were presented. Electronic health data have been widely used for disease prediction. A few modelling approaches show very high accuracy in predicting different diseases using such data. These modelling approaches have been used to inform the clinical decision process to achieve better outcomes.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Health Care Sciences & Services

Comorbidity progression patterns of major chronic diseases: The impact of age, gender and time-window

Shahadat Uddin, Shangzhou Wang, Arif Khan, Haohui Lu

Summary: This study examines the progression of chronic diseases and their risk factors using a healthcare dataset sample of hospitalized patients. The results show that certain chronic diseases, such as cardiovascular diseases and diabetes, have a high prevalence in progressing to other chronic diseases, which is statistically significant. The progression frequencies increase with time and age, and the patients' sex also affects the disease progressions differently.

CHRONIC ILLNESS (2023)

Article Biology

PSRTTCA: A new approach for improving the prediction and characterization of tumor T cell antigens using propensity score representation learning

Phasit Charoenkwan, Chonlatip Pipattanaboon, Chanin Nantasenamat, Md Mehedi Hasan, Mohammad Ali Moni, Pietro Lio, Watshara Shoombuatong

Summary: Despite existing cancer therapies, the development of new and effective treatments is necessary to address the ongoing cancer recurrence and new cases. This study proposes a new machine learning-based approach, PSRTTCA, for improving the identification and characterization of tumor T cell antigens (TTCAs) based on their primary sequences.

COMPUTERS IN BIOLOGY AND MEDICINE (2023)

Article Multidisciplinary Sciences

Stakeholder engagement variability across public, private and public-private partnership projects: A data-driven network-based analysis

Shahadat Uddin, Stephen Ong, Petr Matous

Summary: Stakeholder engagement is a crucial factor affecting project outcomes, but there is a lack of empirical evidence on the differences in stakeholder engagement patterns between public, private, and public-private partnership (PPP) projects. This study uses social network research methods to capture and compare these engagement structures quantitatively. The findings reveal significant differences in network size, edge number, density, and betweenness centralization across the three types of projects. Additionally, the density varies significantly between 'within budget' and cost overrun projects for private and PPP projects. The study highlights the importance of network data and analytical techniques in managing relationships in complex project ecosystems.

PLOS ONE (2023)

Article Computer Science, Artificial Intelligence

GRU-INC: An inception-attention based approach using GRU for human activity recognition

Taima Rahman Mim, Maliha Amatullah, Sadia Afreen, Mohammad Abu Yousuf, Shahadat Uddin, Salem A. Alyami, Khondokar Fida Hasan, Mohammad Ali Moni

Summary: Human Activity Recognition (HAR) is a valuable research field for clinical applications, where machine learning algorithms play a significant role. The proposed Gated Recurrent Unit-Inception (GRU-INC) model effectively utilizes both temporal and spatial information of time-series data, achieving high F1-scores on various publicly available datasets. The combination of GRU with Attention Mechanism and Inception module with Convolutional Block Attention Module (CBAM) contributes to the superior recognition rate and lower computational cost of the GRU-INC model. This framework has the potential to be applied in activity-associated clinical and rehabilitation applications.

EXPERT SYSTEMS WITH APPLICATIONS (2023)

Article Computer Science, Artificial Intelligence

HARDC : A novel ECG-based heartbeat classification method to detect arrhythmia using hierarchical attention based dual structured RNN with dilated CNN

Md Shofiqul Islam, Khondokar Fida Hasan, Sunjida Sultana, Shahadat Uddin, Pietro Lio', Julian M. W. Quinn, Mohammad Ali Moni

Summary: We propose a hybrid hierarchical attention-based bidirectional recurrent neural network with dilated CNN (HARDC) method for arrhythmia classification. This method fully exploits the dilated CNN and bidirectional recurrent neural network unit (BiGRU-BiLSTM) architecture to generate fusion features, improving the model's performance for prediction. By combining the fusion features with a dilated CNN and a hierarchical attention mechanism, the trained HARDC model showed significantly improved classification results and interpretability of feature extraction on the PhysioNet 2017 challenge dataset.

NEURAL NETWORKS (2023)

Article Chemistry, Multidisciplinary

Evolutionary Features for Dynamic Link Prediction in Social Networks

Nazim Choudhury, Shahadat Uddin

Summary: One of the characteristics of dynamic networks is the evolution of their actors and links. The link prediction mechanism in dynamic networks can capture the growth mechanisms of social networks. Researchers have utilized the temporal patterns of dynamic networks for dynamic link prediction. However, little attention has been given to the temporal variations of actor-level network structure and neighborhood information. This study attempts to build dynamic similarity metrics considering the temporal similarity and correlation between different actor-level evolutionary information of non-connected actor pairs. These metrics are used as dynamic features in the link prediction model and show improved performance compared to static similarity metrics.

APPLIED SCIENCES-BASEL (2023)

Article Medicine, General & Internal

A Novel Hybrid Approach for Classifying Osteosarcoma Using Deep Feature Extraction and Multilayer Perceptron

Md. Tarek Aziz, S. M. Hasan Mahmud, Md. Fazla Elahe, Hosney Jahan, Md Habibur Rahman, Dip Nandi, Lassaad K. Smirani, Kawsar Ahmed, Francis M. Bui, Mohammad Ali Moni

Summary: In this paper, a hybrid framework was proposed to improve the efficiency of osteosarcoma tumor classification by merging different types of CNN-based architectures with a multilayer perceptron algorithm. The proposed model achieved high accuracy for both multiclass and binary classification of osteosarcoma, outperforming existing methods. Experimental findings indicate the potential applicability of this model in supporting osteosarcoma diagnosis in clinics.

DIAGNOSTICS (2023)

Review Health Care Sciences & Services

Ensemble Learning for Disease Prediction: A Review

Palak Mahajan, Shahadat Uddin, Farshid Hajati, Mohammad Ali Moni

Summary: Machine learning models are utilized to create and improve disease prediction frameworks, and ensemble learning is a technique that combines multiple classifiers to enhance performance. In this study, the performance accuracies of different ensemble techniques (bagging, boosting, stacking, and voting) are assessed against five highly researched diseases. The findings reveal that stacking has the most accurate performance and can assist researchers in understanding current trends in disease prediction models that employ ensemble learning.

HEALTHCARE (2023)

Article Computer Science, Hardware & Architecture

Monitoring water quality metrics of ponds with IoT sensors and machine learning to predict fish species survival

Md. Monirul Islam, Mohammod Abul Kashem, Salem A. Alyami, Mohammad Ali Moni

Summary: This paper presents an IoT framework for aquaculture that allows real-time monitoring and effective control of water-related parameters. The proposed system utilizes sensors and an Arduino microcontroller to collect and store data in an IoT cloud platform. The collected data is then analyzed using various machine learning algorithms, with Random Forest achieving the highest performance scores. The study also includes hardware details of the IoT system and calculates biochemical and chemical oxygen demands.

MICROPROCESSORS AND MICROSYSTEMS (2023)

Article Genetics & Heredity

Cancer Classification Utilizing Voting Classifier with Ensemble Feature Selection Method and Transcriptomic Data

Rabea Khatun, Maksuda Akter, Md. Manowarul Islam, Md. Ashraf Uddin, Md. Alamin Talukder, Joarder Kamruzzaman, Akm Azad, Bikash Kumar Paul, Muhammad Ali Abdulllah Almoyad, Sunil Aryal, Mohammad Ali Moni

Summary: This article proposes an ensemble rank-based feature selection method and classifier to address the challenge of high-dimensional data in cancer diagnosis. The method efficiently discovers the most relevant and useful features by aggregating rankings from different selection methods. The results show high accuracy on multiple datasets and the study identifies a subset of the most important cancer-causing genes and demonstrates their significance.
Article Computer Science, Interdisciplinary Applications

Impact of COVID-19 on Journal Impact Factor

Shahadat Uddin, Arif Khan, Haohui Lu

Summary: Research on COVID-19 has seen significant growth in recent years and has been a dominant topic in health-related publications. This study explores the impact of COVID-19 research on journal performance using the Impact Factor and six years of data. The results show that journals publishing COVID-19-related articles experienced a significant increase in their Impact Factor, with lower Impact Factor journals contributing the most to this growth. It suggests that journals prioritizing COVID-19 research may experience increased visibility and Impact Factor growth in the long term.

JOURNAL OF INFORMETRICS (2023)

Proceedings Paper Computer Science, Theory & Methods

How could a weighted drug-drug network help improve adverse drug reaction predictions? Machine learning reveals the importance of edge weights

Fangyu Zhou, Shahadat Uddin

Summary: In recent years, there has been an exponential growth in drug-related data and adverse drug reactions (ADRs), leading to a comparatively high hospitalization rate worldwide. To minimize risks, extensive research has been conducted to predict ADRs. Due to the high cost and time-consuming nature of lab experiments, researchers are exploring the use of data mining and machine learning techniques in this field. This paper constructs a weighted drug-drug network by integrating various data sources, revealing underlying relationships between drugs based on common ADRs. Network features are extracted from this network, such as weighted degree centrality and weighted PageRanks, which are concatenated with original drug features to train and test seven classical machine learning algorithms. Experiment results show that adding these network measures benefits all tested machine learning methods, with logistic regression achieving the highest mean AUROC score (0.821) across all ADRs. Weighted degree centrality and weighted PageRanks are identified as the most important network features in the logistic regression classifier. This evidence strongly supports the fundamental role of the network approach in future ADR prediction, where network edge weights play a crucial role in the logistic regression model.

PROCEEDINGS OF 2023 AUSTRALIAN COMPUTER SCIENCE WEEK, ACSW 2023 (2023)

暂无数据