4.6 Article

Disease Prediction via Graph Neural Networks

期刊

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/JBHI.2020.3004143

关键词

Diseases; Predictive models; Medical diagnostic imaging; Data models; Task analysis; Informatics; Machine learning; Disease prediction; big data health applications; data mining; graph embedding

资金

  1. NSFC [91846205]
  2. National Key RD Program [2017YFB1400100]
  3. Innovation Method Fund of China [2018IM020200]
  4. Shandong Key RD Program [2018YFJH0506, 2019JZZY011007]
  5. Australian Research Council [DP190101985]

向作者/读者索取更多资源

With the increasing availability of electronic medical records (EMRs), disease prediction has received significant research attention. This study introduces an innovative model based on Graph Neural Networks (GNNs) for disease prediction, which utilizes external knowledge bases to augment EMR data and accurately predicts both general and rare diseases.
With the increasingly available electronic medical records (EMRs), disease prediction has recently gained immense research attention, where an accurate classifier needs to be trained to map the input prediction signals (e.g., symptoms, patient demographics, etc.) to the estimated diseases for each patient. However, existing machine learning-based solutions heavily rely on abundant manually labeled EMR training data to ensure satisfactory prediction results, impeding their performance in the existence of rare diseases that are subject to severe data scarcity. For each rare disease, the limited EMR data can hardly offer sufficient information for a model to correctly distinguish its identity from other diseases with similar clinical symptoms. Furthermore, most existing disease prediction approaches are based on the sequential EMRs collected for every patient and are unable to handle new patients without historical EMRs, reducing their real-life practicality. In this paper, we introduce an innovative model based on Graph Neural Networks (GNNs) for disease prediction, which utilizes external knowledge bases to augment the insufficient EMR data, and learns highly representative node embeddings for patients, diseases and symptoms from the medical concept graph and patient record graph respectively constructed from the medical knowledge base and EMRs. By aggregating information from directly connected neighbor nodes, the proposed neural graph encoder can effectively generate embeddings that capture knowledge from both data sources, and is able to inductively infer the embeddings for a new patient based on the symptoms reported in her/his EMRs to allow for accurate prediction on both general diseases and rare diseases. Extensive experiments on a real-world EMR dataset have demonstrated the state-of-the-art performance of our proposed model.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据