4.6 Article

Drug repurposing for COVID-19 via knowledge graph completion

期刊

JOURNAL OF BIOMEDICAL INFORMATICS
卷 115, 期 -, 页码 -

出版社

ACADEMIC PRESS INC ELSEVIER SCIENCE
DOI: 10.1016/j.jbi.2021.103696

关键词

COVID-19; Drug repurposing; Knowledge graph completion; Literature-based discovery; Text mining

资金

  1. U.S. National Institutes of Health's National Center for Complementary and Integrative Health [R01AT009457]
  2. Slovenian Research Agency [P3-0154, Z5-9352, J5-2552, J5-1780]

向作者/读者索取更多资源

This study utilized literature-derived knowledge and knowledge graph completion methods to identify potential drug candidates for COVID-19. The accuracy classifier based on PubMedBERT performed the best in identifying accurate semantic predications, and the TransE model outperformed others in predicting drug repurposing candidates. Several known drugs linked to COVID-19 were identified, as well as novel drugs that have not been studied yet.
Objective: To discover candidate drugs to repurpose for COVID-19 using literature-derived knowledge and knowledge graph completion methods. Methods: We propose a novel, integrative, and neural network-based literature-based discovery (LBD) approach to identify drug candidates from PubMed and other COVID-19-focused research literature. Our approach relies on semantic triples extracted using SemRep (via SemMedDB). We identified an informative and accurate subset of semantic triples using filtering rules and an accuracy classifier developed on a BERT variant. We used this subset to construct a knowledge graph, and applied five state-of-the-art, neural knowledge graph completion algorithms (i.e., TransE, RotatE, DistMult, ComplEx, and STELP) to predict drug repurposing candidates. The models were trained and assessed using a time slicing approach and the predicted drugs were compared with a list of drugs reported in the literature and evaluated in clinical trials. These models were complemented by a discovery pattern-based approach. Results: Accuracy classifier based on PubMedBERT achieved the best performance (F1 = 0.854) in identifying accurate semantic predications. Among five knowledge graph completion models, TransE outperformed others (MR = 0.923, Hits@1 = 0.417). Some known drugs linked to COVID-19 in the literature were identified, as well as others that have not yet been studied. Discovery patterns enabled identification of additional candidate drugs and generation of plausible hypotheses regarding the links between the candidate drugs and COVID-19. Among them, five highly ranked and novel drugs (i.e., paclitaxel, SB 203580, alpha 2-antiplasmin, metoclopramide, and oxymatrine) and the mechanistic explanations for their potential use are further discussed. Conclusion: We showed that a LBD approach can be feasible not only for discovering drug candidates for COVID19, but also for generating mechanistic explanations. Our approach can be generalized to other diseases as well as to other clinical questions. Source code and data are available at https://github.com/kilicogluh/lbd-covid.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据