☆ 4.4 Article Proceedings Paper

Natural language processing (NLP) tools in extracting biomedical concepts from research articles: a case study on autism spectrum disorder

BMC MEDICAL INFORMATICS AND DECISION MAKING (2020)

期刊

BMC MEDICAL INFORMATICS AND DECISION MAKING

卷 20, 期 -, 页码 -

出版社

BMC

DOI: 10.1186/s12911-020-01352-2

关键词

Natural language processing; Machine learning; Named entity recognition; Autism spectrum disorder

类别

Medical Informatics

资金

Eagles Charitable Foundation
CHOP Research Institute
NIH/NLM/NHGRI grant [LM012895]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

BackgroundNatural language processing (NLP) tools can facilitate the extraction of biomedical concepts from unstructured free texts, such as research articles or clinical notes. The NLP software tools CLAMP, cTAKES, and MetaMap are among the most widely used tools to extract biomedical concept entities. However, their performance in extracting disease-specific terminology from literature has not been compared extensively, especially for complex neuropsychiatric disorders with a diverse set of phenotypic and clinical manifestations.MethodsWe comparatively evaluated these NLP tools using autism spectrum disorder (ASD) as a case study. We collected 827 ASD-related terms based on previous literature as the benchmark list for performance evaluation. Then, we applied CLAMP, cTAKES, and MetaMap on 544 full-text articles and 20,408 abstracts from PubMed to extract ASD-related terms. We evaluated the predictive performance using precision, recall, and F1 score.ResultsWe found that CLAMP has the best performance in terms of F1 score followed by cTAKES and then MetaMap. Our results show that CLAMP has much higher precision than cTAKES and MetaMap, while cTAKES and MetaMap have higher recall than CLAMP.ConclusionThe analysis protocols used in this study can be applied to other neuropsychiatric or neurodevelopmental disorders that lack well-defined terminology sets to describe their phenotypic presentations.

Natural language processing (NLP) tools in extracting biomedical concepts from research articles: a case study on autism spectrum disorder

期刊

BMC MEDICAL INFORMATICS AND DECISION MAKING

出版社

BMC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Natural language processing (NLP) tools in extracting biomedical concepts from research articles: a case study on autism spectrum disorder

期刊

BMC MEDICAL INFORMATICS AND DECISION MAKING

出版社

BMC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文