4.7 Article

Towards a Protein-Protein Interaction information extraction system: Recognizing named entities

Journal

KNOWLEDGE-BASED SYSTEMS
Volume 57, Pages 104-118

Publisher

ELSEVIER
DOI: 10.1016/j.knosys.2013.12.010

Keywords

Biomedical named entity recognition; Protein-Protein Interaction; Dictionary look-up; Conditional random field; Support vector machine

Funding

  1. MICINN, Spain [TIN2012-38603-C02-01]
  2. European Commission [269180]


The majority of biological functions of any living being are related to Protein-Protein Interactions (PPI). PPI discoveries are reported in the form of research publications, whose volume grows day after day. Consequently, automatic PPI information extraction systems are a pressing need for biologists. In this paper we are mainly concerned with the named entity detection module of PPIES (the PPI information extraction system we are implementing), which recognizes twelve entity types relevant in the PPI context. It is composed of two sub-modules: a dictionary look-up with extensive normalization and acronym detection, and a Conditional Random Field classifier. The dictionary look-up module has been tested on the Interaction Method Task (IMT), where it improves by approximately 10% on the current solutions that do not use Machine Learning (ML). The second module has been used to create a classifier from the Joint Workshop on Natural Language Processing in Biomedicine and its Applications (JNLPBA'04) data set. It does not use any external resources, or complex or ad hoc post-processing, and obtains 77.25%, 75.04% and 76.13% for precision, recall, and F1-measure, respectively, improving on all previous results obtained for this data set. (C) 2013 Elsevier B.V. All rights reserved.
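As a rough illustration of the dictionary look-up idea described in the abstract, the sketch below normalizes text and performs a greedy longest-match search against a small term dictionary. It is not the PPIES implementation: the normalization rules, the function names (`normalize`, `build_index`, `lookup`, `f1`), the dictionary entries, and the `interaction_method` label are all assumptions made for this example, and acronym detection is left out. The `f1` helper only checks that the reported F1-measure follows from the reported precision and recall.

```python
import re


def normalize(text: str) -> str:
    """Lowercase, turn every non-alphanumeric character into a space,
    and collapse runs of whitespace (an assumed, simplified normalization)."""
    text = text.lower()
    text = re.sub(r"[^a-z0-9\s]", " ", text)
    return re.sub(r"\s+", " ", text).strip()


def build_index(dictionary: dict) -> dict:
    """Map each normalized term, as a tuple of tokens, to its entity type."""
    return {tuple(normalize(term).split()): etype
            for term, etype in dictionary.items()}


def lookup(sentence: str, index: dict) -> list:
    """Greedy longest-match look-up over the normalized tokens of a sentence."""
    tokens = normalize(sentence).split()
    max_len = max((len(key) for key in index), default=0)
    matches = []
    i = 0
    while i < len(tokens):
        # Try the longest candidate span first, then shorter ones.
        for n in range(min(max_len, len(tokens) - i), 0, -1):
            key = tuple(tokens[i:i + n])
            if key in index:
                matches.append((" ".join(key), index[key]))
                i += n
                break
        else:
            i += 1  # no dictionary term starts at this token
    return matches


def f1(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall."""
    return 2 * precision * recall / (precision + recall)


# Hypothetical dictionary entries, for illustration only.
index = build_index({
    "yeast two-hybrid": "interaction_method",
    "co-immunoprecipitation": "interaction_method",
})
sentence = ("The interaction was confirmed by Co-Immunoprecipitation "
            "and yeast two hybrid assays.")
print(lookup(sentence, index))
print(round(f1(77.25, 75.04), 2))
```

Normalizing both the dictionary and the input text in the same way is what lets "Co-Immunoprecipitation" and "yeast two hybrid" match entries written with different casing and hyphenation.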
