4.6 Article

Disambiguation in the biomedical domain: The role of ambiguity type

Journal

JOURNAL OF BIOMEDICAL INFORMATICS
Volume 43, Issue 6, Pages 972-981

Publisher

ACADEMIC PRESS INC ELSEVIER SCIENCE
DOI: 10.1016/j.jbi.2010.08.009

Keywords

Natural Language Processing, NLP; Word Sense Disambiguation, WSD; Ambiguity; Biomedical documents

Funding

  1. UK Engineering and Physical Sciences Research Council [EP/E004350/1, EP/D069548/1]
  2. EPSRC [EP/D069548/1, EP/E004350/1] Funding Source: UKRI
  3. Engineering and Physical Sciences Research Council [EP/D069548/1, EP/E004350/1] Funding Source: researchfish

Word Sense Disambiguation (WSD), the automatic identification of the meanings of ambiguous terms in a document, is an important stage in text processing. We describe a WSD system that has been developed specifically for the types of ambiguities found in biomedical documents. This system uses a range of knowledge sources. It employs both linguistic features, such as local collocations, and features derived from domain-specific knowledge sources, the Unified Medical Language System (UMLS) and Medical Subject Headings (MeSH). This system is applied to three types of ambiguities found in Medline abstracts: ambiguous terms, abbreviations with multiple expansions, and names that are ambiguous between genes. The WSD system is applied to the standard NLM-WSD data set, which consists of ambiguous terms from Medline abstracts, and was found to perform well in comparison with previously reported results. The system's performance and the contribution of each knowledge source depend upon the type of lexical ambiguity: 87.9% of the ambiguous terms are correctly disambiguated using a combination of linguistic features and MeSH terms, 99% of abbreviations are disambiguated by combining all knowledge sources, while 97.2% of ambiguous gene names are disambiguated using the MeSH terms alone. Analysis reveals that these differences are caused by the nature of each ambiguity type. These results should be taken into account when deciding which information to use for WSD and the level of performance that can be expected. (C) 2010 Elsevier Inc. All rights reserved.
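
As a rough illustration of what combining linguistic features with MeSH terms could look like, the Python sketch below builds a single feature dictionary for one occurrence of an ambiguous word. It is not the system described in the abstract: the function names, the feature naming scheme, the window size, and the example sentence and MeSH terms are all assumptions made for illustration, and the abstract does not specify the actual feature definitions or the learning algorithm.

```python
from typing import Dict, List


def local_collocations(tokens: List[str], target_index: int, window: int = 2) -> Dict[str, int]:
    """Binary features for words within `window` positions of the target.

    Hypothetical feature scheme: each feature records the relative offset
    and the lowercased neighbouring word.
    """
    features: Dict[str, int] = {}
    for offset in range(-window, window + 1):
        if offset == 0:
            continue  # skip the ambiguous word itself
        position = target_index + offset
        if 0 <= position < len(tokens):
            features[f"colloc[{offset:+d}]={tokens[position].lower()}"] = 1
    return features


def combined_features(
    tokens: List[str], target_index: int, mesh_terms: List[str], window: int = 2
) -> Dict[str, int]:
    """Merge collocation features with one binary feature per MeSH term
    assigned to the abstract (assumed to be supplied by the caller)."""
    features = local_collocations(tokens, target_index, window)
    for term in mesh_terms:
        features[f"mesh={term}"] = 1
    return features


# Illustrative input only; not taken from the NLM-WSD data set.
tokens = "Patients were exposed to cold temperatures overnight".split()
features = combined_features(
    tokens, tokens.index("cold"), ["Cold Temperature", "Hypothermia"]
)
```

A dictionary of this shape could be passed to any standard supervised classifier after vectorization; which classifier and which feature subsets work best would, as the abstract notes, depend on the ambiguity type.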
