4.6 Article

Methods for Building Sense Inventories of Abbreviations in Clinical Notes

出版社

OXFORD UNIV PRESS
DOI: 10.1197/jamia.M2927

关键词

-

资金

  1. NLM [LM007659, LM008635, K22LM008805]
  2. NATIONAL LIBRARY OF MEDICINE [R01LM007659, R01LM008635, K22LM008805] Funding Source: NIH RePORTER

向作者/读者索取更多资源

Objective: To develop methods for building corpus-specific sense inventories of abbreviations occurring in clinical documents. Design: A corpus of internal medicine admission notes was collected and instances of each clinical abbreviation in the corpus were clustered to different sense clusters. One instance from each cluster was manually annotated to generate a final list of senses. Two clustering-based methods (Expectation Maximization-EM and Farthest First-FF) and one random sampling method for sense detection were evaluated using a set of 12 clinical abbreviations. Measurements: The clustering-based sense detection methods were evaluated using a set of clinical abbreviations that were manually sense annotated. Sense Completeness and Annotation Cost were used to measure the performance of different methods. Clustering error rates were also reported for different clustering algorithms. Results: A clustering-based semi-automated method was developed to build corpus-specific sense inventories for abbreviations in hospital admission notes. Evaluation demonstrated that this method could largely reduce manual annotation cost and increase the completeness of sense inventories when compared with a manual annotation method using random samples. Conclusion: The authors developed an effective clustering-based method for building corpus-specific sense inventories for abbreviations in a clinical corpus. To the best of the authors knowledge, this is the first time clustering technologies have been used to help building sense inventories of abbreviations in clinical text. The results demonstrated that the clustering-based method performed better than the manual annotation method using random samples for the task of building sense inventories of clinical abbreviations.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据