4.5 Article

Pressing needs of biomedical text mining in biocuration and beyond: opportunities and challenges

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/database/baw161

Funding

  1. National Institutes of Health [R13-GM109648-01A1, P20-GM103446]
  2. Intramural Research Program at National Library of Medicine
  3. US Department of Energy [DE-SC0010838]
  4. US National Science Foundation [DBI-1356374]
  5. Swiss Federal Government through the State Secretariat for Education, Research and Innovation (SERI)
  6. SyBIT project of the SystemsX.ch
  7. Swiss Initiative in Systems Biology
  8. Robert Bosch Foundation
  9. EMBO
  10. Wellcome Grant, UK PubMed Central Phase 3 Developments [098231/Z/12/Z]
  11. Wellcome Trust [098231/Z/12/Z]
  12. U.S. Department of Energy (DOE) [DE-SC0010838]

Text mining in the biomedical sciences is rapidly transitioning from small-scale evaluation to large-scale application. In this article, we argue that text-mining technologies have become essential tools in real-world biomedical research. We describe four large-scale applications of text mining, as showcased during a recent panel discussion at the BioCreative V Challenge Workshop. We draw on these applications as case studies to characterize common requirements for successfully applying text-mining techniques to practical biocuration needs. We note that system 'accuracy' remains a challenge and identify several additional common difficulties and potential research directions, including (i) the 'scalability' issue, arising from the increasing need to mine information from millions of full-text articles, (ii) the 'interoperability' issue of integrating various text-mining systems into existing curation workflows and (iii) the 'reusability' issue, namely the difficulty of applying trained systems to text genres not seen during development. We then describe related efforts within the text-mining community, with a special focus on the BioCreative series of challenge workshops. We believe that focusing on the near-term challenges identified in this work will amplify the opportunities afforded by the continued adoption of text-mining tools. Finally, in order to sustain the curation ecosystem and have text-mining systems adopted for practical benefits, we call for increased collaboration between text-mining researchers and various stakeholders, including researchers, publishers and biocurators.