Article; Proceedings Paper

Biomedical word sense disambiguation with bidirectional long short-term memory and attention-based neural networks

BMC BIOINFORMATICS (2019)

Journal

BMC BIOINFORMATICS

Volume 20, Issue -, Pages -

Publisher

BMC

DOI: 10.1186/s12859-019-3079-8

Keywords

Word sense disambiguation; LSTM; Self-attention; Biomedical

Categories

Biochemical Research Methods; Biotechnology & Applied Microbiology; Mathematical & Computational Biology

Funding

NVIDIA GPU Grant
National Institute on Aging [R21AG061431]
Open Access Fund of FSU Libraries

Abstract

Background: In recent years, deep learning methods have been applied to many natural language processing tasks to achieve state-of-the-art performance. However, in the biomedical domain, they have not outperformed supervised word sense disambiguation (WSD) methods based on support vector machines or random forests, possibly due to inherent similarities of medical word senses.

Results: In this paper, we propose two deep-learning-based models for supervised WSD: a model based on a bidirectional long short-term memory (BiLSTM) network, and an attention model based on a self-attention architecture. Our results show that the BiLSTM neural network model with a suitable upper layer structure performs even better than the existing state-of-the-art models on the MSH WSD dataset, while our attention model was 3 or 4 times faster than our BiLSTM model with good accuracy. In addition, we trained universal models in order to disambiguate all ambiguous words together. In the universal models, we concatenate the embedding of the target ambiguous word to the max-pooled vector, where it acts as a hint. The results show that our universal BiLSTM neural network model yielded about 90 percent accuracy.

Conclusion: Deep contextual models based on sequential information processing methods are able to capture the relative contextual information from pre-trained input word embeddings, in order to provide state-of-the-art results for supervised biomedical WSD tasks.
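The "hint" step described for the universal models, where the target word's embedding is concatenated to the max-pooled vector, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `max_pool_over_time` and `add_target_hint`, the vector sizes, and all numeric values are assumptions chosen for illustration, and the hidden states stand in for the per-token outputs of a contextual encoder such as a BiLSTM.

```python
from typing import List, Sequence

Vector = List[float]


def max_pool_over_time(hidden_states: Sequence[Sequence[float]]) -> Vector:
    """Element-wise maximum across time steps (one hidden vector per token)."""
    if not hidden_states:
        raise ValueError("need at least one time step")
    # zip(*...) regroups the values by hidden dimension instead of by token.
    return [max(column) for column in zip(*hidden_states)]


def add_target_hint(pooled: Sequence[float],
                    target_embedding: Sequence[float]) -> Vector:
    """Concatenate the ambiguous word's embedding to the pooled context vector."""
    return list(pooled) + list(target_embedding)


# Toy values, not taken from the paper: 3 tokens, hidden size 4.
hidden_states = [
    [0.1, -0.3, 0.5, 0.0],
    [0.4, 0.2, -0.1, 0.3],
    [-0.2, 0.6, 0.2, 0.1],
]
# Hypothetical 2-dimensional embedding of the target ambiguous word.
target_embedding = [0.9, -0.7]

pooled = max_pool_over_time(hidden_states)
features = add_target_hint(pooled, target_embedding)

print(pooled)    # [0.4, 0.6, 0.5, 0.3]
print(features)  # [0.4, 0.6, 0.5, 0.3, 0.9, -0.7]
```

In a single model shared across all ambiguous words, a vector like `features` would be passed to the classification layers, so that the classifier knows which word it is being asked to disambiguate.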
