☆ 4.6 Article

A flexible framework for deriving assertions from electronic medical records

JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION (2011)

Journal

JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION

Volume 18, Issue 5, Pages 568-573

Publisher

OXFORD UNIV PRESS

DOI: 10.1136/amiajnl-2011-000152

Keywords

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

Objective This paper describes natural-language-processing techniques for two tasks: identification of medical concepts in clinical text, and classification of assertions, which indicate the existence, absence, or uncertainty of a medical problem. Because so many resources are available for processing clinical texts, there is interest in developing a framework in which features derived from these resources can be optimally selected for the two tasks of interest. Materials and methods The authors used two machine-learning (ML) classifiers: support vector machines (SVMs) and conditional random fields (CRFs). Because SVMs and CRFs can operate on a large set of features extracted from both clinical texts and external resources, the authors address the following research question: Which features need to be selected for obtaining optimal results? To this end, the authors devise feature-selection techniques which greatly reduce the amount of manual experimentation and improve performance. Results The authors evaluated their approaches on the 2010 i2b2/VA challenge data. Concept extraction achieves 79.59 micro F-measure. Assertion classification achieves 93.94 micro F-measure. Discussion Approaching medical concept extraction and assertion classification through ML-based techniques has the advantage of easily adapting to new data sets and new medical informatics tasks. However, ML-based techniques perform best when optimal features are selected. By devising promising feature-selection techniques, the authors obtain results that outperform the current state of the art. Conclusion This paper presents two ML-based approaches for processing language in the clinical texts evaluated in the 2010 i2b2/VA challenge. By using novel feature-selection methods, the techniques presented in this paper are unique among the i2b2 participants.

A flexible framework for deriving assertions from electronic medical records

Journal

JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION

Publisher

OXFORD UNIV PRESS

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

A flexible framework for deriving assertions from electronic medical records

Journal

JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION

Publisher

OXFORD UNIV PRESS

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper