4.6 Article

Hybrid methods for improving information access in clinical documents: concept, assertion, and relation identification

Journal

Publisher

OXFORD UNIV PRESS
DOI: 10.1136/amiajnl-2011-000154

Keywords

-

Funding

  1. National Library of Medicine [U54-LM008748]
  2. Consortium for Healthcare Informatics Research (CHIR), VA HSR HIR [08-374]
  3. VA Informatics and Computing Infrastructure (VINCI), VA HSR HIR [08-204]
  4. MedQuist
  5. project Akenaton [ANR-07-TECSAN-001]
  6. OSEO, FRENCH State agency for innovation

Ask authors/readers for more resources

Objective This paper describes the approaches the authors developed while participating in the i2b2/VA 2010 challenge to automatically extract medical concepts and annotate assertions on concepts and relations between concepts. Design The authors' approaches rely on both rule-based and machine-learning methods. Natural language processing is used to extract features from the input texts; these features are then used in the authors' machine-learning approaches. The authors used Conditional Random Fields for concept extraction, and Support Vector Machines for assertion and relation annotation. Depending on the task, the authors tested various combinations of rule-based and machine-learning methods. Results The authors' assertion annotation system obtained an F-measure of 0.931, ranking fifth out of 21 participants at the i2b2/VA 2010 challenge. The authors' relation annotation system ranked third out of 16 participants with a 0.709 F-measure. The 0.773 F-measure the authors obtained on concept extraction did not make it to the top 10. Conclusion On the one hand, the authors confirm that the use of only machine-learning methods is highly dependent on the annotated training data, and thus obtained better results for well-represented classes. On the other hand, the use of only a rule-based method was not sufficient to deal with new types of data. Finally, the use of hybrid approaches combining machine-learning and rule-based approaches yielded higher scores.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available