Article

Knowledge matters: Chest radiology report generation with general and specific knowledge

MEDICAL IMAGE ANALYSIS (2022)

Journal

MEDICAL IMAGE ANALYSIS

Volume 80, Issue -, Pages -

Publisher

ELSEVIER

DOI: 10.1016/j.media.2022.102510

Keywords

Chest radiology report generation; Knowledge graph; Multi-head attention

Categories

Computer Science, Artificial Intelligence; Computer Science, Interdisciplinary Applications; Engineering, Biomedical; Radiology, Nuclear Medicine & Medical Imaging

Funding

CCF-Tencent Open Fund
National Natural Science Foundation of China [31900979]

Summary

Automatic chest radiology report generation matters in clinical practice because it reduces radiologists' workload and helps prevent misdiagnosis. This paper proposes a knowledge-enhanced approach that uses both general and specific medical knowledge to improve the quality of generated reports. Experiments show that the approach outperforms state-of-the-art methods on almost all metrics on the IU-Xray dataset and is on par with them on the MIMIC-CXR dataset.

Abstract

Automatic chest radiology report generation is critical in clinical practice: it can relieve experienced radiologists of a heavy workload and alert inexperienced radiologists to misdiagnoses or missed diagnoses. Existing approaches mainly formulate chest radiology report generation as an image captioning task and adopt the encoder-decoder framework. However, in the medical domain, such purely data-driven approaches suffer from two problems: 1) visual and textual bias; 2) a lack of expert knowledge. In this paper, we propose a knowledge-enhanced radiology report generation approach that introduces two types of medical knowledge: 1) general knowledge, which is input independent and provides broad knowledge for report generation; 2) specific knowledge, which is input dependent and provides fine-grained knowledge for chest X-ray report generation. To fully utilize both the general and the specific knowledge, we also propose a knowledge-enhanced multi-head attention mechanism. By merging the visual features of the radiology image with general knowledge and specific knowledge, the proposed model can improve the quality of generated reports. Experimental results on the publicly available IU-Xray dataset show that the proposed knowledge-enhanced approach outperforms state-of-the-art methods on almost all metrics, and results on the MIMIC-CXR dataset show that it is on par with state-of-the-art methods. Ablation studies also demonstrate that both general and specific knowledge help to improve the performance of chest radiology report generation. © 2022 The Author(s). Published by Elsevier B.V. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
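The abstract does not give the equations of the knowledge-enhanced multi-head attention, so the following is only a minimal sketch of the general idea of merging visual features with knowledge through multi-head attention. Everything in it is an assumption made for illustration, not the authors' implementation: the function names, the use of visual features as queries over a concatenated memory of general and specific knowledge vectors, the absence of learned projection matrices, and the residual sum used as the merge step are all hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend(queries, keys, values):
    """Scaled dot-product attention over plain lists of vectors."""
    scale = math.sqrt(len(queries[0]))
    out = []
    for q in queries:
        weights = softmax(
            [sum(a * b for a, b in zip(q, k)) / scale for k in keys]
        )
        out.append([
            sum(w * v[j] for w, v in zip(weights, values))
            for j in range(len(values[0]))
        ])
    return out


def split_heads(vectors, num_heads):
    """Slice every vector into num_heads equal chunks (one list per head)."""
    d_head = len(vectors[0]) // num_heads
    return [
        [v[h * d_head:(h + 1) * d_head] for v in vectors]
        for h in range(num_heads)
    ]


def knowledge_enhanced_attention(visual, general, specific, num_heads=2):
    """Hypothetical fusion: visual features attend over knowledge vectors.

    visual:   image-region features (assumed to act as queries)
    general:  input-independent knowledge embeddings
    specific: input-dependent knowledge embeddings
    """
    assert len(visual[0]) % num_heads == 0, "dimension must split evenly"
    knowledge = general + specific  # one shared memory (an assumption)
    q_heads = split_heads(visual, num_heads)
    k_heads = split_heads(knowledge, num_heads)
    # Each head attends independently; keys double as values here.
    per_head = [attend(q, k, k) for q, k in zip(q_heads, k_heads)]
    fused = []
    for i, v in enumerate(visual):
        context = [x for head in per_head for x in head[i]]  # concat heads
        fused.append([a + b for a, b in zip(v, context)])  # residual merge
    return fused


if __name__ == "__main__":
    visual = [[1.0, 0.0, 0.0, 1.0], [0.0, 1.0, 1.0, 0.0]]
    general = [[0.5, 0.5, 0.5, 0.5]]
    specific = [[1.0, 0.0, 0.0, 0.0], [0.0, 0.0, 0.0, 1.0]]
    for row in knowledge_enhanced_attention(visual, general, specific):
        print([round(x, 3) for x in row])
```

In this sketch the general knowledge vectors would be the same for every image, while the specific ones would be retrieved per image; a trained model would additionally apply learned query, key, and value projections before the attention step.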


Vision-knowledge fusion model for multi-domain medical report generation

Dexuan Xu, Huashi Zhu, Yu Huang, Zhi Jin, Weiping Ding, Hang Li, Menglong Ran

Summary: This paper proposes a vision-knowledge fusion model based on medical images and knowledge graphs that makes full use of high-quality data from different diseases and languages. The model automatically constructs domain-specific knowledge graphs based on medical standards, fuses image and knowledge using a knowledge-based attention mechanism, and restores fine-grained knowledge through a triple restoration module. Experimental results show that the model outperforms previous benchmark methods and achieves excellent evaluation scores on datasets for two different diseases. The interpretability and clinical usefulness of the model are validated, and it can be generalized to multiple domains and different diseases.

INFORMATION FUSION (2023)