4.7 Article

Knowledge matters: Chest radiology report generation with general and specific knowledge

期刊

MEDICAL IMAGE ANALYSIS
卷 80, 期 -, 页码 -

出版社

ELSEVIER
DOI: 10.1016/j.media.2022.102510

关键词

Chest radiology report generation; Knowledge graph; Multi-head attention

资金

  1. CCF-Tencent Open Fund
  2. National Natural Science Foundation of China [31900979]

向作者/读者索取更多资源

Automatic chest radiology report generation is crucial in clinics for reducing the workload of radiologists and preventing misdiagnosis. This paper proposes a knowledge-enhanced approach that utilizes both general and specific medical knowledge to improve the quality of generated reports. Experimental results show that this approach outperforms state-of-the-art methods in terms of performance.
Automatic chest radiology report generation is critical in clinics which can relieve experienced radiologists from the heavy workload and remind inexperienced radiologists of misdiagnosis or missed diagnose. Existing approaches mainly formulate chest radiology report generation as an image captioning task and adopt the encoder-decoder framework. However, in the medical domain, such pure data-driven approaches suffer from the following problems: 1) visual and textual bias problem; 2) lack of expert knowledge. In this paper, we propose a knowledge-enhanced radiology report generation approach introduces two types of medical knowledge: 1) General knowledge, which is input independent and provides the broad knowledge for report generation; 2) Specific knowledge, which is input dependent and provides the fine-grained knowledge for chest X-ray report generation. To fully utilize both the general and specific knowledge, we also propose a knowledge-enhanced multi-head attention mechanism. By merging the visual features of the radiology image with general knowledge and specific knowledge, the proposed model can improve the quality of generated reports. The experimental results on the publicly available IU-Xray dataset show that the proposed knowledge-enhanced approach outperforms state-of-the-art methods in almost all metrics. And the results of MIMIC-CXR dataset show that the proposed knowledge-enhanced approach is on par with state-of-the-art methods. Ablation studies also demonstrate that both general and specific knowledge can help to improve the performance of chest radiology report generation.(c) 2022 The Author(s). Published by Elsevier B.V. This is an open access article under the CC BY-NC-ND license( http://creativecommons.org/licenses/by-nc-nd/4.0/ )

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Computer Science, Interdisciplinary Applications

DeepACEv2: Automated Chromosome Enumeration in Metaphase Cell Images Using Deep Convolutional Neural Networks

Li Xiao, Chunlong Luo, Tianqi Yu, Yufan Luo, Manqing Wang, Fuhai Yu, Yinhao Li, Chan Tian, Jie Qiao

IEEE TRANSACTIONS ON MEDICAL IMAGING (2020)

Article Computer Science, Interdisciplinary Applications

Label-Free Segmentation of COVID-19 Lesions in Lung CT

Qingsong Yao, Li Xiao, Peihang Liu, S. Kevin Zhou

Summary: The study introduces a label-free approach for segmenting COVID-19 lesions in CT via voxel-level anomaly modeling, reducing the burden of data annotation. By learning patterns of normal tissues, a network capable of distinguishing normal tissues from COVID-19 lesions was established.

IEEE TRANSACTIONS ON MEDICAL IMAGING (2021)

Article Cell Biology

DeepContact: High-throughput quantification of membrane contact sites based on electron microscopy imaging

Liqing Liu, Shuxin Yang, Yang Liu, Xixia Li, Junjie Hu, Li Xiao, Tao Xu

Summary: In this study, a tool called DeepContact was developed for optimizing organelle segmentation and contact analysis using label-free electron microscopy (EM) through deep learning algorithms. DeepContact showed high efficiency and flexibility, capable of accommodating various organelle morphologies and identifying contacts of different widths. The study revealed previously unidentified coordinated rearrangements of organelles and a subtle wave of interaction between the endoplasmic reticulum (ER) and mitochondria in Sertoli cells during the seminiferous epithelial cycle, demonstrating the potential of DeepContact in bridging MCS dynamics to physiological and pathological processes.

JOURNAL OF CELL BIOLOGY (2022)

Article Computer Science, Artificial Intelligence

Radiology report generation with a learned knowledge base and multi-modal alignment

Shuxin Yang, Xian Wu, Shen Ge, Zhuozhao Zheng, S. Kevin Zhou, Li Xiao

Summary: This study presents an automatic, multi-modal approach for generating radiology reports from chest x-ray images. The approach consists of a learned knowledge base module and a multi-modal alignment module, which utilize the correlation between the descriptions in radiology reports and specific information in the x-ray images. The proposed model is evaluated using metrics from natural language generation and clinic efficacy, and it shows improved performance compared to state-of-the-art methods.

MEDICAL IMAGE ANALYSIS (2023)

Article Computer Science, Interdisciplinary Applications

A New Likelihood Ratio Method for Training Artificial Neural Networks

Yijie Peng, Li Xiao, Bernd Heidergott, L. Jeff Hong, Henry Lam

Summary: A new method for computing the gradients of artificial neural networks was proposed, which bypasses the continuity requirement of traditional methods by injecting artificial noises into signals. The method shows similar computational complexity and more transparent formulas compared to traditional methods. Additionally, a likelihood ratio-based method was developed to train more general ANNs and improve their robustness.

INFORMS JOURNAL ON COMPUTING (2022)

Article Computer Science, Artificial Intelligence

Simultaneous alignment and surface regression using hybrid 2D-3D networks for 3D coherent layer segmentation of retinal OCT images with full and annotations

Hong Liu, Dong Wei, Donghuan Lu, Xiaoying Tang, Liansheng Wang, Yefeng Zheng

Summary: This study proposes a framework based on hybrid 2D-3D convolutional neural networks for obtaining continuous 3D retinal layer surfaces from OCT volumes. The framework works well with both full and sparse annotations and utilizes alignment displacement vectors and layer segmentation to align the B-scans and segment the layers. Experimental results show that the framework outperforms state-of-the-art 2D deep learning methods in terms of layer segmentation accuracy and cross-B-scan 3D continuity.

MEDICAL IMAGE ANALYSIS (2024)

Article Computer Science, Artificial Intelligence

WarpDrive: Improving spatial normalization using manual refinements

Simon Oxenford, Ana Sofia Rios, Barbara Hollunder, Clemens Neudorfer, Alexandre Boutet, Gavin J. B. Elias, Jurgen Germann, Aaron Loh, Wissam Deeb, Bryan Salvato, Leonardo Almeida, Kelly D. Foote, Robert Amaral, Paul B. Rosenberg, David F. Tang-Wai, David A. Wolk, Anna D. Burke, Marwan N. Sabbagh, Stephen Salloway, M. Mallar Chakravarty, Gwenn S. Smith, Constantine G. Lyketsos, Michael S. Okun, William S., Zoltan Mari, Francisco A. Ponce, Andres Lozano, Wolf-Julian Neumann, Bassam Al-Fatly, Andreas Horn

Summary: Spatial normalization is a method to map subject brain images to an average template brain, allowing comparison of brain imaging results. We introduce a novel tool called WarpDrive, which enables manual refinements of image alignment after automated registration. The tool improves accuracy of data representation and aids in understanding patient outcomes.

MEDICAL IMAGE ANALYSIS (2024)

Article Computer Science, Artificial Intelligence

Interpretable and intervenable ultrasonography-based machine learning models for pediatric appendicitis

Ricards Marcinkevics, Patricia Reis Wolfertstetter, Ugne Klimiene, Kieran Chin-Cheong, Alyssia Paschke, Julia Zerres, Markus Denzinger, David Niederberger, Sven Wellmann, Ece Ozkan, Christian Knorr, Julia E. Vogt

Summary: This study presents interpretable machine learning models for predicting the diagnosis, management, and severity of suspected appendicitis using ultrasound images. The proposed models utilize concept bottleneck models (CBM) that facilitate interpretation and intervention by clinicians, without compromising performance or requiring time-consuming image annotation.

MEDICAL IMAGE ANALYSIS (2024)

Article Computer Science, Artificial Intelligence

Residual Aligner-based Network (RAN) : Motion-separable structure for coarse-to-fine discontinuous deformable registration

Jian-Qing Zheng, Ziyang Wang, Baoru Huang, Ngee Han Lim, Bartlomiej W. Papiez

Summary: This article introduces a new method for medical image registration, which utilizes a separable motion backbone and a residual aligner module to better handle the discontinuous motion of multiple neighboring objects. The proposed method achieves excellent registration results on abdominal CT scans and lung CT scans.

MEDICAL IMAGE ANALYSIS (2024)

Article Computer Science, Artificial Intelligence

A knowledge-interpretable multi-task learning framework for automated thyroid nodule diagnosis in ultrasound videos

Xiangqiong Wu, Guanghua Tan, Hongxia Luo, Zhilun Chen, Bin Pu, Shengli Li, Kenli Li

Summary: This study develops a user-friendly framework for the automated diagnosis of thyroid nodules in ultrasound videos, simulating the diagnostic workflow of radiologists. By interpreting image characteristics and modeling temporal contextual information, the efficiency and generalizability of the diagnosis can be improved.

MEDICAL IMAGE ANALYSIS (2024)

Article Computer Science, Artificial Intelligence

DeepSSM: A blueprint for image-to-shape deep learning models

Riddhish Bhalodia, Shireen Elhabian, Jadie Adams, Wenzheng Tao, Ladislav Kavan, Ross Whitaker

Summary: This paper introduces DeepSSM, a deep learning-based framework for image-to-shape modeling. By learning the functional mapping from images to low-dimensional shape descriptors, DeepSSM can directly infer statistical representation of anatomy from 3D images. Compared to traditional methods, DeepSSM eliminates the need for heavy manual preprocessing and segmentation, and significantly improves computational time.

MEDICAL IMAGE ANALYSIS (2024)

Article Computer Science, Artificial Intelligence

Automatic registration with continuous pose updates for marker-less surgical navigation in spine surgery

Florentin Liebmann, Marco von Atzigen, Dominik Stutz, Julian Wolf, Lukas Zingg, Daniel Suter, Nicola A. Cavalcanti, Laura Leoty, Hooman Esfandiari, Jess G. Snedeker, Martin R. Oswald, Marc Pollefeys, Mazda Farshad, Philipp Furnstahl

Summary: This study presents a marker-less approach for automatic registration and real-time navigation of lumbar spinal fusion surgery using a deep neural network, avoiding radiation exposure and surgical errors. The method was validated on an ex-vivo surgery and a public dataset.

MEDICAL IMAGE ANALYSIS (2024)

Article Computer Science, Artificial Intelligence

Cycle consistent twin energy-based models for image-to-image translation

Piyush Tiwary, Kinjawl Bhattacharyya, A. P. Prathosh

Summary: Domain shift refers to the change of distributional characteristics between training and testing datasets, leading to performance drop. For medical image tasks, domain shift can be caused by changes in imaging modalities, devices, and staining mechanisms. Existing approaches based on generative models suffer from training difficulties and lack of diversity. In this paper, the authors propose the use of energy-based models (EBMs) for unpaired image-to-image translation in medical images. The proposed method, called Cycle Consistent Twin EBMs (CCT-EBM), employs a pair of EBMs in the latent space of an Auto-Encoder to ensure translation symmetry and coupling between domains.

MEDICAL IMAGE ANALYSIS (2024)

Article Computer Science, Artificial Intelligence

ReFs: A hybrid pre-training paradigm for 3D medical image segmentation

Yutong Xie, Jianpeng Zhang, Lingqiao Liu, Hu Wang, Yiwen Ye, Johan Verjans, Yong Xia

Summary: This paper proposes a hybrid pre-training paradigm that combines self-supervised learning and supervised learning to improve the representation quality for medical image segmentation tasks. It introduces a reference task in self-supervised learning and optimizes the model using a gradient matching method. The experimental results demonstrate the effectiveness of this approach on multiple medical image segmentation benchmarks.

MEDICAL IMAGE ANALYSIS (2024)

Article Computer Science, Artificial Intelligence

Cell classification with worse-case boosting for intelligent cervical cancer screening

Youyi Song, Jing Zou, Kup-Sze Choi, Baiying Lei, Jing Qin

Summary: Cell classification is crucial for intelligent cervical cancer screening, but the variation in cells' appearance and shape poses challenges. A new learning algorithm, worse-case boosting, is proposed to improve classification accuracy for under-represented data. Experimental results demonstrate the effectiveness of this algorithm in two publicly available datasets, achieving a 4% improvement in accuracy.

MEDICAL IMAGE ANALYSIS (2024)

Article Computer Science, Artificial Intelligence

Self-supervised multi-modal training from uncurated images and reports enables monitoring AI in radiology

Sangjoon Park, Eun Sun Lee, Kyung Sook Shin, Jeong Eun Lee, Jong Chul Ye

Summary: The increasing demand for AI systems to monitor human errors and abnormalities in healthcare presents challenges. This study presents a model called Medical X-VL, which is tailored for the medical domain and outperformed current state-of-the-art models in two medical image datasets. The model enables various zero-shot tasks for monitoring AI in the medical domain.

MEDICAL IMAGE ANALYSIS (2024)

Article Computer Science, Artificial Intelligence

Signal domain adaptation network for limited-view optoacoustic tomography

Anna Klimovskaia Susmelj, Berkan Lafci, Firat Ozdemir, Neda Davoudi, Xose Luis Dean-Ben, Fernando Perez-Cruz, Daniel Razansky

Summary: Optoacoustic imaging is a technique that uses optical excitation and ultrasound detection for biological tissue imaging. The quality of the images depends on the extent of tomographic coverage provided by the ultrasound detector arrays. However, full coverage is not always possible due to experimental constraints. The proposed signal domain adaptation network aims to reduce limited-view artifacts in the images.

MEDICAL IMAGE ANALYSIS (2024)

Article Computer Science, Artificial Intelligence

SynCLay: Interactive synthesis of histology images from bespoke cellular layouts

Srijay Deshpande, Muhammad Dawood, Fayyaz Minhas, Nasir Rajpoot

Summary: In this work, a novel framework called SynCLay is proposed for automated synthesis of histology images based on user-defined cellular layouts. The framework can generate realistic and high-quality histology images with different cellular arrangements, which is helpful for studying the role of cells in the tumor microenvironment. The framework integrates a nuclear segmentation and classification model to refine nuclear structures and generate nuclear masks. Evaluation using quantitative metrics and feedback from pathologists shows that the synthetic images generated by SynCLay have high realism scores and can accurately differentiate between benign and malignant tumors.

MEDICAL IMAGE ANALYSIS (2024)

Article Computer Science, Artificial Intelligence

CenTime: Event-conditional modelling of censoring in survival analysis

Ahmed H. Shahin, An Zhao, Alexander C. Whitehead, Daniel C. Alexander, Joseph Jacob, David Barber

Summary: Survival analysis is a valuable tool in healthcare for predicting the time to specific events. This paper introduces CenTime, a novel approach that directly estimates the time to event. The method performs well with censored data and can be easily integrated with deep learning models. Compared to standard methods, CenTime offers superior performance in predicting event time while maintaining comparable ranking performance.

MEDICAL IMAGE ANALYSIS (2024)

Article Computer Science, Artificial Intelligence

Do we really need dice? The hidden region-size biases of segmentation losses

Bingyuan Liu, Jose Dolz, Adrian Galdran, Riadh Kobbi, Ismail Ben Ayed

Summary: Most segmentation losses, such as CE and Dice, are variants of the Cross-Entropy or Dice losses. This work provides a theoretical analysis that shows a deeper connection between CE and Dice than previously thought. From a constrained-optimization perspective, both CE and Dice decompose into similar ground-truth matching terms and region-size penalty terms. The analysis uncovers hidden region-size biases: Dice has an intrinsic bias towards extremely imbalanced solutions, while CE implicitly encourages the ground-truth region proportions. Based on this analysis, a principled and simple solution is proposed to explicitly control the region-size bias.

MEDICAL IMAGE ANALYSIS (2024)