4.6 Article Proceedings Paper

Dual-Path Adversarial Learning for Fully Convolutional Network (FCN)-Based Medical Image Segmentation

期刊

VISUAL COMPUTER
卷 34, 期 6-8, 页码 1043-1052

出版社

SPRINGER
DOI: 10.1007/s00371-018-1519-5

关键词

Adversarial learning; Fully convolutional networks (FCNs); Segmentation; Regions of interest (ROI)

资金

  1. Australia Research Council (ARC) grants

向作者/读者索取更多资源

Segmentation of regions of interest (ROIs) in medical images is an important step for image analysis in computer-aided diagnosis systems. In recent years, segmentation methods based on fully convolutional networks (FCNs) have achieved great success in general images. FCN performance is primarily due to it leveraging large labeled datasets to hierarchically learn the features that correspond to the shallow appearance as well as the deep semantics of the images. However, such dependence on large dataset does not translate well into medical images where there is a scarcity of annotated medical training data, and FCN results in coarse ROI detections and poor boundary definitions. To overcome this limitation, medical image-specific FCN methods have been introduced with post-processing techniques to refine the segmentation results; however, the performance of these methods is reliant on the appropriate tuning of a large number of parameters and dependence on data-specific post-processing techniques. In this study, we leverage the state-of-the-art image feature learning method of generative adversarial network (GAN) for its inherent ability to produce consistent and realistic images features by using deep neural networks and adversarial learning concept. We improve upon GAN such that ROI features can be learned at different levels of complexities (simple and complex), in a controlled manner, via our proposed dual-path adversarial learning (DAL). The outputs from our DAL are then augmented to the learned ROI features into the existing FCN training data, which increases the overall feature diversity. We conducted experiments on three public datasets with a variety of visual characteristics. Our results demonstrate that our DAL can improve FCN-based segmentation methods and outperform or be competitive in performances to the state-of-the-art methods without using medical image-specific optimizations.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Computer Science, Artificial Intelligence

ECSU-Net: An Embedded Clustering Sliced U-Net Coupled With Fusing Strategy for Efficient Intervertebral Disc Segmentation and Classification

Anam Nazir, Muhammad Nadeem Cheema, Bin Sheng, Ping Li, Huating Li, Guangtao Xue, Jing Qin, Jinman Kim, David Dagan Feng

Summary: This study proposes an Embedded Clustering Sliced U-Net (ECSU-Net) based on 2D U-Net for automatic vertebra segmentation. By integrating three modules, this method achieves excellent performance in terms of computational efficiency and accuracy.

IEEE TRANSACTIONS ON IMAGE PROCESSING (2022)

Article Neurosciences

Enhancing medical image registration via appearance adjustment networks

Mingyuan Meng, Lei Bi, Michael Fulham, David Dagan Feng, Jinman Kim

Summary: This study proposes an Appearance Adjustment Network (AAN) to enhance the adaptability of deep learning-based registration methods (DLRs) to appearance variations. By providing appearance transformations and generating anatomy-preserving transformations through an anatomy-constrained loss function, our AAN improves the performance of DLRs. Experimental results show that our AAN outperforms state-of-the-art optimization-based registration methods (ORs) and existing DLRs on three public datasets.

NEUROIMAGE (2022)

Article Computer Science, Information Systems

DeepMTS: Deep Multi-Task Learning for Survival Prediction in Patients With Advanced Nasopharyngeal Carcinoma Using Pretreatment PET/CT

Mingyuan Meng, Bingxin Gu, Lei Bi, Shaoli Song, David Dagan Feng, Jinman Kim

Summary: This study proposes a 3D deep multi-task survival model for advanced NPC, which performs survival prediction and tumor segmentation simultaneously. By introducing a hard-sharing segmentation backbone, interference from non-relevant background information is reduced. In addition, a cascaded survival network is introduced to capture prognostic information beyond the primary tumor, leveraging global tumor information derived from the segmentation backbone. Experimental results with two clinical datasets demonstrate the superior performance of the proposed model compared to traditional radiomics-based survival prediction models and existing deep survival models.

IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS (2022)

Article Computer Science, Interdisciplinary Applications

Graph-Based Intercategory and Intermodality Network for Multilabel Classification and Melanoma Diagnosis of Skin Lesions in Dermoscopy and Clinical Images

Xiaohang Fu, Lei Bi, Ashnil Kumar, Michael Fulham, Jinman Kim

Summary: The identification of melanoma can be done through the analysis of clinical and dermoscopy images. Current methods lack the ability to fully utilize information from both modalities and exploit the intercategory relationships in the 7PC. This study proposes a graph-based network with two modules to address these limitations and improves classification performance.

IEEE TRANSACTIONS ON MEDICAL IMAGING (2022)

Article Biology

Deep multimodal graph-based network for survival prediction from highly multiplexed images and patient variables

Xiaohang Fu, Ellis Patrick, Jean Y. H. Yang, David Dagan Feng, Jinman Kim

Summary: The spatial architecture and phenotypic heterogeneity of tumor cells are associated with cancer prognosis and outcomes. Imaging mass cytometry captures high-dimensional maps of disease-relevant biomarkers at single-cell resolution, which can inform patient-specific prognosis. However, existing methods for survival prediction do not utilize spatial phenotype information at the single-cell level, and there is a lack of end-to-end methods that integrate imaging data with clinical information for improved accuracy. We propose a deep multimodal graph-based network that considers spatial phenotype information and clinical variables to enhance survival prediction, and demonstrate its effectiveness in breast cancer datasets.

COMPUTERS IN BIOLOGY AND MEDICINE (2023)

Article Computer Science, Information Systems

Vision-Language Transformer for Interpretable Pathology Visual Question Answering

Usman Naseem, Matloob Khushi, Jinman Kim

Summary: Pathology visual question answering (PathVQA) aims to answer medical questions using pathology images. Existing methods have limitations in capturing the high and low-level interactions between vision and language features required for VQA. Additionally, these methods lack interpretability in justifying the retrieved answers. To address these limitations, a vision-language transformer called TraP-VQA is introduced, which embeds vision and language features for interpretable PathVQA. Our experiments demonstrate that TraP-VQA outperforms state-of-the-art methods and validate its robustness on medical VQA datasets, along with the capability of the integrated vision-language model. Visualization results explain the reasoning behind the retrieved PathVQA answers.

IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS (2023)

Article Automation & Control Systems

Unsupervised Landmark Detection-Based Spatiotemporal Motion Estimation for 4-D Dynamic Medical Images

Yuyu Guo, Lei Bi, Dongming Wei, Liyun Chen, Zhengbin Zhu, Dagan Feng, Ruiyan Zhang, Qian Wang, Jinman Kim

Summary: In this study, we propose a dense-sparse-dense (DSD) motion estimation framework that utilizes unsupervised 3D landmark detection network and motion reconstruction network to extract sparse landmarks and construct motion field in two stages. The method improves the accuracy of motion estimation and preserves anatomical topology.

IEEE TRANSACTIONS ON CYBERNETICS (2023)

Article Computer Science, Interdisciplinary Applications

A Shortened Model for Logan Reference Plot Implemented via the Self-Supervised Neural Network for Parametric PET Imaging

Wenxiang Ding, Qiaoqiao Ding, Kewei Chen, Miao Zhang, Li Lv, David Dagan Feng, Lei Bi, Jinman Kim, Qiu Huang

Summary: Dynamic PET imaging provides more comprehensive physiological information than conventional static PET imaging. The proposed modified Logan reference plot model and self-supervised convolutional neural network improve noise performance and accurately estimate the distribution volume ratio in dynamic PET with a shortened scanning protocol. The method has the potential to add clinical value by providing both DVR and SUV simultaneously.

IEEE TRANSACTIONS ON MEDICAL IMAGING (2023)

Article Computer Science, Cybernetics

RHMD: A Real-World Dataset for Health Mention Classification on Reddit

Usman Naseem, Matloob Khushi, Jinman Kim, Adam G. Dunn

Summary: People on social media using disease and symptom words to discuss their health can introduce biases in data-driven public health applications. This study presents a new dataset called RHMD, which consists of 10,015 manually annotated Reddit posts. The dataset is labeled with four categories and provides a comprehensive performance analysis of baseline methods. The release of this dataset is expected to facilitate the development of new methods for detecting health mentions in user-generated text.

IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS (2023)

Review Automation & Control Systems

A Review of Predictive and Contrastive Self-supervised Learning for Medical Images

Wei-Chien Wang, Euijoon Ahn, Dagan Feng, Jinman Kim

Summary: Over the last decade, supervised deep learning has made significant progress in computer vision tasks using manually annotated big data. However, the limited availability of high-quality annotated medical imaging data hinders the application of deep learning in medical image analysis. A potential solution is the use of self-supervised learning (SSL), particularly contrastive SSL, which has shown promise in rivaling or surpassing supervised learning. This review examines state-of-the-art contrastive SSL algorithms originally designed for natural images, explores their adaptations for medical images, and discusses recent advances, current limitations, and future directions in applying contrastive SSL in the medical domain.

MACHINE INTELLIGENCE RESEARCH (2023)

Proceedings Paper Imaging Science & Photographic Technology

Non-iterative Coarse-to-Fine Registration Based on Single-Pass Deep Cumulative Learning

Mingyuan Meng, Lei Bi, Dagan Feng, Jinman Kim

Summary: In this study, a Non-Iterative Coarse-to-finE registration Network (NICE-Net) is proposed for deformable image registration. By using a Single-pass Deep Cumulative Learning (SDCL) decoder and a Selectively-propagated Feature Learning (SFL) encoder, NICE-Net outperforms state-of-the-art iterative deep registration methods without increasing the runtime.

MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT VI (2022)

Article Computer Science, Cybernetics

Hybrid Text Representation for Explainable Suicide Risk Identification on Social Media

Usman Naseem, Matloob Khushi, Jinman Kim, Adam G. Dunn

Summary: The article introduces a hybrid text representation method for explaining suicide risk identification on social media. The method achieves excellent results on a public suicide dataset and demonstrates advantages in clinical and public health practice.

IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS (2022)

Article Computer Science, Artificial Intelligence

SparseVoxNet: 3-D Object Recognition With Sparsely Aggregation of 3-D Dense Blocks

Ahmad Karambakhsh, Bin Sheng, Ping Li, Huating Li, Jinman Kim, Younhyun Jung, C. L. Philip Chen

Summary: The article introduces a novel solution for 3-D object recognition from volumetric data by combining three compact CNN models, low-cost SparseNet, and feature representation technique. By estimating extra geometrical information, an optimized network is achieved and improves the recognition results.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2022)

暂无数据