☆ 4.5 Article

CNN-G: Convolutional Neural Network Combined With Graph for Image Segmentation With Theoretical Analysis

IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS (2021)

Journal

IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS

Volume 13, Issue 3, Pages 631-644

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

DOI: 10.1109/TCDS.2020.2998497

Keywords

Image segmentation; Feature extraction; Semantics; Image edge detection; Deep learning; Convolutional neural networks; Graph neural network (GNN); image segmentation; self-attention; structure pattern learning

Funding

National Key Research and Development Plan [2017YFC1700106]
International Partnership Program of Chinese Academy of Sciences [GJHZ1849]
Chinese Academy of Medical Sciences Innovation Fund for Medical Sciences (CIFMS) [2018-I2M-AI004]
Open Fund of the State Key Laboratory of Management and Control for Complex Systems [20180107]

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Automated Summary New
Abstract

The article introduces a novel model that transforms semantic segmentation into graph node classification to address the inadequacy of deep convolutional neural networks in extracting overall structure information. The model represents the image as a graph, with nodes initialized by feature maps obtained by CNN and edges reflecting relationships between nodes. By introducing a graph neural network for node classification, the model effectively expands the receptive field.

Deep convolutional neural network (CNN), although recognized to be considerably successful in its application to semantic segmentation, is inadequate for extracting the overall structure information, for its representing images with the data in the Euclidean space. To improve this inadequacy, a new model in the graph domain that transforms semantic segmentation into graph node classification is proposed for semantic segmentation. In this model, the image is represented by a graph, with its nodes initialized by the feature map obtained by a CNN, and its edges reflecting the relationships of the nodes. The node relationships that are taken into consideration include distance-based ones and semantic ones, respectively, calculated with the Gauss kernel function and attention mechanism. The graph neural network is also introduced in this model for the classification of graph nodes, which can expand the receptive field without the loss of location information and combine the structure with the feature extraction. Most importantly, it is theoretically concluded that the proposed graph model takes the same role as a Laplace regularization term in image segmentation, which has been proven by multiple comparative experiments that show the effectiveness of the model in image semantic segmentation. The learned attention is visualized by the heatmap to validate the structure learning ability of our model. The results of these experiments show the importance of structural information in image segmentation. Hence, an idea of deep learning combined with graph structural information is provided in theory and method.

CNN-G: Convolutional Neural Network Combined With Graph for Image Segmentation With Theoretical Analysis

Journal

IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

CNN-G: Convolutional Neural Network Combined With Graph for Image Segmentation With Theoretical Analysis

Journal

IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper