Graph-based semi-supervised learning with genomic data integration using condition-responsive genes applied to phenotype classification

JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION (2018)

Volume 25, Issue 1, Pages 99-108

Publisher

OXFORD UNIV PRESS

DOI: 10.1093/jamia/ocx032

Keywords

gene expression; DNA methylation; semi-supervised learning; graph theory; ovarian cancer; data integration

Funding

Institute for Collaborative Biotechnologies from the US Army Research Office [W911NF-10-2-0111]

Abstract

Data integration methods that combine data from different molecular levels, such as the genome, epigenome, and transcriptome, have received a great deal of interest in the past few years. It has been demonstrated that the synergistic effects of different biological data types can boost learning capabilities and lead to a better understanding of the underlying interactions among molecular levels. In this paper, we present a graph-based semi-supervised classification algorithm that incorporates latent biological knowledge in the form of biological pathways with gene expression and DNA methylation data. The process of graph construction from biological pathways is based on detecting condition-responsive genes, from which 3 sets of genes are finally extracted: all condition-responsive genes, high-frequency condition-responsive genes, and P-value-filtered genes. The proposed approach is applied to ovarian cancer data downloaded from the Human Genome Atlas. Extensive numerical experiments demonstrate superior performance of the proposed approach compared to other state-of-the-art algorithms, including the latest graph-based classification techniques. Simulation results demonstrate that integrating various data types enhances classification performance and leads to a better understanding of interrelations between diverse omics data types. The proposed approach outperforms many of the state-of-the-art data integration algorithms.
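To make the general idea of graph-based semi-supervised classification over integrated data types concrete, the following is a minimal sketch, not the algorithm from the paper. It assumes a generic label-spreading scheme on a sample-similarity graph, with one RBF affinity matrix per data type (for example, expression and methylation) averaged into a single graph. The function names, the RBF kernel, the averaging step, and the parameters `alpha` and `sigma` are all illustrative assumptions; the paper's pathway-based graph construction from condition-responsive genes is not reproduced here.

```python
import numpy as np


def rbf_affinity(X, sigma=1.0):
    """Pairwise RBF similarity between samples (rows of X), zero on the diagonal."""
    sq = np.sum(X ** 2, axis=1)
    d2 = np.maximum(sq[:, None] + sq[None, :] - 2.0 * X @ X.T, 0.0)
    W = np.exp(-d2 / (2.0 * sigma ** 2))
    np.fill_diagonal(W, 0.0)
    return W


def normalize(W):
    """Symmetric normalization D^{-1/2} W D^{-1/2}."""
    d = W.sum(axis=1)
    d_inv_sqrt = 1.0 / np.sqrt(np.maximum(d, 1e-12))
    return W * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]


def propagate_labels(views, y, alpha=0.9, sigma=1.0):
    """Label spreading on a graph averaged over several data types.

    views: list of (n_samples, n_features) arrays, one per data type
           (hypothetical stand-ins for expression and methylation matrices).
    y:     integer NumPy array of labels, with -1 marking unlabeled samples.
    """
    n = len(y)
    # Assumption: integrate data types by averaging their normalized graphs.
    S = sum(normalize(rbf_affinity(X, sigma)) for X in views) / len(views)
    classes = np.unique(y[y >= 0])
    Y = np.zeros((n, len(classes)))
    for j, c in enumerate(classes):
        Y[y == c, j] = 1.0
    # Closed-form solution of the spreading iteration F = alpha * S @ F + Y.
    F = np.linalg.solve(np.eye(n) - alpha * S, Y)
    return classes[np.argmax(F, axis=1)]
```

Calling `propagate_labels([expr, meth], y)` with two sample-by-feature matrices and a partially labeled vector `y` returns a predicted class for every sample, including the labeled ones. Averaging the graphs is only one simple integration choice; weighted combinations or graphs built from prior knowledge would slot into the same step.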

SMGCL: Semi-supervised Multi-view Graph Contrastive Learning

Hui Zhou, Maoguo Gong, Shanfeng Wang, Yuan Gao, Zhongying Zhao

Summary: Graph contrastive learning (GCL) aims to generate supervision information by transforming graph data itself, and it has become a focus of graph research recently. However, most GCL methods are unsupervised and struggle with balancing multi-view graph information. To address this, we propose a semi-supervised multi-view graph contrastive learning (SMGCL) framework for graph classification. The framework captures comparative relations between label-independent and label-dependent node pairs across different views and incorporates a label augmentation module and a shared decoder module to enhance discriminative representations and extract underlying relationships between representations and graph topology. Experimental results demonstrate the superiority of our proposed framework for graph classification tasks.

KNOWLEDGE-BASED SYSTEMS (2023)