4.6 Article

Detecting Interactive Gene Groups for Single-Cell RNA-Seq Data Based on Co-Expression Network Analysis and Subgraph Learning

期刊

CELLS
卷 9, 期 9, 页码 -

出版社

MDPI
DOI: 10.3390/cells9091938

关键词

single-cell RNA-seq; machine learning; interactive gene groups; co-expression networks; subgraph learning

资金

  1. New Energy and Industrial Technology Development Organization (NEDO)
  2. Japan Society for the Promotion of Science (JSPS) [18H03250, 19K20280]
  3. Grants-in-Aid for Scientific Research [18H03250, 19K20280] Funding Source: KAKEN

向作者/读者索取更多资源

High-throughput sequencing technologies have enabled the generation of single-cell RNA-seq (scRNA-seq) data, which explore both genetic heterogeneity and phenotypic variation between cells. Some methods have been proposed to detect the related genes causing cell-to-cell variability for understanding tumor heterogeneity. However, most existing methods detect the related genes separately, without considering gene interactions. In this paper, we proposed a novel learning framework to detect the interactive gene groups for scRNA-seq data based on co-expression network analysis and subgraph learning. We first utilized spectral clustering to identify the subpopulations of cells. For each cell subpopulation, the differentially expressed genes were then selected to construct a gene co-expression network. Finally, the interactive gene groups were detected by learning the dense subgraphs embedded in the gene co-expression networks. We applied the proposed learning framework on a real cancer scRNA-seq dataset to detect interactive gene groups of different cancer subtypes. Systematic gene ontology enrichment analysis was performed to examine the detected genes groups by summarizing the key biological processes and pathways. Our analysis shows that different subtypes exhibit distinct gene co-expression networks and interactive gene groups with different functional enrichment. The interactive genes are expected to yield important references for understanding tumor heterogeneity.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Computer Science, Artificial Intelligence

An oversampling framework for imbalanced classification based on Laplacian eigenmaps

Xiucai Ye, Hongmin Li, Akira Imakura, Tetsuya Sakurai

NEUROCOMPUTING (2020)

Review Biochemistry & Molecular Biology

Better Performance with Transformer: CPPFormer in the Precise Prediction of Cell-penetrating Peptides

Yuyang Xue, Xiucai Ye, Lesong Wei, Xin Zhang, Tetsuya Sakurai, Leyi Wei

Summary: The Transformer model has become the mainstream model in natural language processing, and bioinformatics has made remarkable progress in drug design and protein property prediction through machine learning. This study proposes a deep model-based method called CPPFormer, which achieves the best performance in drug penetration tasks for CPPs by implementing the attention structure of the Transformer and a feature extractor.

CURRENT MEDICINAL CHEMISTRY (2022)

Article Computer Science, Information Systems

Anonymous Dynamic Group Authenticated Key Agreements Using Physical Unclonable Functions for Internet of Medical Things

Tian-Fu Lee, Xiucai Ye, Syuan-Han Lin

Summary: Group authenticated key agreements (GAKAs) using physical unclonable function (PUF) solve the security and efficiency problems of the Internet of Medical Things (IoMT) by enabling authentication and secure information exchange among medical sensor devices.

IEEE INTERNET OF THINGS JOURNAL (2022)

Article Biochemical Research Methods

iLoc-miRNA: extracellular/intracellular miRNA prediction using deep BiLSTM with attention mechanism

Zhao-Yue Zhang, Lin Ning, Xiucai Ye, Yu-He Yang, Yasunori Futamura, Tetsuya Sakurai, Hao Lin

Summary: The location of miRNAs in cells plays a crucial role in their regulatory function. Current prediction algorithms for miRNA subcellular localization have limitations. In this study, a new data partitioning strategy and deep learning algorithm were proposed to accurately predict miRNA subcellular localization and explore the underlying mechanisms through motif analysis. Additionally, a user-friendly web server was established for convenient use.

BRIEFINGS IN BIOINFORMATICS (2022)

Article Physics, Multidisciplinary

Successive Trajectory Privacy Protection with Semantics Prediction Differential Privacy

Jing Zhang, Yanzi Li, Qian Ding, Liwei Lin, Xiucai Ye

Summary: The paper introduces a semantics- and prediction-based differential privacy protection scheme for trajectory data to ensure data privacy and solve the balance issue between privacy budget and service quality.

ENTROPY (2022)

Article Chemistry, Analytical

Enhanced Authenticated Key Agreement for Surgical Applications in a Tactile Internet Environment

Tian-Fu Lee, Xiucai Ye, Wei-Yu Chen, Chi-Chang Chang

Summary: The Tactile Internet plays a significant role in electronic medicine, but inappropriate and insecure authentication key agreements may endanger patients' lives. Researchers propose an enhanced scheme to address the limitations of the Kamil scheme, providing better security functionalities and maintaining a lightweight computational cost.

SENSORS (2022)

Article Biochemical Research Methods

SiameseCPP: a sequence-based Siamese network to predict cell -penetrating peptides by contrastive learning

Xin Zhang, Lesong Wei, Xiucai Ye, Kai Zhang, Saisai Teng, Zhongshen Li, Junru Jin, Minjae Kim, Tetsuya Sakurai, Lizhen Cui, Balachandran Manavalan, Leyi Wei

Summary: A novel deep learning framework SiameseCPP is proposed for automated prediction of cell-penetrating peptides (CPPs). SiameseCPP learns discriminative representations of CPPs based on a well-pretrained model and a Siamese neural network comprising a transformer and gated recurrent units. Comprehensive experiments demonstrate that SiameseCPP outperforms existing baseline models for CPP prediction and exhibits satisfactory generalization ability on other functional peptide datasets.

BRIEFINGS IN BIOINFORMATICS (2023)

Article Biotechnology & Applied Microbiology

Molecular language models: RNNs or transformer?

Yangyang Chen, Zixu Wang, Xiangxiang Zeng, Yayang Li, Pengyong Li, Xiucai Ye, Tetsuya Sakurai

Summary: Language models can learn complex molecular distributions. This research investigates the differences between RNNs and the Transformer-Layer in learning complex molecular distributions and evaluates their performance on various molecular generation tasks. The results show that both language models can learn complex molecular distributions and the SMILES-based representation outperforms SELFIES.

BRIEFINGS IN FUNCTIONAL GENOMICS (2023)

Article Biochemical Research Methods

Adaptive learning embedding features to improve the predictive performance of SARS-CoV-2 phosphorylation sites

Shihu Jiao, Xiucai Ye, Chunyan Ao, Tetsuya Sakurai, Quan Zou, Lei Xu

Summary: The rapid and extensive transmission of SARS-CoV-2 has caused a global health emergency, and identifying phosphorylation sites plays a crucial role in understanding the molecular mechanisms of infection. However, current prediction tools lack accuracy and efficiency. This study presents a comprehensive analysis of SARS-CoV-2 infection in human lung cells and introduces a deep learning predictor, PSPred-ALE, designed to identify phosphorylation sites. The predictor utilizes a self-adaptive learning embedding algorithm and a multihead attention module to improve prediction accuracy. Comparative analysis shows that PSPred-ALE outperforms existing prediction tools. The proposed model can assist biomedical scientists in understanding phosphorylation mechanisms in SARS-CoV-2 infection.

BIOINFORMATICS (2023)

Article Biology

Multi-omics clustering for cancer subtyping based on latent subspace learning

Xiucai Ye, Yifan Shang, Tianyi Shi, Weihang Zhang, Tetsuya Sakurai

Summary: The increased availability of high-throughput technologies has allowed researchers to study disease etiology across multiple omics layers, particularly in improving cancer subtype identification. In this study, a novel multi-omics clustering method called MCLS is proposed, which effectively deals with missing multi-omics data. The method utilizes complete multi-omics data to construct a latent subspace using feature extraction and decomposition techniques, and then applies spectral clustering to identify clusters. Experimental results demonstrate that MCLS outperforms state-of-the-art methods in cancer subtype identification, providing valuable insights into cancer research.

COMPUTERS IN BIOLOGY AND MEDICINE (2023)

Article Biochemical Research Methods

Collaborative analysis for drug discovery by federated learning on non-IID data

Dong Huang, Xiucai Ye, Ying Zhang, Tetsuya Sakurai

Summary: Collaborative analysis has become a promising approach for drug discovery with the availability of large-scale QSAR datasets. This paper proposes a novel framework for collaborative drug discovery using federated learning on non-IID datasets, addressing the difficulty of training on non-IID data by globally sharing a small subset of data among all institutions. The experimental results demonstrate competitive predictive accuracy while respecting data privacy.

METHODS (2023)

Article Biochemical Research Methods

SiameseCPP: a sequence-based Siamese network to predict cell-penetrating peptides by contrastive learning

Xin Zhang, Lesong Wei, Xiucai Ye, Kai Zhang, Saisai Teng, Zhongshen Li, Junru Jin, Min Jae Kim, Tetsuya Sakurai, Lizhen Cui, Balachandran Manavalan, Leyi Wei

Summary: In this study, a novel deep learning framework called SiameseCPP is proposed for automated prediction of CPPs. SiameseCPP utilizes contrastive learning to build a CPP predictive model and demonstrates superior performance and generalizability on multiple functional peptide datasets.

BRIEFINGS IN BIOINFORMATICS (2022)

Proceedings Paper Computer Science, Artificial Intelligence

Ensemble Learning for Cluster Number Detection Based on Shared Nearest Neighbor Graph and Spectral Clustering

Weihang Zhang, Xiucai Ye, Testuya Sakurai

Summary: This paper proposes a clustering method that automatically generates candidate cluster numbers and corresponding cluster partitions, and recovers a shared low-rank similarity matrix through ensemble learning. Spectral clustering is applied to detect the cluster number using the candidate cluster numbers. Experimental results demonstrate that the proposed method can accurately detect the cluster numbers and achieve better clustering results compared to existing methods.

2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) (2022)

Article Computer Science, Information Systems

Adaptive Unsupervised Feature Learning for Gene Signature Identification in Non-Small-Cell Lung Cancer

Xiucai Ye, Weihang Zhang, Tetsuya Sakurai

IEEE ACCESS (2020)

暂无数据