Article

Subtype-WESLR: identifying cancer subtype with weighted ensemble sparse latent representation of multi-view data

BRIEFINGS IN BIOINFORMATICS (2022)

Journal

BRIEFINGS IN BIOINFORMATICS

Volume 23, Issue 1, Pages -

Publisher

OXFORD UNIV PRESS

DOI: 10.1093/bib/bbab398

Keywords

subtype discovery; multi-omics integration; weighted ensemble clustering; sparse subspace learning; Laplacian regularization

Categories

Biochemical Research Methods; Mathematical & Computational Biology

Funding

National Natural Science Foundation of China [11631015, 12026601, U1611265]

Summary

Identifying cancer subtypes from high-throughput, multi-view omics data has become a significant research topic in oncology. Subtype-WESLR uses a weighted ensemble sparse latent representation to identify subtypes, and the authors report that it outperforms competing methods.

Abstract

The discovery of cancer subtypes has become a much-researched topic in oncology. Dividing cancer patients into subtypes makes it possible to personalize treatment for a heterogeneous patient population. High-throughput technologies provide multiple types of omics data for cancer subtyping. Many computational methods integrate multi-view data to identify cancer subtypes, yet they obtain different subtypes for the same cancer, even when using the same multi-omics data. To a certain extent, the subtypes produced by distinct methods are related, and this relationship may help guide cancer subtyping. The challenge is to use the valuable information in these distinct subtypes effectively to produce more accurate and reliable subtypes. A weighted ensemble sparse latent representation (subtype-WESLR) is proposed to detect cancer subtypes on heterogeneous omics data. Using a weighted ensemble strategy to fuse the base clusterings obtained by distinct methods as prior knowledge, subtype-WESLR projects each sample's feature profile from each data type into a common latent subspace while preserving the local structure of the original sample feature space and consistency with the weighted ensemble. It then optimizes the common subspace iteratively to identify cancer subtypes. We conduct experiments on various synthetic datasets and eight public multi-view datasets from The Cancer Genome Atlas. The results show that subtype-WESLR outperforms competing methods by integrating the base clusterings of existing methods to obtain more precise subtypes.
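The abstract does not spell out how the base clusterings are fused, so the following is only a minimal sketch of one common way to build a weighted ensemble: average the co-association matrices of the base clusterings. The function names, the example labelings, and the weights are all hypothetical and are not taken from the paper; subtype-WESLR's actual weighting scheme and its latent-subspace optimization are not reproduced here.

```python
import numpy as np

def co_association(labels):
    """Return an n x n 0/1 matrix whose (i, j) entry is 1 when
    samples i and j are assigned to the same cluster."""
    labels = np.asarray(labels)
    return (labels[:, None] == labels[None, :]).astype(float)

def weighted_ensemble(base_labelings, weights):
    """Weighted average of the co-association matrices of several
    base clusterings (hypothetical helper, not the paper's method)."""
    weights = np.asarray(weights, dtype=float)
    weights = weights / weights.sum()  # normalise so entries stay in [0, 1]
    return sum(w * co_association(l) for w, l in zip(weights, base_labelings))

# Three made-up base clusterings of six patients, e.g. from three methods.
base = [
    [0, 0, 0, 1, 1, 1],
    [0, 0, 1, 1, 1, 1],
    [1, 1, 1, 0, 0, 0],  # same partition as the first, labels permuted
]
weights = [0.5, 0.25, 0.25]  # assumed weights, chosen only for illustration

S = weighted_ensemble(base, weights)
print(np.round(S, 2))
```

Each entry of `S` is the weighted fraction of base clusterings that place two patients in the same subtype, so the matrix is invariant to how each method numbers its clusters. A matrix like this could serve as the kind of prior knowledge the abstract describes, although how the paper uses it inside the optimization is not shown in this record.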


Parea: Multi-view ensemble clustering for cancer subtype discovery

Bastian Pfeifer, Marcus D. Bloice, Michael G. Schimek

Summary: Multi-view clustering methods are crucial for stratifying patients into sub-groups based on similar molecular characteristics. We introduce Parea, a multi-view hierarchical ensemble clustering approach that outperforms the current state-of-the-art on six out of seven analyzed cancer types. We have integrated the Parea method into our Python package Pyrea (https://github.com/mdbloice/Pyrea), which facilitates the effortless and flexible design of ensemble workflows incorporating various fusion and clustering algorithms.

JOURNAL OF BIOMEDICAL INFORMATICS (2023)