Incorporating Link Information in Feature Selection for Identifying Tumor Biomarkers by Using miRNA-mRNA Paired Expression Data

CURRENT PROTEOMICS (2018)

Journal

CURRENT PROTEOMICS

Volume 15, Issue 2, Pages 165-171

Publisher

BENTHAM SCIENCE PUBL LTD

DOI: 10.2174/1570164614666171031160232

Keywords

Feature selection; miRNA-mRNA paired expression; regularization; tumor; biomarker; gene expression

Categories

Biochemical Research Methods; Biochemistry & Molecular Biology

Funding

Shanghai Municipal Natural Science Foundation [16ZR1448700]
Scientific Research Foundation for the Returned Overseas Chinese Scholars, State Education Ministry

Abstract

Background: Feature selection methods are commonly used in differential expression analysis. The selected genes can serve as potential biomarkers and play important roles in disease diagnosis and prognosis. Recently, many studies have shown that an efficient way to enhance the performance of feature selection is to incorporate data properties, such as the correlation between instances or attributes in heterogeneous data. Gene expression data is a typical kind of linked data, in which genes are related by co-regulation and samples are grouped by similar disease status. However, most analysis approaches for gene expression data are designed for generic data and do not take these data characteristics into account.

Objective: In this paper, we aim to identify miRNA biomarkers by using feature selection methods. Because mRNA-miRNA parallel expression data are abundant, mining the linked data can provide valuable information for feature selection and biomarker identification.

Method: Using mRNA-miRNA paired data, we infer connections between data samples from mRNA expression levels, and incorporate the link information into a graph regularization method to achieve feature selection for miRNAs.

Results: The experiments were conducted on three public miRNA-mRNA microarray data sets. The new method greatly reduces feature dimensionality and achieves high classification accuracy. Experimental comparisons show that it outperforms the classic regularization methods and state-of-the-art feature selection methods.

Conclusion: Taking data properties into consideration is an effective way to improve the performance of feature selection. Specifically, link information in gene expression data provides useful hints for designing structured regularization and assists biomarker identification.
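The Method paragraph can be illustrated with a minimal sketch. The abstract does not give the objective function or the way links are inferred, so everything specific here is an assumption: sample links come from a symmetric k-nearest-neighbour graph on Euclidean distances between mRNA profiles, and miRNA features are ranked by a least-squares model with a graph-Laplacian penalty and an L2,1 sparsity penalty, solved by iteratively reweighted least squares. The function names `knn_graph_laplacian` and `graph_regularized_selection` and the parameters `k`, `alpha`, and `beta` are hypothetical and are not taken from the paper.

```python
import numpy as np


def knn_graph_laplacian(mrna, k=3):
    """Unnormalized Laplacian of a symmetric kNN graph over samples (rows of mrna).

    Assumption: links between samples are inferred from Euclidean distance
    between their mRNA expression profiles.
    """
    n = mrna.shape[0]
    # Pairwise squared Euclidean distances between samples.
    sq = np.sum(mrna ** 2, axis=1)
    dist = sq[:, None] + sq[None, :] - 2.0 * (mrna @ mrna.T)
    np.fill_diagonal(dist, np.inf)  # a sample is never its own neighbour
    nearest = np.argsort(dist, axis=1)[:, :k]
    adj = np.zeros((n, n))
    adj[np.repeat(np.arange(n), k), nearest.ravel()] = 1.0
    adj = np.maximum(adj, adj.T)  # keep a link if either sample chose the other
    return np.diag(adj.sum(axis=1)) - adj


def graph_regularized_selection(mirna, labels, laplacian,
                                alpha=1.0, beta=1.0, n_iter=50, eps=1e-8):
    """Score miRNA features (columns of mirna); larger means more relevant.

    Assumed objective (not the paper's stated one):
        ||XW - Y||_F^2 + alpha * tr(W^T X^T L X W) + beta * ||W||_{2,1}
    with X and the one-hot label matrix Y column-centred.
    """
    classes = np.unique(labels)
    Y = (labels[:, None] == classes[None, :]).astype(float)
    Y = Y - Y.mean(axis=0)
    X = mirna - mirna.mean(axis=0)
    d = X.shape[1]
    A = X.T @ X + alpha * (X.T @ laplacian @ X)
    B = X.T @ Y
    D = np.eye(d)  # reweighting matrix for the L2,1 term
    for _ in range(n_iter):
        # Stationarity condition of the reweighted problem: (A + beta*D) W = B.
        W = np.linalg.solve(A + beta * D, B)
        row_norms = np.sqrt(np.sum(W ** 2, axis=1))
        D = np.diag(1.0 / (2.0 * np.maximum(row_norms, eps)))
    return np.sqrt(np.sum(W ** 2, axis=1))
```

To use it, pass an mRNA matrix and a miRNA matrix whose rows are the same samples in the same order, then rank miRNAs with `np.argsort(scores)[::-1]`. The Laplacian term penalizes models whose predictions differ between linked samples, which is one common way to encode link information in a regularizer.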
