☆ 4.6 Article

Predicting microRNA precursors with a generalized Gaussian components based density estimation algorithm

BMC BIOINFORMATICS (2010)

Journal

BMC BIOINFORMATICS

Volume 11, Issue -, Pages -

Publisher

BMC

DOI: 10.1186/1471-2105-11-S1-S52

Keywords

Funding

National Science Council of the Republic of China, Taiwan [NSC 97-2627-P-001-002, NSC 96-2320-B-006-027-MY2, NSC 96-2221-E-006-232-MY2]

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

Background: MicroRNAs (miRNAs) are short non-coding RNA molecules, which play an important role in post-transcriptional regulation of gene expression. There have been many efforts to discover miRNA precursors (pre-miRNAs) over the years. Recently, ab initio approaches have attracted more attention because they do not depend on homology information and provide broader applications than comparative approaches. Kernel based classifiers such as support vector machine (SVM) are extensively adopted in these ab initio approaches due to the prediction performance they achieved. On the other hand, logic based classifiers such as decision tree, of which the constructed model is interpretable, have attracted less attention. Results: This article reports the design of a predictor of pre-miRNAs with a novel kernel based classifier named the generalized Gaussian density estimator (G(2)DE) based classifier. The G(2)DE is a kernel based algorithm designed to provide interpretability by utilizing a few but representative kernels for constructing the classification model. The performance of the proposed predictor has been evaluated with 692 human pre-miRNAs and has been compared with two kernel based and two logic based classifiers. The experimental results show that the proposed predictor is capable of achieving prediction performance comparable to those delivered by the prevailing kernel based classification algorithms, while providing the user with an overall picture of the distribution of the data set. Conclusion: Software predictors that identify pre-miRNAs in genomic sequences have been exploited by biologists to facilitate molecular biology research in recent years. The G(2)DE employed in this study can deliver prediction accuracy comparable with the state-of-the-art kernel based machine learning algorithms. Furthermore, biologists can obtain valuable insights about the different characteristics of the sequences of pre-miRNAs with the models generated by the G(2)DE based predictor.

Authors

I am an author on this paper

Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6

Not enough ratings

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Bayesian Aerosol Retrieval-Based PM2.5 Estimation through Hierarchical Gaussian Process Models

Junbo Zhang, Daoji Li, Yingzhi Xia, Qifeng Liao

Summary: This paper proposes a new two-step approach to estimate 1-km-resolution PM2.5 concentrations in Shanghai using satellite data. The approach refines AOD data to a higher resolution using a Bayesian retrieval method and then uses a hierarchical Gaussian process model to estimate PM2.5 concentrations. The results show accurate predictive performance of the proposed approach.

MATHEMATICS (2022)