4.5 Article

Improving N6-methyladenosine site prediction with heuristic selection of nucleotide physical-chemical properties

Journal

ANALYTICAL BIOCHEMISTRY
Volume 508, Issue -, Pages 104-113

Publisher

ACADEMIC PRESS INC ELSEVIER SCIENCE
DOI: 10.1016/j.ab.2016.06.001

Keywords

RNA sequence; N-6-methyladenosine; Physical-chemical property selection; Feature representation; Support vector machine

Funding

  1. National Natural Science Foundation of China [61373062, 61233011, 61572242, 61222306]
  2. Natural Science Foundation of Jiangsu [BK20141403]
  3. China Postdoctoral Science Foundation [2014T70526]
  4. Fundamental Research Funds for the Central Universities [30916011327]
  5. Science and Technology Commission of Shanghai Municipality [16JC1404300]
  6. The Six Top Talents of Jiangsu Province [2013-XXRJ-022]

Ask authors/readers for more resources

N-6-methyladenosine (m(6)A) is one of the most common and abundant post-transcriptional RNA modifications found in viruses and most eukaryotes. m(6)A plays an essential role in many vital biological processes to regulate gene expression. Because of its widespread distribution across the genomes, the identification of m(6)A sites from RNA sequences is of significant importance for better understanding the regulatory mechanism of m(6)A. Although progress has been achieved in m(6)A site prediction, challenges remain. This article aims to further improve the performance of m(6)A site prediction by introducing a new heuristic nucleotide physical-chemical property selection (HPCS) algorithm. The proposed HPCS algorithm can effectively extract an optimized subset of nucleotide physical chemical properties under the prescribed feature representation for encoding an RNA sequence into a feature vector. We demonstrate the efficacy of the proposed HPCS algorithm under different feature representations, including pseudo dinucleotide composition (PseDNC), auto-covariance (AC), and cross-covariance (CC). Based on the proposed HPCS algorithm, we implemented an m6A site predictor, called M(6)A-HPCS, which is freely available at http://csbio.njust.edu.cn/bioinf/M(6)A-HPCS. Experimental results over rigorous jackknife tests on benchmark datasets demonstrated that the proposed M(6)A-HPCS achieves higher success rates and outperforms existing state-of-the-art sequence-based m(6)A site predictors. (C) 2016 Elsevier Inc. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available