4.7 Article

PEA: an integrated R toolkit for plant epitranscriptome analysis

Journal

BIOINFORMATICS
Volume 34, Issue 21, Pages 3747-3749

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/bty421

Keywords

-

Funding

  1. National Natural Science Foundation of China [31570371]
  2. Youth 1000-Talent Program of China
  3. Hundred Talents Program of Shaanxi Province of China
  4. Agricultural Science and Technology Innovation and Research Project of Shaanxi Province, China [2015NY011]
  5. Youth Talent Program of State Key Laboratory of Crop Stress Biology for Arid Areas [CSBAAQN2016001]
  6. Fund of Northwest AF University

Ask authors/readers for more resources

Motivation: The epitranscriptome, also known as chemical modifications of RNA (CMRs), is a newly discovered layer of gene regulation, the biological importance of which emerged through analysis of only a small fraction of CMRs detected by high-throughput sequencing technologies. Understanding of the epitranscriptome is hampered by the absence of computational tools for the systematic analysis of epitranscriptome sequencing data. In addition, no tools have yet been designed for accurate prediction of CMRs in plants, or to extend epitranscriptome analysis from a fraction of the transcriptome to its entirety. Results: Here, we introduce PEA, an integrated R toolkit to facilitate the analysis of plant epitranscriptome data. The PEA toolkit contains a comprehensive collection of functions required for read mapping, CMR calling, motif scanning and discovery and gene functional enrichment analysis. PEA also takes advantage of machine learning (ML) technologies for transcriptome-scale CMR prediction, with high prediction accuracy, using the Positive Samples Only Learning algorithm, which addresses the two-class classification problem by using only positive samples (CMRs), in the absence of negative samples (non-CMRs). Hence PEA is a versatile epitranscriptome analysis pipeline covering CMR calling, prediction and annotation and we describe its application to predict N-6-methyladenosine (m(6)A) modifications in Arabidopsis thaliana. Experimental results demonstrate that the toolkit achieved 71.6% sensitivity and 73.7% specificity, which is superior to existing m(6)A predictors. PEA is potentially broadly applicable to the in-depth study of epitranscriptomics.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available