4.7 Article

RVboost: RNA-seq variants prioritization using a boosting method

Journal

BIOINFORMATICS
Volume 30, Issue 23, Pages 3414-3416

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btu577

Keywords

-

Funding

  1. Everett and Jane Hauck to the Center for Individualized Medicine at Mayo Clinic Jacksonville Florida
  2. Donna Foundation
  3. proceeds of the National Marathon to Fight Breast Cancer
  4. NIH [P50 CA097274]
  5. NATIONAL CANCER INSTITUTE [P50CA097274] Funding Source: NIH RePORTER

Ask authors/readers for more resources

Motivation: RNA-seq has become the method of choice to quantify genes and exons, discover novel transcripts and detect fusion genes. However, reliable variant identification from RNA-seq data remains challenging because of the complexities of the transcriptome, the challenges of accurately mapping exon boundary spanning reads and the bias introduced during the sequencing library preparation. Method: We developed RVboost, a novel method specific for RNA variant prioritization. RVboost uses several attributes unique in the process of RNA library preparation, sequencing and RNA-seq data analyses. It uses a boosting method to train a model of 'good quality' variants using common variants from HapMap, and prioritizes and calls the RNA variants based on the trained model. We packaged RVboost in a comprehensive workflow, which integrates tools of variant calling, annotation and filtering. Results: RVboost consistently outperforms the variant quality score recalibration from the Genome Analysis Tool Kit and the RNA-seq variant-calling pipeline SNPiR in 12 RNA-seq samples using ground-truth variants from paired exome sequencing data. Several RNA-seq- specific attributes were identified as critical to differentiate true and false variants, including the distance of the variant positions to exon boundaries, and the percent of the reads supporting the variant in the first six base pairs. The latter identifies false variants introduced by the random hexamer priming during the library construction.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available