4.6 Article

Short paired-end reads trump long single-end reads for expression analysis

期刊

BMC BIOINFORMATICS
卷 21, 期 1, 页码 -

出版社

BMC
DOI: 10.1186/s12859-020-3484-z

关键词

RNA-seq; Short read sequencing; Differential expression

资金

  1. Harvard University Division of Science

向作者/读者索取更多资源

Background Typical experimental design advice for expression analyses using RNA-seq generally assumes that single-end reads provide robust gene-level expression estimates in a cost-effective manner, and that the additional benefits obtained from paired-end sequencing are not worth the additional cost. However, in many cases (e.g., with Illumina NextSeq and NovaSeq instruments), shorter paired-end reads and longer single-end reads can be generated for the same cost, and it is not obvious which strategy should be preferred. Using publicly available data, we test whether short-paired end reads can achieve more robust expression estimates and differential expression results than single-end reads of approximately the same total number of sequenced bases. Results At both the transcript and gene levels, 2 x 40 paired-end reads unequivocally provide expression estimates that are more highly correlated with 2 x 125 than 1 x 75 reads; in nearly all cases, those correlations are also greater than for 1 x 125, despite the greater total number of sequenced bases for the latter. Across an array of metrics, differential expression tests based upon 2 x 40 consistently outperform those using 1 x 75. Conclusion Researchers seeking a cost-effective approach for gene-level expression analysis should prefer short paired-end reads over a longer single-end strategy. Short paired-end reads will also give reasonably robust expression estimates and differential expression results at the isoform level.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据