4.7 Article

Improved RNA secondary structure and tertiary base-pairing prediction using evolutionary profile, mutational coupling and two-dimensional transfer learning

期刊

BIOINFORMATICS
卷 37, 期 17, 页码 2589-2600

出版社

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btab165

关键词

-

资金

  1. Australia Research Council [DP180102060, DP210101875]

向作者/读者索取更多资源

Recent advancements in computational prediction of RNA secondary structure through deep learning and transfer learning have significantly improved accuracy in canonical base-pairing structures as well as tertiary interactions such as pseudoknots and non-canonical base pairs. The new method shows high accuracy for RNAs with over 1000 homologous sequences and can also improve base-pairing prediction by incorporating artificial but functional homologous sequences generated from deep mutational scanning. This fully automatic method provides a powerful tool for capturing both secondary and tertiary base-pairing information for building three-dimensional models.
Motivation: The recent discovery of numerous non-coding RNAs (long non-coding RNAs, in particular) has transformed our perception about the roles of RNAs in living organisms. Our ability to understand them, however, is hampered by our inability to solve their secondary and tertiary structures in high resolution efficiently by existing experimental techniques. Computational prediction of RNA secondary structure, on the other hand, has received much-needed improvement, recently, through deep learning of a large approximate data, followed by transfer learning with gold-standard base-pairing structures from high-resolution 3-D structures. Here, we expand this single-sequence-based learning to the use of evolutionary profiles and mutational coupling. Results: The new method allows large improvement not only in canonical base-pairs (RNA secondary structures) but more so in base-pairing associated with tertiary interactions such as pseudoknots, non-canonical and lone base-pairs. In particular, it is highly accurate for those RNAs of more than 1000 homologous sequences by achieving >0.8 F1-score (harmonic mean of sensitivity and precision) for 14/16 RNAs tested. The method can also significantly improve base-pairing prediction by incorporating artificial but functional homologous sequences generated from deep mutational scanning without any modification. The fully automatic method (publicly available as server and standalone software) should provide the scientific community a new powerful tool to capture not only the secondary structure but also tertiary base-pairing information for building three-dimensional models. It also highlights the future of accurately solving the base-pairing structure by using a large number of natural and/or artificial homologous sequences.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据