4.8 Article

Accelerated Estimation of Frequency Classes in Site-Heterogeneous Profile Mixture Models

期刊

MOLECULAR BIOLOGY AND EVOLUTION
卷 35, 期 5, 页码 1266-1283

出版社

OXFORD UNIV PRESS
DOI: 10.1093/molbev/msy026

关键词

mixture model; site-specific model; phylogenetics; protein models

资金

  1. Natural Sciences and Engineering Research Council of Canada
  2. Canada Research Chairs program

向作者/读者索取更多资源

As a consequence of structural and functional constraints, proteins tend to have site-specific preferences for particular amino acids. Failing to adjust for heterogeneity of frequencies over sites can lead to artifacts in phylogenetic estimation. Site-heterogeneous mixture-models have been developed to address this problem. However, due to prohibitive computational times, maximum likelihood implementations utilize fixed component frequency vectors inferred from sequences in a database that are external to the alignment under analysis. Here, we propose a composite likelihood approach to estimation of component frequencies for a mixture model that directly uses the data from the alignment of interest. In the common case that the number of taxa under study is not large, several adjustments to the default composite likelihood are shown to be necessary. In simulations, the approach is shown to provide large improvements over hierarchical clustering. For empirical data, substantial improvements in likelihoods are found over mixtures using fixed components.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据