Journal
ANNALS OF APPLIED PROBABILITY
Volume 32, Issue 6, Pages 4681-4705Publisher
INST MATHEMATICAL STATISTICS-IMS
DOI: 10.1214/22-AAP1799
Keywords
Phylogenetics; gene duplication and loss; incomplete lineage sorting; statistical con-sistency
Categories
Funding
- NSF [DMS-1614242, CCF-1740707, DMS-1902892, DMS-1916378, DMS-2023239]
- Simons Fellowship
- Vilas Associates Award
Ask authors/readers for more resources
This study investigates species tree estimation using a stochastic model that incorporates incomplete lineage sorting and gene duplication and loss. Through a probabilistic analysis, sample complexity bounds for quartet-based inference methods are derived, highlighting the impact of duplication and loss rates.
We consider species tree estimation under a standard stochastic model of gene tree evolution that incorporates incomplete lineage sorting (as mod-eled by a coalescent process) and gene duplication and loss (as modeled by a branching process). Through a probabilistic analysis of the model, we derive sample complexity bounds for widely used quartet-based inference methods that highlight the effect of the duplication and loss rates in both subcritical and supercritical regimes.
Authors
I am an author on this paper
Click your name to claim this paper and add it to your profile.
Reviews
Recommended
No Data Available