Journal
HUMAN MUTATION
Volume 38, Issue 11, Pages 1454-1463Publisher
WILEY
DOI: 10.1002/humu.23313
Keywords
evolutionary conservation; ExAC; functional variation; genetic tolerance; HGMD; meta-domains; pathogenicity; Pfam; protein domain homology; variant interpretation
Categories
Funding
- The Netherlands Organization for Scientific Research [916-14-043, 918-15-667]
- Radboud Institute for Molecular Life Sciences
- Radboud University Medical Center [R0002793]
Ask authors/readers for more resources
Whole exomes of patients with a genetic disorder are nowadays routinely sequenced but interpretation of the identified genetic variants remains a major challenge. The increased availability of population-based human genetic variation has given rise to measures of genetic tolerance that have been used, for example, to predict disease-causing genes in neurodevelopmental disorders. Here, we investigated whether combining variant information from homologous protein domains can improve variant interpretation. For this purpose, we developed a framework that maps population variation and known pathogenic mutations onto 2,750 meta-domains. These meta-domains consist of 30,853 homologous Pfam protein domain instances that cover 36% of all human protein coding sequences. We find that genetic tolerance is consistent across protein domain homologues, and that patterns of genetic tolerance faithfully mimic patterns of evolutionary conservation. Furthermore, for a significant fraction (68%) of the meta-domains high-frequency population variation re-occurs at the same positions across domain homologues more often than expected. In addition, we observe that the presence of pathogenic missense variants at an aligned homologous domain position is often paired with the absence of population variation and vice versa. The use of these meta-domains can improve the interpretation of genetic variation.
Authors
I am an author on this paper
Click your name to claim this paper and add it to your profile.
Reviews
Recommended
No Data Available