4.8 Article

High Satellite Repeat Turnover in Great Apes Studied with Short- and Long-Read Technologies

Journal

MOLECULAR BIOLOGY AND EVOLUTION
Volume 36, Issue 11, Pages 2415-2431

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/molbev/msz156

Keywords

heterochromatin; satellite repeats; long sequencing reads; great apes

Funding

  1. National Institute of General Medical Sciences of the National Institutes of Health [R01GM130691]
  2. Eberly College of Sciences at Penn State
  3. Huck Institute of Life Sciences at Penn State
  4. Institute for CyberScience at Penn State
  5. Pennsylvania Department of Health

Satellite repeats are a structural component of centromeres and telomeres, and in some instances, their divergence is known to drive speciation. Due to their highly repetitive nature, satellite sequences have been understudied and underrepresented in genome assemblies. To investigate their turnover in great apes, we studied satellite repeats of unit sizes up to 50 bp in human, chimpanzee, bonobo, gorilla, and Sumatran and Bornean orangutans, using unassembled short and long sequencing reads. The density of satellite repeats, as identified from accurate short reads (Illumina), varied greatly among great ape genomes. These were dominated by a handful of abundant repeated motifs, frequently shared among species, which formed two groups: 1) the (AATGG)n repeat (critical for heat shock response) and its derivatives; and 2) subtelomeric 32-mers involved in telomeric metabolism. Using the densities of abundant repeats, individuals could be classified into species. However, clustering did not reproduce the accepted species phylogeny, suggesting rapid repeat evolution. Several abundant repeats were enriched in males versus females; using Y chromosome assemblies or Fluorescent In Situ Hybridization, we validated their location on the Y. Finally, applying a novel computational tool, we identified many satellite repeats completely embedded within long Oxford Nanopore and Pacific Biosciences reads. Such repeats were up to 59 kb in length and consisted of perfect repeats interspersed with other similar sequences. Our results based on sequencing reads generated with three different technologies provide the first detailed characterization of great ape satellite repeats, and open new avenues for exploring their functions.
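
To make the notion of "repeat density from unassembled reads" concrete, here is a minimal Python sketch. It is not the authors' pipeline or their computational tool. The function names, the `min_copies` threshold, and the "repeat bases per kilobase of reads" unit are all assumptions made for illustration. The sketch counts only perfect tandem arrays of one given motif on the forward strand. It also shows how a motif can be reduced to a canonical form so that rotations and reverse complements (for example GGAAT and CCATT) are treated as the same repeat as AATGG.

```python
import re

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of an uppercase A/C/G/T sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def canonical_motif(motif: str) -> str:
    """Lexicographically smallest rotation of the motif or of its reverse complement."""
    candidates = []
    for s in (motif, reverse_complement(motif)):
        candidates.extend(s[i:] + s[:i] for i in range(len(s)))
    return min(candidates)


def repeat_bases(read: str, motif: str, min_copies: int = 3) -> int:
    """Bases covered by perfect tandem arrays of `motif` with at least `min_copies` units.

    Hypothetical helper: forward strand only, no mismatches allowed.
    """
    pattern = re.compile(f"(?:{re.escape(motif)}){{{min_copies},}}")
    return sum(m.end() - m.start() for m in pattern.finditer(read))


def repeat_density(reads, motif: str, min_copies: int = 3) -> float:
    """Repeat bases per kilobase of sequenced bases (an assumed unit, for illustration)."""
    total = sum(len(r) for r in reads)
    if total == 0:
        return 0.0
    covered = sum(repeat_bases(r, motif, min_copies) for r in reads)
    return 1000.0 * covered / total


if __name__ == "__main__":
    reads = ["AATGG" * 4, "ACGT" * 5]
    print(canonical_motif("CCATT"))
    print(repeat_density(reads, "AATGG"))
```

A per-individual vector of such densities, one entry per canonical motif, is the kind of input that could then be passed to a clustering method to group individuals, as the abstract describes.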

Recommended

Article Biochemical Research Methods

Family reunion via error correction: an efficient analysis of duplex sequencing data

Nicholas Stoler, Barbara Arbeithuber, Gundula Povysil, Monika Heinzl, Renato Salazar, Kateryna D. Makova, Irene Tiemann-Boege, Anton Nekrutenko

BMC BIOINFORMATICS (2020)

Article Biology

Pronounced somatic bottleneck in mitochondrial DNA of human hair

Alison Barrett, Barbara Arbeithuber, Arslan Zaidi, Peter Wilton, Ian M. Paul, Rasmus Nielsen, Kateryna D. Makova

PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES (2020)

Article Evolutionary Biology

Ampliconic Genes on the Great Ape Y Chromosomes: Rapid Evolution of Copy Number but Conservation of Expression Levels

Rahulsimham Vegesna, Marta Tomaszkiewicz, Oliver A. Ryder, Rebeca Campos-Sanchez, Paul Medvedev, Michael DeGiorgio, Kateryna D. Makova

GENOME BIOLOGY AND EVOLUTION (2020)

Article Biochemistry & Molecular Biology

Human L1 Transposition Dynamics Unraveled with Functional Data Analysis

Di Chen, Marzia A. Cremona, Zongtai Qi, Robi D. Mitra, Francesca Chiaromonte, Kateryna D. Makova

MOLECULAR BIOLOGY AND EVOLUTION (2020)

Article Biochemistry & Molecular Biology

Age-related accumulation of de novo mitochondrial mutations in mammalian oocytes and somatic tissues

Barbara Arbeithuber, James Hester, Marzia A. Cremona, Nicholas Stoler, Arslan Zaidi, Bonnie Higgins, Kate Anthony, Francesca Chiaromonte, Francisco J. Diaz, Kateryna D. Makova

PLOS BIOLOGY (2020)

Article Biochemistry & Molecular Biology

Non-B DNA: a major contributor to small- and large-scale variation in nucleotide substitution frequencies across the genome

Wilfried M. Guiblet, Marzia A. Cremona, Robert S. Harris, Di Chen, Kristin A. Eckert, Francesca Chiaromonte, Yi-Fei Huang, Kateryna D. Makova

Summary: Approximately 13% of the human genome can fold into non-canonical (non-B) DNA structures, which have been implicated in vital cellular processes. Non-B DNA hinders replication, increasing errors and facilitating mutagenesis, yet its contribution to genome-wide variation in mutation rates had remained unexplored. This study finds that non-B DNA substantially contributes to variation in substitution frequencies at small and large scales, highlighting its role in germline mutagenesis with implications for evolution and genetic diseases.

NUCLEIC ACIDS RESEARCH (2021)

Review Genetics & Heredity

Probably Correct: Rescuing Repeats with Short and Long Reads

Monika Cechova

Summary: The challenge of assembling short reads into a high-quality reference genome has been complicated by the repetitive nature of the human genome. The emergence of long reads has allowed for better characterization of difficult genomic regions and differentiation of identical sequences based on epigenetic marks. Although long reads still contain some sequencing errors, they provide new possibilities for solving the problem of multi-mapping reads.

GENES (2021)

Article Biochemistry & Molecular Biology

Discovery of an unusually high number of de novo mutations in sperm of older men using duplex sequencing

Renato Salazar, Barbara Arbeithuber, Maja Ivankovic, Monika Heinzl, Sofia Moura, Ingrid Hartl, Theresa Mair, Angelika Lahnsteiner, Thomas Ebner, Omar Shebl, Johannes Proell, Irene Tiemann-Boege

Summary: Researchers have discovered highly recurrent selfish mutations associated with congenital disorders in the male germline. Using duplex sequencing, they examined the FGFR3 coding region and found that older donors harbor more mutations associated with congenital disorders.

GENOME RESEARCH (2022)

Article Multidisciplinary Sciences

Advanced age increases frequencies of de novo mitochondrial mutations in macaque oocytes and somatic tissues

Barbara Arbeithuber, Marzia A. Cremona, James Hester, Alison Barrett, Bonnie Higgins, Kate Anthony, Francesca Chiaromonte, Francisco J. Diaz, Kateryna D. Makova

Summary: Duplex sequencing technology reveals the accumulation of mtDNA mutations in somatic tissues and germline cells of primates as they age. The frequency of these mutations significantly increases in liver and muscle tissues with age, while it stabilizes in oocytes of older animals after 9 years of age.

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (2022)

Review Cell Biology

Satellite DNAs and human sex chromosome variation

Monika Cechova, Karen H. Miga

Summary: This review focuses on the biology of satellite DNA on human X and Y chromosomes and its impact on sex chromosome aneuploidies. The findings provide insights into the prevalence and consequences of these aneuploidies.

SEMINARS IN CELL & DEVELOPMENTAL BIOLOGY (2022)

Review Genetics & Heredity

Noncanonical DNA structures are drivers of genome evolution

Kateryna D. Makova, Matthias H. Weissensteiner

Summary: In addition to the canonical right-handed double helix, non-B DNA structures can form in the genomes across the tree of life. These structures regulate cellular processes and have the potential to drive genomic and phenotypic evolution. Recent studies have established non-B DNA as novel functional elements subject to natural selection, affecting the evolution of transposable elements and centromeres. Evolutionary analyses should consider not only DNA sequence, but also its structure.

TRENDS IN GENETICS (2023)

Article Evolutionary Biology

Transcript Isoform Diversity of Ampliconic Genes on the Y Chromosome of Great Apes

Marta Tomaszkiewicz, Kristoffer Sahlin, Paul Medvedev, Kateryna D. Makova

Summary: This study decoded the transcript sequences of nine YAG families in six great ape species and found evolutionarily conserved alternative splicing patterns in most families. It revealed that BPY2 and PRY families have distinct features and that the PRY family is undergoing pseudogenization. No selection signatures were detected in the YAG families shared among great apes, but many species-specific protein-coding transcripts were identified. Consensus disorder regions were predicted, providing a resource for future studies on male infertility.

GENOME BIOLOGY AND EVOLUTION (2023)

Review Veterinary Sciences

Boosting the potential of cattle breeding using molecular biology, genetics, and bioinformatics approaches - a review

Monika Cechova, Michaela Andrlikova

Summary: Cattle, as one of the most important farm animals, have undergone intense selection and genetic testing to enhance their agricultural potential. Modern technologies such as gene editing and in vitro embryo production are being used to accelerate the breeding process for genetically superior animals, adapting to changing environments and demands.

ACTA VETERINARIA BRNO (2021)
